Search Posts

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 04/04/2025, 06:55:20 UTC

Quote from: nomachine on Today at 06:38:40 AM

Quote from: Akito S. M. Hosana on Today at 06:27:55 AM

maybe they're ghostin' us Roll Eyes

Probably busy updating their 'ignore list'—welcome to it. Grin

I am in the kitchen with "something.cu" Roll Eyes

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 03/04/2025, 14:39:04 UTC

Quote from: Geshma on Today at 02:21:05 PM

change the base key to 2A32ED54F2B4E35EE (Dec)

Thanks, My bad Roll Eyes

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 03/04/2025, 14:01:33 UTC

Quote from: nomachine on Today at 10:38:07 AM

Quote from: AlanJohnson on Today at 08:41:11 AM

Quote from: Akito S. M. Hosana on Today at 06:00:35 AM

Quote from: Bram24732 on Today at 05:26:30 AM

You might want to check the date of the post

You don't have to check the calendar for this topic at all. Almost every day feels like April Fools' Day.

I wonder how the 67 was solved. Was it someone very lucky or with access to insane computing power.

Maybe some kind of botnet cracking keys...

=======================================
== Mutagen Puzzle Solver by Denevron ==
=======================================
Starting puzzle: 66 (66-bit)
Target HASH160: 20d45a6a76...94335db8a5
Base Key: 0x2A32ED54F2B4E35EE
Flip count: 6 (override, default was 35)
Total Flips: 90858768
Using: 12 threads
Progress: 11.006092%
Processed: 10000000
Speed: 33.49 Mkeys/s
Elapsed Time: 00:00:02
=======================================
=========== SOLUTION FOUND ============
=======================================
Private key: 0x2832ED74F2B5E35EE
Checked 19165236 combinations
Bit flips: 6
Time: 4.68 seconds (00:00:04)
Speed: 20.20 Mkeys/s
Solution saved to puzzle_66_solution.txt

At least I know that if you have the correct base key for the right bit flip, Mutagen is proven to work—from Puzzle 66 to Puzzle 128. But you have to guess exactly which one it is. Grin

./mutagen -p 66 -t 12 -f 6
=======================================
== Mutagen Puzzle Solver by Denevron ==
=======================================
Starting puzzle: 66 (66-bit)
Target HASH160: 20d45a6a76...94335db8a5
Base Key: 0x1A838B13505B26867
Flip count: 6 (override, default was 35)
Total Flips: 90858768
Using: 12 threads
Progress: 100.000000%
Processed: 100000000
Speed: 42.03 Mkeys/s
Elapsed Time: 00:00:19

No solution found. Checked 100000709 combinations
Time: 19.99 seconds (00:00:19)
Speed: 42.03 Mkeys/s

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 03/04/2025, 00:32:15 UTC

Quote from: nomachine on April 02, 2025, 11:14:10 PM

Quote from: Akito S. M. Hosana on April 02, 2025, 11:02:01 PM

Quote from: MrHorse on April 02, 2025, 10:14:39 PM

It is working. I tried with -t {1..24} and observed

Code:

=== FOUND MATCH! ===

each time.

Imagine how fast this would be if he weren’t fishing. He doesn’t care. This is just entertainment for him when he’s full of money. Tongue

Your opinion on trust or my personal finances is irrelevant. The script's performance speaks for itself. Otherwise, unsolicited speculation about my motives or resources is just noise. . Contribute code or move on. Angry

Don't waste your time on people like him.

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 02/04/2025, 20:08:14 UTC

Quote from: nomachine on Today at 06:42:19 AM

New version uses AVX2 for fast partial hash comparisons

https://github.com/NoMachine1/Mutagen/blob/main/mutagen.cpp

First checks 4 bytes using AVX2

Quote

for (int j = 0; j < HASH_BATCH_SIZE; j++) {
// Load candidate hash
__m256i cand = _mm256_loadu_si256(reinterpret_cast<const __m256i*>(localHashResults[j]));

// Compare first 4 bytes using AVX2
__m256i cmp = _mm256_cmpeq_epi8(cand, target16);
int mask = _mm256_movemask_epi8(cmp);

if ((mask & 0x0F) == 0x0F) { // First 4 bytes match
// Full comparison
bool fullMatch = true;
for (int k = 0; k < 20; k++) {
if (localHashResults[j][k] != TARGET_HASH160_RAW[k]) {
fullMatch = false;
break;
}
}

if (fullMatch) {

Only does full comparison if first 4 bytes match.

I think I'll go fishing now. Wink

faster

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 23:05:42 UTC

Quote from: bibilgin on Today at 10:23:37 PM

Quote from: b0dre on Today at 10:18:05 PM

Yes, in CPU the speed blow up, Good Luck you too

In the meantime, I would like to post an article here. Maybe a few of our friends will read it.

https://www.quantamagazine.org/undergraduate-upends-a-40-year-old-data-science-conjecture-20250210/

Cool

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 22:18:05 UTC

Quote from: bibilgin on Today at 10:11:45 PM

Quote from: b0dre on Today at 08:49:50 PM

Welcome back Grin

Thanks man.

I read some of the pages back. I saw some good developments. Thanks to everyone who contributed.

I hope someone who really needs it finds it. Good Luck.

Yes, in CPU the speed blow up, Good Luck you too

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 22:02:41 UTC

Quote from: user_not_here on Today at 10:01:17 PM

Quote from: b0dre on Today at 09:59:07 PM

Quote from: user_not_here on Today at 09:57:02 PM

windows compiles ok..

Macos

ok, out of my skills

first compiled?

Yes and works!

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 22:01:46 UTC

Quote from: user_not_here on Today at 09:57:02 PM

windows compiles ok..

i dont get it, that part not changed

I will double check, I am using macOS Roll Eyes

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 21:59:07 UTC

Quote from: user_not_here on Today at 09:57:02 PM

windows compiles ok..

Macos

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 21:55:12 UTC

Quote from: user_not_here on Today at 09:52:12 PM

Quote

Error compiling!

Code:

#include <iostream>
#include <array>
#include <iomanip>
#include <fstream>
#include <sstream>
#include <vector>
#include <thread>
#include <atomic>
#include <chrono>
#include <queue>
#include <mutex>
#include <cstring>
#include <unordered_map>
#include <cmath>
#include <immintrin.h>
#include <omp.h>
#include <csignal>
#include <random>
#include <algorithm>
#include <getopt.h>

#ifdef _WIN32
#include <windows.h>
#else
#include <unistd.h>
#endif

// Include the required headers
#include "sha256_avx2.h"
#include "ripemd160_avx2.h"
#include "SECP256K1.h"
#include "Point.h"
#include "Int.h"
#include "IntGroup.h"

using namespace std;

// Cross-platform terminal functions
void initConsole() {
#ifdef _WIN32
HANDLE hConsole = GetStdHandle(STD_OUTPUT_HANDLE);
DWORD mode = 0;
GetConsoleMode(hConsole, &mode);
SetConsoleMode(hConsole, mode | ENABLE_VIRTUAL_TERMINAL_PROCESSING);
#endif
}

void clearTerminal() {
#ifdef _WIN32
HANDLE hStdOut = GetStdHandle(STD_OUTPUT_HANDLE);
COORD coord = {0, 0};
DWORD count;
CONSOLE_SCREEN_BUFFER_INFO csbi;
GetConsoleScreenBufferInfo(hStdOut, &csbi);
FillConsoleOutputCharacter(hStdOut, ' ', csbi.dwSize.X * csbi.dwSize.Y, coord, &count);
SetConsoleCursorPosition(hStdOut, coord);
#else
std::cout << "\033[2J\033[H";
#endif
std::cout.flush();
}

void moveCursorTo(int x, int y) {
#ifdef _WIN32
HANDLE hStdOut = GetStdHandle(STD_OUTPUT_HANDLE);
COORD coord = {(SHORT)x, (SHORT)y};
SetConsoleCursorPosition(hStdOut, coord);
#else
std::cout << "\033[" << y << ";" << x << "H";
#endif
std::cout.flush();
}

// Configuration defaults
int PUZZLE_NUM = 20;
int WORKERS = max(1, (int)thread::hardware_concurrency());
int FLIP_COUNT = -1;
const __uint128_t REPORT_INTERVAL = 10000000;
static constexpr int POINTS_BATCH_SIZE = 512;
static constexpr int HASH_BATCH_SIZE = 8;

// Historical puzzle data
const unordered_map<int, tuple<int, string, string>> PUZZLE_DATA = {
{20, {8, "b907c3a2a3b27789dfb509b730dd47703c272868", "357535"}},
{21, {9, "29a78213caa9eea824acf08022ab9dfc83414f56", "863317"}},
{22, {11, "7ff45303774ef7a52fffd8011981034b258cb86b", "1811764"}},
{23, {12, "d0a79df189fe1ad5c306cc70497b358415da579e", "3007503"}},
{24, {9, "0959e80121f36aea13b3bad361c15dac26189e2f", "5598802"}},
{25, {12, "2f396b29b27324300d0c59b17c3abc1835bd3dbb", "14428676"}},
{26, {14, "bfebb73562d4541b32a02ba664d140b5a574792f", "33185509"}},
{27, {13, "0c7aaf6caa7e5424b63d317f0f8f1f9fa40d5560", "54538862"}},
{28, {16, "1306b9e4ff56513a476841bac7ba48d69516b1da", "111949941"}},
{29, {18, "5a416cc9148f4a377b672c8ae5d3287adaafadec", "227634408"}},
{30, {16, "d39c4704664e1deb76c9331e637564c257d68a08", "400708894"}},
{31, {13, "d805f6f251f7479ebd853b3d0f4b9b2656d92f1d", "1033162084"}},
{32, {14, "9e42601eeaedc244e15f17375adb0e2cd08efdc9", "2102388551"}},
{33, {15, "4e15e5189752d1eaf444dfd6bff399feb0443977", "3093472814"}},
{34, {16, "f6d67d7983bf70450f295c9cb828daab265f1bfa", "7137437912"}},
{35, {19, "f6d8ce225ffbdecec170f8298c3fc28ae686df25", "14133072157"}},
{36, {14, "74b1e012be1521e5d8d75e745a26ced845ea3d37", "20112871792"}},
{37, {23, "28c30fb11ed1da72e7c4f89c0164756e8a021d", "42387769980"}},
{38, {21, "b190e2d40cfdeee2cee072954a2be89e7ba39364", "100251560595"}},
{39, {23, "0b304f2a79a027270276533fe1ed4eff30910876", "146971536592"}},
{40, {20, "95a156cd21b4a69de969eb6716864f4c8b82a82a", "323724968937"}},
{41, {25, "d1562eb37357f9e6fc41cb2359f4d3eda4032329", "1003651412950"}},
{42, {24, "8efb85f9c5b5db2d55973a04128dc7510075ae23", "1458252205147"}},
{43, {19, "f92044c7924e5525c61207972c253c9fc9f086f7", "2895374552463"}},
{44, {24, "80df54e1f612f2fc5bdc05c9d21a83aa8d20791e", "7409811047825"}},
{45, {21, "f0225bfc68a6e17e87cd8b5e60ae3be18f120753", "15404761757071"}},
{46, {24, "9a012260d01c5113df66c8a8438c9f7a1e3d5dac", "19996463086597"}},
{47, {27, "f828005d41b0f4fed4c8dca3b06011072cfb07d4", "51408670348612"}},
{48, {21, "8661cb56d9df0a61f01328b55af7e56a3fe7a2b2", "119666659114170"}},
{49, {30, "0d2f533966c6578e1111978ca698f8add7fffdf3", "191206974700443"}},
{50, {29, "de081b76f840e462fa2cdf360173dfaf4a976a47", "409118905032525"}},
{51, {25, "ef6419cffd7fad7027994354eb8efae223c2dbe7", "611140496167764"}},
{52, {27, "36af659edbe94453f6344e920d143f1778653ae7", "2058769515153876"}},
{53, {26, "2f4870ef54fa4b048c1365d42594cc7d3d269551", "4216495639600700"}},
{54, {30, "cb66763cf7fde659869ae7f06884d9a0f879a092", "6763683971478124"}},
{55, {31, "db53d9bbd1f3a83b094eeca7dd970bd85b492fa2", "9974455244496707"}},
{56, {31, "48214c5969ae9f43f75070cea1e2cb41d5bdcccd", "30045390491869460"}},
{57, {33, "328660ef43f66abe2653fa178452a5dfc594c2a1", "44218742292676575"}},
{58, {28, "8c2a6071f89c90c4dab5ab295d7729d1b54ea60f", "138245758910846492"}},
{59, {30, "b14ed3146f5b2c9bde1703deae9ef33af8110210", "199976667976342049"}},
{60, {31, "cdf8e5c7503a9d22642e3ecfc87817672787b9c5", "525070384258266191"}},
{61, {25, "68133e19b2dfb9034edf9830a200cfdf38c90cbd", "1135041350219496382"}},
{62, {35, "e26646db84b0602f32b34b5a62ca3cae1f91b779", "1425787542618654982"}},
{63, {34, "ef58afb697b094423ce90721fbb19a359ef7c50e", "3908372542507822062"}},
{64, {34, "3ee4133d991f52fdf6a25c9834e0745ac74248a4", "8993229949524469768"}},
{65, {37, "52e763a7ddc1aa4fa811578c491c1bc7fd570137", "17799667357578236628"}},
{66, {35, "20d45a6a762535700ce9e0b216e31994335db8a5", "30568377312064202855"}},
{67, {31, "739437bb3dd6d1983e66629c5f08c70e52769371", "46346217550346335726"}},
{68, {34, "e0b8a2baee1b77fc703455f39d51477451fc8cfc", "132656943602386256302"}}
};

// Global variables
vector<unsigned char> TARGET_HASH160_RAW(20);
string TARGET_HASH160;
Int BASE_KEY;
atomic<bool> stop_event(false);
mutex result_mutex;
queue<tuple<string, __uint128_t, int>> results;

// 128-bit counter
atomic<uint64_t> total_checked_high(0);
atomic<uint64_t> total_checked_low(0);
__uint128_t total_combinations = 0;
vector<string> g_threadPrivateKeys;
mutex progress_mutex;

// Performance tracking
atomic<uint64_t> globalComparedCount(0);
atomic<uint64_t> localComparedCount(0);
double globalElapsedTime = 0.0;
double mkeysPerSec = 0.0;
chrono::time_point<chrono::high_resolution_clock> tStart;

// Helper functions
__uint128_t load_128() {
return (static_cast<__uint128_t>(total_checked_high.load()) << 64) | total_checked_low.load();
}

void increment_128() {
uint64_t old_low = total_checked_low.fetch_add(1);
if (old_low == UINT64_MAX) {
total_checked_high.fetch_add(1);
}
}

static string formatElapsedTime(double seconds) {
int hrs = static_cast<int>(seconds) / 3600;
int mins = (static_cast<int>(seconds) % 3600) / 60;
int secs = static_cast<int>(seconds) % 60;
ostringstream oss;
oss << setw(2) << setfill('0') << hrs << ":"
<< setw(2) << setfill('0') << mins << ":"
<< setw(2) << setfill('0') << secs;
return oss.str();
}

static string to_string_128(__uint128_t value) {
if (value == 0) return "0";
char buffer[50];
char* p = buffer + sizeof(buffer);
*--p = '\0';
while (value != 0) {
*--p = "0123456789"[value % 10];
value /= 10;
}
return p;
}

void signalHandler(int signum) {
stop_event.store(true);
cout << "\nInterrupt received, shutting down...\n";
}

class CombinationGenerator {
int n, k;
vector<int> current;
mt19937_64 rng;

public:
CombinationGenerator(int n, int k, uint64_t seed = 0) : n(n), k(k), current(k), rng(seed) {
if (k > n) k = n;
for (int i = 0; i < k; ++i) current[i] = i;
}

static __uint128_t combinations_count(int n, int k) {
if (k > n) return 0;
if (k * 2 > n) k = n - k;
if (k == 0) return 1;

__uint128_t result = n;
for(int i = 2; i <= k; ++i) {
result *= (n - i + 1);
result /= i;
}
return result;
}

const vector<int>& get() const { return current; }

bool next() {
int i = k - 1;
while (i >= 0 && current[i] == n - k + i) --i;
if (i < 0) return false;
++current[i];
for (int j = i + 1; j < k; ++j)
current[j] = current[j-1] + 1;
return true;
}

void unrank(__uint128_t rank) {
__uint128_t total = combinations_count(n, k);
if (rank >= total) {
current.clear();
return;
}

current.resize(k);
__uint128_t remaining_rank = rank;
int a = n;
int b = k;
__uint128_t x = (total - 1) - rank;

for (int i = 0; i < k; i++) {
a = largest_a_where_comb_a_b_le_x(a, b, x);
current[i] = (n - 1) - a;
x -= combinations_count(a, b);
b--;
}
}

private:
int largest_a_where_comb_a_b_le_x(int a, int b, __uint128_t x) const {
while (a >= b && combinations_count(a, b) > x) a--;
return a;
}
};

static void computeHash160BatchBinSingle(int numKeys,
uint8_t pubKeys[][33],
uint8_t hashResults[][20])
{
alignas(32) array<array<uint8_t, 64>, HASH_BATCH_SIZE> shaInputs;
alignas(32) array<array<uint8_t, 32>, HASH_BATCH_SIZE> shaOutputs;
alignas(32) array<array<uint8_t, 64>, HASH_BATCH_SIZE> ripemdInputs;
alignas(32) array<array<uint8_t, 20>, HASH_BATCH_SIZE> ripemdOutputs;

const __uint128_t totalBatches = (numKeys + (HASH_BATCH_SIZE - 1)) / HASH_BATCH_SIZE;

for (__uint128_t batch = 0; batch < totalBatches; batch++) {
const __uint128_t batchCount = min<__uint128_t>(HASH_BATCH_SIZE, numKeys - batch * HASH_BATCH_SIZE);

// Prepare SHA-256 input blocks
for (__uint128_t i = 0; i < batchCount; i++) {
memset(shaInputs[i].data(), 0, 64);
memcpy(shaInputs[i].data(), pubKeys[batch * HASH_BATCH_SIZE + i], 33);
shaInputs[i][33] = 0x80;
*reinterpret_cast<uint32_t*>(&shaInputs[i][60]) = _byteswap_ulong(33 * 8);
}

if (batchCount < HASH_BATCH_SIZE) {
static array<uint8_t, 64> shaPadding = {};
memset(shaPadding.data(), 0, 64);
memcpy(shaPadding.data(), pubKeys[0], 33);
shaPadding[33] = 0x80;
*reinterpret_cast<uint32_t*>(&shaPadding[60]) = _byteswap_ulong(33 * 8);
for (__uint128_t i = batchCount; i < HASH_BATCH_SIZE; i++) {
memcpy(shaInputs[i].data(), shaPadding.data(), 64);
}
}

uint8_t* inPtr[HASH_BATCH_SIZE];
uint8_t* outPtr[HASH_BATCH_SIZE];
for (int i = 0; i < HASH_BATCH_SIZE; i++) {
inPtr[i] = shaInputs[i].data();
outPtr[i] = shaOutputs[i].data();
}

sha256avx2_8B(inPtr[0], inPtr[1], inPtr[2], inPtr[3],
inPtr[4], inPtr[5], inPtr[6], inPtr[7],
outPtr[0], outPtr[1], outPtr[2], outPtr[3],
outPtr[4], outPtr[5], outPtr[6], outPtr[7]);

for (__uint128_t i = 0; i < batchCount; i++) {
memset(ripemdInputs[i].data(), 0, 64);
memcpy(ripemdInputs[i].data(), shaOutputs[i].data(), 32);
ripemdInputs[i][32] = 0x80;
*reinterpret_cast<uint32_t*>(&ripemdInputs[i][60]) = _byteswap_ulong(256);
}

if (batchCount < HASH_BATCH_SIZE) {
static array<uint8_t, 64> ripemdPadding = {};
memset(ripemdPadding.data(), 0, 64);
memcpy(ripemdPadding.data(), shaOutputs[0].data(), 32);
ripemdPadding[32] = 0x80;
*reinterpret_cast<uint32_t*>(&ripemdPadding[60]) = _byteswap_ulong(256);
for (__uint128_t i = batchCount; i < HASH_BATCH_SIZE; i++) {
memcpy(ripemdInputs[i].data(), ripemdPadding.data(), 64);
}
}

for (int i = 0; i < HASH_BATCH_SIZE; i++) {
inPtr[i] = ripemdInputs[i].data();
outPtr[i] = ripemdOutputs[i].data();
}

ripemd160avx2::ripemd160avx2_32(
inPtr[0], inPtr[1], inPtr[2], inPtr[3],
inPtr[4], inPtr[5], inPtr[6], inPtr[7],
outPtr[0], outPtr[1], outPtr[2], outPtr[3],
outPtr[4], outPtr[5], outPtr[6], outPtr[7]
);

for (__uint128_t i = 0; i < batchCount; i++) {
memcpy(hashResults[batch * HASH_BATCH_SIZE + i], ripemdOutputs[i].data(), 20);
}
}
}

void worker(Secp256K1* secp, int bit_length, int flip_count, int threadId, __uint128_t start, __uint128_t end) {
const int fullBatchSize = 2 * POINTS_BATCH_SIZE;
uint8_t localPubKeys[HASH_BATCH_SIZE][33];
uint8_t localHashResults[HASH_BATCH_SIZE][20];
int pointIndices[HASH_BATCH_SIZE];

// Precompute points
vector<Point> plusPoints(POINTS_BATCH_SIZE);
vector<Point> minusPoints(POINTS_BATCH_SIZE);
for (int i = 0; i < POINTS_BATCH_SIZE; i++) {
Int tmp; tmp.SetInt32(i);
plusPoints[i] = secp->ComputePublicKey(&tmp);
minusPoints[i] = plusPoints[i];
minusPoints[i].y.ModNeg();
}

// Structure of Arrays
vector<Int> deltaX(POINTS_BATCH_SIZE);
IntGroup modGroup(POINTS_BATCH_SIZE);
vector<Int> pointBatchX(fullBatchSize);
vector<Int> pointBatchY(fullBatchSize);

// Random number generation
random_device rd;
mt19937_64 rng(rd() + threadId);
uniform_int_distribution<uint64_t> dist(0, UINT64_MAX);

CombinationGenerator gen(bit_length, flip_count, rd() + threadId);
gen.unrank(start);

for (__uint128_t count = start; !stop_event.load() && count < end; ) {
Int currentKey;
currentKey.Set(&BASE_KEY);

const vector<int>& flips = gen.get();

// Apply flips
for (int pos : flips) {
Int mask; mask.SetInt32(1); mask.ShiftL(pos);
Int temp; temp.Set(&currentKey); temp.ShiftR(pos);
temp.IsEven() ? currentKey.Add(&mask) : currentKey.Sub(&mask);
}

// Store private key
string keyStr = currentKey.GetBase16();
keyStr = string(64 - keyStr.length(), '0') + keyStr;
#pragma omp critical
g_threadPrivateKeys[threadId] = keyStr;

// Compute public key
Point startPoint = secp->ComputePublicKey(&currentKey);
Int startPointX = startPoint.x, startPointY = startPoint.y, startPointXNeg = startPointX;
startPointXNeg.ModNeg();

// Compute deltaX values
for (int i = 0; i < POINTS_BATCH_SIZE; i += 4) {
deltaX[i].ModSub(&plusPoints[i].x, &startPointX);
deltaX[i+1].ModSub(&plusPoints[i+1].x, &startPointX);
deltaX[i+2].ModSub(&plusPoints[i+2].x, &startPointX);
deltaX[i+3].ModSub(&plusPoints[i+3].x, &startPointX);
}
modGroup.Set(deltaX.data());
modGroup.ModInv();

// Process points
for (int i = 0; i < POINTS_BATCH_SIZE; i += 4) {
for (int j = 0; j < 4; j++) {
// Plus points
Int deltaY; deltaY.ModSub(&plusPoints[i+j].y, &startPointY);
Int slope; slope.ModMulK1(&deltaY, &deltaX[i+j]);
Int slopeSq; slopeSq.ModSquareK1(&slope);

pointBatchX[i+j].Set(&startPointXNeg);
pointBatchX[i+j].ModAdd(&slopeSq);
pointBatchX[i+j].ModSub(&plusPoints[i+j].x);

Int diffX; diffX.ModSub(&startPointX, &pointBatchX[i+j]);
diffX.ModMulK1(&slope);

pointBatchY[i+j].Set(&startPointY);
pointBatchY[i+j].ModNeg();
pointBatchY[i+j].ModAdd(&diffX);

// Minus points
deltaY.ModSub(&minusPoints[i+j].y, &startPointY);
slope.ModMulK1(&deltaY, &deltaX[i+j]);
slopeSq.ModSquareK1(&slope);

pointBatchX[POINTS_BATCH_SIZE+i+j].Set(&startPointXNeg);
pointBatchX[POINTS_BATCH_SIZE+i+j].ModAdd(&slopeSq);
pointBatchX[POINTS_BATCH_SIZE+i+j].ModSub(&minusPoints[i+j].x);

diffX.ModSub(&startPointX, &pointBatchX[POINTS_BATCH_SIZE+i+j]);
diffX.ModMulK1(&slope);

pointBatchY[POINTS_BATCH_SIZE+i+j].Set(&startPointY);
pointBatchY[POINTS_BATCH_SIZE+i+j].ModNeg();
pointBatchY[POINTS_BATCH_SIZE+i+j].ModAdd(&diffX);
}
}

// Process keys in batches
int localBatchCount = 0;
for (int i = 0; i < fullBatchSize && localBatchCount < HASH_BATCH_SIZE; i++) {
Point tempPoint;
tempPoint.x.Set(&pointBatchX[i]);
tempPoint.y.Set(&pointBatchY[i]);

// Convert to compressed public key
localPubKeys[localBatchCount][0] = tempPoint.y.IsEven() ? 0x02 : 0x03;
for (int j = 0; j < 32; j++) {
localPubKeys[localBatchCount][1 + j] = pointBatchX[i].GetByte(31 - j);
}
pointIndices[localBatchCount] = i;
localBatchCount++;

if (localBatchCount == HASH_BATCH_SIZE) {
computeHash160BatchBinSingle(localBatchCount, localPubKeys, localHashResults);
localComparedCount += HASH_BATCH_SIZE;

__m256i target = _mm256_loadu_si256(reinterpret_cast<const __m256i*>(TARGET_HASH160_RAW.data()));

for (int j = 0; j < HASH_BATCH_SIZE; j++) {
__m256i result = _mm256_loadu_si256(reinterpret_cast<const __m256i*>(localHashResults[j]));

int mask = _mm256_movemask_epi8(_mm256_cmpeq_epi8(result, target));
const int HASH160_MASK = (1 << 20) - 1;

if ((mask & HASH160_MASK) == HASH160_MASK) {
bool fullMatch = true;
for (int k = 0; k < 20; k++) {
if (localHashResults[j][k] != TARGET_HASH160_RAW[k]) {
fullMatch = false;
break;
}
}

if (fullMatch) {
auto tEndTime = chrono::high_resolution_clock::now();
globalElapsedTime = chrono::duration<double>(tEndTime - tStart).count();
mkeysPerSec = (double)(globalComparedCount + localComparedCount) / globalElapsedTime / 1e6;

Int foundKey; foundKey.Set(&currentKey);
int idx = pointIndices[j];
if (idx < POINTS_BATCH_SIZE) {
Int offset; offset.SetInt32(idx);
foundKey.Add(&offset);
} else {
Int offset; offset.SetInt32(idx - POINTS_BATCH_SIZE);
foundKey.Sub(&offset);
}

string hexKey = foundKey.GetBase16();
hexKey = string(64 - hexKey.length(), '0') + hexKey;

lock_guard<mutex> lock(result_mutex);
results.push(make_tuple(hexKey, load_128(), flip_count));
stop_event.store(true);
return;
}
}
}

increment_128();
localBatchCount = 0;

// Progress reporting
__uint128_t current_total = load_128();
if (current_total % REPORT_INTERVAL == 0 || count == end - 1) {
auto now = chrono::high_resolution_clock::now();
globalElapsedTime = chrono::duration<double>(now - tStart).count();

globalComparedCount += localComparedCount;
localComparedCount = 0;
mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

double progress = min(100.0, (double)current_total / total_combinations * 100.0);

lock_guard<mutex> lock(progress_mutex);
moveCursorTo(0, 9);
cout << "Progress: " << fixed << setprecision(6) << progress << "%\n";
cout << "Processed: " << to_string_128(current_total) << "\n";
cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";
cout << "Elapsed Time: " << formatElapsedTime(globalElapsedTime) << "\n";
cout.flush();

if (current_total >= total_combinations) {
stop_event.store(true);
break;
}
}
}
}

if (!gen.next()) break;
count++;
}

if (!stop_event.load() && load_128() >= total_combinations) {
stop_event.store(true);
}
}

void printUsage(const char* programName) {
cout << "Usage: " << programName << " [options]\n";
cout << "Options:\n";
cout << " -p, --puzzle NUM Puzzle number to solve (default: 20)\n";
cout << " -t, --threads NUM Number of CPU cores to use (default: all)\n";
cout << " -f, --flips NUM Override default flip count for puzzle\n";
cout << " -h, --help Show this help message\n";
cout << "\nExample:\n";
cout << " " << programName << " -p 38 -t 8 -f 21\n";
}

int main(int argc, char* argv[]) {
signal(SIGINT, signalHandler);

// Parse command line arguments
int opt;
int option_index = 0;
static struct option long_options[] = {
{"puzzle", required_argument, 0, 'p'},
{"threads", required_argument, 0, 't'},
{"flips", required_argument, 0, 'f'},
{"help", no_argument, 0, 'h'},
{0, 0, 0, 0}
};

while ((opt = getopt_long(argc, argv, "p:t:f:h", long_options, &option_index)) != -1) {
switch (opt) {
case 'p':
PUZZLE_NUM = atoi(optarg);
if (PUZZLE_NUM < 20 || PUZZLE_NUM > 68) {
cerr << "Error: Puzzle number must be between 20 and 68\n";
return 1;
}
break;
case 't':
WORKERS = atoi(optarg);
if (WORKERS < 1) {
cerr << "Error: Thread count must be at least 1\n";
return 1;
}
break;
case 'f':
FLIP_COUNT = atoi(optarg);
if (FLIP_COUNT < 1) {
cerr << "Error: Flip count must be at least 1\n";
return 1;
}
break;
case 'h':
printUsage(argv[0]);
return 0;
default:
printUsage(argv[0]);
return 1;
}
}

tStart = chrono::high_resolution_clock::now();

Secp256K1 secp;
secp.Init();

auto puzzle_it = PUZZLE_DATA.find(PUZZLE_NUM);
if (puzzle_it == PUZZLE_DATA.end()) {
cerr << "Error: Invalid puzzle number\n";
return 1;
}

auto [DEFAULT_FLIP_COUNT, TARGET_HASH160_HEX, PRIVATE_KEY_DECIMAL] = puzzle_it->second;

if (FLIP_COUNT == -1) FLIP_COUNT = DEFAULT_FLIP_COUNT;
TARGET_HASH160 = TARGET_HASH160_HEX;

// Convert target hash to bytes
for (__uint128_t i = 0; i < 20; i++) {
TARGET_HASH160_RAW[i] = stoul(TARGET_HASH160.substr(i * 2, 2), nullptr, 16);
}

// Set base key
BASE_KEY.SetBase10(const_cast<char*>(PRIVATE_KEY_DECIMAL.c_str()));

// Verify base key
Int testKey;
testKey.SetBase10(const_cast<char*>(PRIVATE_KEY_DECIMAL.c_str()));
if (!testKey.IsEqual(&BASE_KEY)) {
cerr << "Base key initialization failed!\n";
return 1;
}

if (BASE_KEY.GetBitLength() > PUZZLE_NUM) {
cerr << "Base key exceeds puzzle bit length!\n";
return 1;
}

// Calculate total combinations
total_combinations = CombinationGenerator::combinations_count(PUZZLE_NUM, FLIP_COUNT);

// Format base key for display
string paddedKey = BASE_KEY.GetBase16();
size_t firstNonZero = paddedKey.find_first_not_of('0');
paddedKey = paddedKey.substr(firstNonZero);
paddedKey = "0x" + paddedKey;

// Print initial header
clearTerminal();
cout << "=======================================\n";
cout << "== Mutagen Puzzle Solver by Denevron ==\n";
cout << "=======================================\n";
cout << "Starting puzzle: " << PUZZLE_NUM << " (" << PUZZLE_NUM << "-bit)\n";
cout << "Target HASH160: " << TARGET_HASH160.substr(0, 10) << "..." << TARGET_HASH160.substr(TARGET_HASH160.length()-10) << "\n";
cout << "Base Key: " << paddedKey << "\n";
cout << "Flip count: " << FLIP_COUNT << " ";
if (FLIP_COUNT != DEFAULT_FLIP_COUNT) {
cout << "(override, default was " << DEFAULT_FLIP_COUNT << ")";
}
cout << "\n";
cout << "Total combinations for C(" << PUZZLE_NUM << "," << FLIP_COUNT << "): " << to_string_128(total_combinations) << "\n";
cout << "Using: " << WORKERS << " threads\n";
// Print empty lines for progress display
cout << "Progress: 0.000000%\n";
cout << "Processed: 0\n";
cout << "Speed: 0.00 Mkeys/s\n";
cout << "Elapsed Time: 00:00:00\n";

g_threadPrivateKeys.resize(WORKERS, "0");
vector<thread> threads;

// Distribute work
__uint128_t comb_per_thread = total_combinations / WORKERS;
__uint128_t remainder = total_combinations % WORKERS;

for (int i = 0; i < WORKERS; i++) {
__uint128_t start = i * comb_per_thread + min((__uint128_t)i, remainder);
__uint128_t end = start + comb_per_thread + (i < remainder ? 1 : 0);
threads.emplace_back(worker, &secp, PUZZLE_NUM, FLIP_COUNT, i, start, end);
}

for (auto& t : threads) {
if (t.joinable()) t.join();
}

if (!results.empty()) {
auto [hex_key, checked, flips] = results.front();
globalElapsedTime = chrono::duration<double>(chrono::high_resolution_clock::now() - tStart).count();
mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

string compactHex = hex_key;
size_t firstNonZero = compactHex.find_first_not_of('0');
compactHex = "0x" + compactHex.substr(firstNonZero);

cout << "=======================================\n";
cout << "=========== SOLUTION FOUND ============\n";
cout << "=======================================\n";
cout << "Private key: " << compactHex << "\n";
cout << "Checked " << to_string_128(checked) << " combinations\n";
cout << "Bit flips: " << flips << endl;
cout << "Time: " << fixed << setprecision(2) << globalElapsedTime << " seconds ("
<< formatElapsedTime(globalElapsedTime) << ")\n";
cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";

// Save solution
ofstream out("puzzle_" + to_string(PUZZLE_NUM) + "_solution.txt");
if (out) {
out << hex_key;
out.close();
cout << "Solution saved to puzzle_" << PUZZLE_NUM << "_solution.txt\n";
} else {
cerr << "Failed to save solution to file!\n";
}
} else {
__uint128_t final_count = load_128();
globalElapsedTime = chrono::duration<double>(chrono::high_resolution_clock::now() - tStart).count();
mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

cout << "\n\nNo solution found. Checked " << to_string_128(load_128()) << " combinations\n";
cout << "Time: " << fixed << setprecision(2) << globalElapsedTime << " seconds ("
<< formatElapsedTime(globalElapsedTime) << ")\n";
cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";
}

return 0;
}

test again

Code:

mutagen.cpp: In function 'void computeHash160BatchBinSingle(int, uint8_t (*)[33], uint8_t (*)[20])':
mutagen.cpp:280:63: error: '_byteswap_ulong' was not declared in this scope; did you mean '_byteswap_uint64'?
280 | *reinterpret_cast<uint32_t*>(&shaInputs[i][60]) = _byteswap_ulong(33 * 8);
| ^~~~~~~~~~~~~~~
| _byteswap_uint64
mutagen.cpp:288:61: error: '_byteswap_ulong' was not declared in this scope; did you mean '_byteswap_uint64'?
288 | *reinterpret_cast<uint32_t*>(&shaPadding[60]) = _byteswap_ulong(33 * 8);
| ^~~~~~~~~~~~~~~
| _byteswap_uint64
mutagen.cpp:310:66: error: '_byteswap_ulong' was not declared in this scope; did you mean '_byteswap_uint64'?
310 | *reinterpret_cast<uint32_t*>(&ripemdInputs[i][60]) = _byteswap_ulong(256);
| ^~~~~~~~~~~~~~~
| _byteswap_uint64
mutagen.cpp:318:64: error: '_byteswap_ulong' was not declared in this scope; did you mean '_byteswap_uint64'?
318 | *reinterpret_cast<uint32_t*>(&ripemdPadding[60]) = _byteswap_ulong(256);
| ^~~~~~~~~~~~~~~
| _byteswap_uint64

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 21:52:44 UTC

Quote from: nomachine on Today at 09:48:04 PM

Quote from: Denevron on Today at 07:29:38 PM

Quote from: nomachine on Today at 06:44:49 PM

Quote from: Denevron on Today at 06:20:58 PM

so there will still be 28453041475240576740 combinations, not 47478523248093572 Grin

Yes, if the script uses 128-bit counters and 128-bit arithmetic. After that, the next step is optimizing this version for AVX2. Grin

It will be interesting to see)

I've added a check for a max of 6 bytes for now, to discard unsuitable options faster)

https://github.com/NoMachine1/Mutagen/

sequential?

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 21:45:50 UTC

Quote from: user_not_here on Today at 09:34:18 PM

Random mutagen V3

Build on Commit b97c178 fixed for Cross-platform terminal functions ( NoMachine1 / Mutagen )

Key changes made:

   Added random number generation to the CombinationGenerator class with a thread-specific seed

   Modified the worker function to:
   Generate random combinations within its assigned range using a uniform distribution
   Use a thread-local random number generator with a unique seed
   Process combinations in random order rather than sequentially

   The CombinationGenerator now has:
   A random() method to generate random combinations
   Thread-safe random number generation with proper seeding
   Fisher-Yates shuffle for generating random combinations

   Each worker thread now:
   Gets its own range of combinations to check
   Processes them in random order within that range
   Uses a unique seed for its random number generator

Code:

#include <iostream>
#include <array>
#include <iomanip>
#include <fstream>
#include <sstream>
#include <vector>
#include <thread>
#include <atomic>
#include <chrono>
#include <queue>
#include <mutex>
#include <cstring>
#include <unordered_map>
#include <cmath>
#include <immintrin.h>
#include <omp.h>
#include <csignal>
#include <random>
#include <algorithm>
#include <getopt.h>

#ifdef _WIN32
   #include <windows.h>
#else
   #include <unistd.h>
#endif

// Include the required headers
#include "sha256_avx2.h"
#include "ripemd160_avx2.h"
#include "SECP256K1.h"
#include "Point.h"
#include "Int.h"
#include "IntGroup.h"

using namespace std;

// Cross-platform terminal functions
void initConsole() {
#ifdef _WIN32
   HANDLE hConsole = GetStdHandle(STD_OUTPUT_HANDLE);
   DWORD mode = 0;
   GetConsoleMode(hConsole, &mode);
   SetConsoleMode(hConsole, mode | ENABLE_VIRTUAL_TERMINAL_PROCESSING);
#endif
}

void clearTerminal() {
#ifdef _WIN32
   HANDLE hStdOut = GetStdHandle(STD_OUTPUT_HANDLE);
   COORD coord = {0, 0};
   DWORD count;
   CONSOLE_SCREEN_BUFFER_INFO csbi;
   GetConsoleScreenBufferInfo(hStdOut, &csbi);
   FillConsoleOutputCharacter(hStdOut, ' ', csbi.dwSize.X * csbi.dwSize.Y, coord, &count);
   SetConsoleCursorPosition(hStdOut, coord);
#else
   std::cout << "\033[2J\033[H";
#endif
   std::cout.flush();
}

void moveCursorTo(int x, int y) {
#ifdef _WIN32
   HANDLE hStdOut = GetStdHandle(STD_OUTPUT_HANDLE);
   COORD coord = {(SHORT)x, (SHORT)y};
   SetConsoleCursorPosition(hStdOut, coord);
#else
   std::cout << "\033[" << y << ";" << x << "H";
#endif
   std::cout.flush();
}

// Configuration defaults
int PUZZLE_NUM = 20;
int WORKERS = max(1, (int)thread::hardware_concurrency());
int FLIP_COUNT = -1;
const __uint128_t REPORT_INTERVAL = 10000000;
static constexpr int POINTS_BATCH_SIZE = 512;
static constexpr int HASH_BATCH_SIZE = 8;

// Historical puzzle data
const unordered_map<int, tuple<int, string, string>> PUZZLE_DATA = {
   {20, {8, "b907c3a2a3b27789dfb509b730dd47703c272868", "357535"}},
   {21, {9, "29a78213caa9eea824acf08022ab9dfc83414f56", "863317"}},
   {22, {11, "7ff45303774ef7a52fffd8011981034b258cb86b", "1811764"}},
   {23, {12, "d0a79df189fe1ad5c306cc70497b358415da579e", "3007503"}},
   {24, {9, "0959e80121f36aea13b3bad361c15dac26189e2f", "5598802"}},
   {25, {12, "2f396b29b27324300d0c59b17c3abc1835bd3dbb", "14428676"}},
   {26, {14, "bfebb73562d4541b32a02ba664d140b5a574792f", "33185509"}},
   {27, {13, "0c7aaf6caa7e5424b63d317f0f8f1f9fa40d5560", "54538862"}},
   {28, {16, "1306b9e4ff56513a476841bac7ba48d69516b1da", "111949941"}},
   {29, {18, "5a416cc9148f4a377b672c8ae5d3287adaafadec", "227634408"}},
   {30, {16, "d39c4704664e1deb76c9331e637564c257d68a08", "400708894"}},
   {31, {13, "d805f6f251f7479ebd853b3d0f4b9b2656d92f1d", "1033162084"}},
   {32, {14, "9e42601eeaedc244e15f17375adb0e2cd08efdc9", "2102388551"}},
   {33, {15, "4e15e5189752d1eaf444dfd6bff399feb0443977", "3093472814"}},
   {34, {16, "f6d67d7983bf70450f295c9cb828daab265f1bfa", "7137437912"}},
   {35, {19, "f6d8ce225ffbdecec170f8298c3fc28ae686df25", "14133072157"}},
   {36, {14, "74b1e012be1521e5d8d75e745a26ced845ea3d37", "20112871792"}},
   {37, {23, "28c30fb11ed1da72e7c4f89c0164756e8a021d", "42387769980"}},
   {38, {21, "b190e2d40cfdeee2cee072954a2be89e7ba39364", "100251560595"}},
   {39, {23, "0b304f2a79a027270276533fe1ed4eff30910876", "146971536592"}},
   {40, {20, "95a156cd21b4a69de969eb6716864f4c8b82a82a", "323724968937"}},
   {41, {25, "d1562eb37357f9e6fc41cb2359f4d3eda4032329", "1003651412950"}},
   {42, {24, "8efb85f9c5b5db2d55973a04128dc7510075ae23", "1458252205147"}},
   {43, {19, "f92044c7924e5525c61207972c253c9fc9f086f7", "2895374552463"}},
   {44, {24, "80df54e1f612f2fc5bdc05c9d21a83aa8d20791e", "7409811047825"}},
   {45, {21, "f0225bfc68a6e17e87cd8b5e60ae3be18f120753", "15404761757071"}},
   {46, {24, "9a012260d01c5113df66c8a8438c9f7a1e3d5dac", "19996463086597"}},
   {47, {27, "f828005d41b0f4fed4c8dca3b06011072cfb07d4", "51408670348612"}},
   {48, {21, "8661cb56d9df0a61f01328b55af7e56a3fe7a2b2", "119666659114170"}},
   {49, {30, "0d2f533966c6578e1111978ca698f8add7fffdf3", "191206974700443"}},
   {50, {29, "de081b76f840e462fa2cdf360173dfaf4a976a47", "409118905032525"}},
   {51, {25, "ef6419cffd7fad7027994354eb8efae223c2dbe7", "611140496167764"}},
   {52, {27, "36af659edbe94453f6344e920d143f1778653ae7", "2058769515153876"}},
   {53, {26, "2f4870ef54fa4b048c1365d42594cc7d3d269551", "4216495639600700"}},
   {54, {30, "cb66763cf7fde659869ae7f06884d9a0f879a092", "6763683971478124"}},
   {55, {31, "db53d9bbd1f3a83b094eeca7dd970bd85b492fa2", "9974455244496707"}},
   {56, {31, "48214c5969ae9f43f75070cea1e2cb41d5bdcccd", "30045390491869460"}},
   {57, {33, "328660ef43f66abe2653fa178452a5dfc594c2a1", "44218742292676575"}},
   {58, {28, "8c2a6071f89c90c4dab5ab295d7729d1b54ea60f", "138245758910846492"}},
   {59, {30, "b14ed3146f5b2c9bde1703deae9ef33af8110210", "199976667976342049"}},
   {60, {31, "cdf8e5c7503a9d22642e3ecfc87817672787b9c5", "525070384258266191"}},
   {61, {25, "68133e19b2dfb9034edf9830a200cfdf38c90cbd", "1135041350219496382"}},
   {62, {35, "e26646db84b0602f32b34b5a62ca3cae1f91b779", "1425787542618654982"}},
   {63, {34, "ef58afb697b094423ce90721fbb19a359ef7c50e", "3908372542507822062"}},
   {64, {34, "3ee4133d991f52fdf6a25c9834e0745ac74248a4", "8993229949524469768"}},
   {65, {37, "52e763a7ddc1aa4fa811578c491c1bc7fd570137", "17799667357578236628"}},
   {66, {35, "20d45a6a762535700ce9e0b216e31994335db8a5", "30568377312064202855"}},
   {67, {31, "739437bb3dd6d1983e66629c5f08c70e52769371", "46346217550346335726"}},
   {68, {34, "e0b8a2baee1b77fc703455f39d51477451fc8cfc", "132656943602386256302"}}
};

// Global variables
vector<unsigned char> TARGET_HASH160_RAW(20);
string TARGET_HASH160;
Int BASE_KEY;
atomic<bool> stop_event(false);
mutex result_mutex;
queue<tuple<string, __uint128_t, int>> results;

// 128-bit counter
atomic<uint64_t> total_checked_high(0);
atomic<uint64_t> total_checked_low(0);
__uint128_t total_combinations = 0;
vector<string> g_threadPrivateKeys;
mutex progress_mutex;

// Performance tracking
atomic<uint64_t> globalComparedCount(0);
atomic<uint64_t> localComparedCount(0);
double globalElapsedTime = 0.0;
double mkeysPerSec = 0.0;
chrono::time_point<chrono::high_resolution_clock> tStart;

// Helper functions
__uint128_t load_128() {
   return (static_cast<__uint128_t>(total_checked_high.load()) << 64) | total_checked_low.load();
}

void increment_128() {
   uint64_t old_low = total_checked_low.fetch_add(1);
   if (old_low == UINT64_MAX) {
   total_checked_high.fetch_add(1);
   }
}

static string formatElapsedTime(double seconds) {
   int hrs = static_cast<int>(seconds) / 3600;
   int mins = (static_cast<int>(seconds) % 3600) / 60;
   int secs = static_cast<int>(seconds) % 60;
   ostringstream oss;
   oss << setw(2) << setfill('0') << hrs << ":"
   << setw(2) << setfill('0') << mins << ":"
   << setw(2) << setfill('0') << secs;
   return oss.str();
}

static string to_string_128(__uint128_t value) {
   if (value == 0) return "0";
   char buffer[50];
   char* p = buffer + sizeof(buffer);
   *--p = '\0';
   while (value != 0) {
   *--p = "0123456789"[value % 10];
   value /= 10;
   }
   return p;
}

void signalHandler(int signum) {
   stop_event.store(true);
   cout << "\nInterrupt received, shutting down...\n";
}

class CombinationGenerator {
   int n, k;
   vector<int> current;
   mt19937_64 rng;

public:
   CombinationGenerator(int n, int k, uint64_t seed = 0) : n(n), k(k), current(k), rng(seed) {
   if (k > n) k = n;
   for (int i = 0; i < k; ++i) current[i] = i;
   }

   static __uint128_t combinations_count(int n, int k) {
   if (k > n) return 0;
   if (k * 2 > n) k = n - k;
   if (k == 0) return 1;

   __uint128_t result = n;
   for(int i = 2; i <= k; ++i) {
   result *= (n - i + 1);
   result /= i;
   }
   return result;
   }

   const vector<int>& get() const { return current; }

   bool next() {
   int i = k - 1;
   while (i >= 0 && current[i] == n - k + i) --i;
   if (i < 0) return false;
   ++current[i];
   for (int j = i + 1; j < k; ++j)
   current[j] = current[j-1] + 1;
   return true;
   }

   void unrank(__uint128_t rank) {
   __uint128_t total = combinations_count(n, k);
   if (rank >= total) {
   current.clear();
   return;
   }

   current.resize(k);
   __uint128_t remaining_rank = rank;
   int a = n;
   int b = k;
   __uint128_t x = (total - 1) - rank;

   for (int i = 0; i < k; i++) {
   a = largest_a_where_comb_a_b_le_x(a, b, x);
   current[i] = (n - 1) - a;
   x -= combinations_count(a, b);
   b--;
   }
   }

private:
   int largest_a_where_comb_a_b_le_x(int a, int b, __uint128_t x) const {
   while (a >= b && combinations_count(a, b) > x) a--;
   return a;
   }
};

static void computeHash160BatchBinSingle(int numKeys,
   uint8_t pubKeys[][33],
   uint8_t hashResults[][20])
{
   alignas(32) array<array<uint8_t, 64>, HASH_BATCH_SIZE> shaInputs;
   alignas(32) array<array<uint8_t, 32>, HASH_BATCH_SIZE> shaOutputs;
   alignas(32) array<array<uint8_t, 64>, HASH_BATCH_SIZE> ripemdInputs;
   alignas(32) array<array<uint8_t, 20>, HASH_BATCH_SIZE> ripemdOutputs;

   const __uint128_t totalBatches = (numKeys + (HASH_BATCH_SIZE - 1)) / HASH_BATCH_SIZE;

   for (__uint128_t batch = 0; batch < totalBatches; batch++) {
   const __uint128_t batchCount = min<__uint128_t>(HASH_BATCH_SIZE, numKeys - batch * HASH_BATCH_SIZE);

   // Prepare SHA-256 input blocks
   for (__uint128_t i = 0; i < batchCount; i++) {
   memset(shaInputs[i].data(), 0, 64);
   memcpy(shaInputs[i].data(), pubKeys[batch * HASH_BATCH_SIZE + i], 33);
   shaInputs[i][33] = 0x80;
   *reinterpret_cast<uint32_t*>(&shaInputs[i][60]) = _byteswap_ulong(33 * 8);
   }

   if (batchCount < HASH_BATCH_SIZE) {
   static array<uint8_t, 64> shaPadding = {};
   memset(shaPadding.data(), 0, 64);
   memcpy(shaPadding.data(), pubKeys[0], 33);
   shaPadding[33] = 0x80;
   *reinterpret_cast<uint32_t*>(&shaPadding[60]) = _byteswap_ulong(33 * 8);
   for (__uint128_t i = batchCount; i < HASH_BATCH_SIZE; i++) {
   memcpy(shaInputs[i].data(), shaPadding.data(), 64);
   }
   }

   uint8_t* inPtr[HASH_BATCH_SIZE];
   uint8_t* outPtr[HASH_BATCH_SIZE];
   for (int i = 0; i < HASH_BATCH_SIZE; i++) {
   inPtr[i] = shaInputs[i].data();
   outPtr[i] = shaOutputs[i].data();
   }

   sha256avx2_8B(inPtr[0], inPtr[1], inPtr[2], inPtr[3],
   inPtr[4], inPtr[5], inPtr[6], inPtr[7],
   outPtr[0], outPtr[1], outPtr[2], outPtr[3],
   outPtr[4], outPtr[5], outPtr[6], outPtr[7]);

   for (__uint128_t i = 0; i < batchCount; i++) {
   memset(ripemdInputs[i].data(), 0, 64);
   memcpy(ripemdInputs[i].data(), shaOutputs[i].data(), 32);
   ripemdInputs[i][32] = 0x80;
   *reinterpret_cast<uint32_t*>(&ripemdInputs[i][60]) = _byteswap_ulong(256);
   }

   if (batchCount < HASH_BATCH_SIZE) {
   static array<uint8_t, 64> ripemdPadding = {};
   memset(ripemdPadding.data(), 0, 64);
   memcpy(ripemdPadding.data(), shaOutputs[0].data(), 32);
   ripemdPadding[32] = 0x80;
   *reinterpret_cast<uint32_t*>(&ripemdPadding[60]) = _byteswap_ulong(256);
   for (__uint128_t i = batchCount; i < HASH_BATCH_SIZE; i++) {
   memcpy(ripemdInputs[i].data(), ripemdPadding.data(), 64);
   }
   }

   for (int i = 0; i < HASH_BATCH_SIZE; i++) {
   inPtr[i] = ripemdInputs[i].data();
   outPtr[i] = ripemdOutputs[i].data();
   }

   ripemd160avx2::ripemd160avx2_32(
   inPtr[0], inPtr[1], inPtr[2], inPtr[3],
   inPtr[4], inPtr[5], inPtr[6], inPtr[7],
   outPtr[0], outPtr[1], outPtr[2], outPtr[3],
   outPtr[4], outPtr[5], outPtr[6], outPtr[7]
   );

   for (__uint128_t i = 0; i < batchCount; i++) {
   memcpy(hashResults[batch * HASH_BATCH_SIZE + i], ripemdOutputs[i].data(), 20);
   }
   }
}

void worker(Secp256K1* secp, int bit_length, int flip_count, int threadId, __uint128_t start, __uint128_t end) {
   const int fullBatchSize = 2 * POINTS_BATCH_SIZE;
   uint8_t localPubKeys[HASH_BATCH_SIZE][33];
   uint8_t localHashResults[HASH_BATCH_SIZE][20];
   int pointIndices[HASH_BATCH_SIZE];

   // Precompute points
   vector<Point> plusPoints(POINTS_BATCH_SIZE);
   vector<Point> minusPoints(POINTS_BATCH_SIZE);
   for (int i = 0; i < POINTS_BATCH_SIZE; i++) {
   Int tmp; tmp.SetInt32(i);
   plusPoints[i] = secp->ComputePublicKey(&tmp);
   minusPoints[i] = plusPoints[i];
   minusPoints[i].y.ModNeg();
   }

   // Structure of Arrays
   vector<Int> deltaX(POINTS_BATCH_SIZE);
   IntGroup modGroup(POINTS_BATCH_SIZE);
   vector<Int> pointBatchX(fullBatchSize);
   vector<Int> pointBatchY(fullBatchSize);

   // Random number generation
   random_device rd;
   mt19937_64 rng(rd() + threadId);
   uniform_int_distribution<uint64_t> dist(0, UINT64_MAX);

   CombinationGenerator gen(bit_length, flip_count, rd() + threadId);
   gen.unrank(start);

   for (__uint128_t count = start; !stop_event.load() && count < end; ) {
   Int currentKey;
   currentKey.Set(&BASE_KEY);

   const vector<int>& flips = gen.get();

   // Apply flips
   for (int pos : flips) {
   Int mask; mask.SetInt32(1); mask.ShiftL(pos);
   Int temp; temp.Set(&currentKey); temp.ShiftR(pos);
   temp.IsEven() ? currentKey.Add(&mask) : currentKey.Sub(&mask);
   }

   // Store private key
   string keyStr = currentKey.GetBase16();
   keyStr = string(64 - keyStr.length(), '0') + keyStr;
   #pragma omp critical
   g_threadPrivateKeys[threadId] = keyStr;

   // Compute public key
   Point startPoint = secp->ComputePublicKey(&currentKey);
   Int startPointX = startPoint.x, startPointY = startPoint.y, startPointXNeg = startPointX;
   startPointXNeg.ModNeg();

   // Compute deltaX values
   for (int i = 0; i < POINTS_BATCH_SIZE; i += 4) {
   deltaX[i].ModSub(&plusPoints[i].x, &startPointX);
   deltaX[i+1].ModSub(&plusPoints[i+1].x, &startPointX);
   deltaX[i+2].ModSub(&plusPoints[i+2].x, &startPointX);
   deltaX[i+3].ModSub(&plusPoints[i+3].x, &startPointX);
   }
   modGroup.Set(deltaX.data());
   modGroup.ModInv();

   // Process points
   for (int i = 0; i < POINTS_BATCH_SIZE; i += 4) {
   for (int j = 0; j < 4; j++) {
   // Plus points
   Int deltaY; deltaY.ModSub(&plusPoints[i+j].y, &startPointY);
   Int slope; slope.ModMulK1(&deltaY, &deltaX[i+j]);
   Int slopeSq; slopeSq.ModSquareK1(&slope);

   pointBatchX[i+j].Set(&startPointXNeg);
   pointBatchX[i+j].ModAdd(&slopeSq);
   pointBatchX[i+j].ModSub(&plusPoints[i+j].x);

   Int diffX; diffX.ModSub(&startPointX, &pointBatchX[i+j]);
   diffX.ModMulK1(&slope);

   pointBatchY[i+j].Set(&startPointY);
   pointBatchY[i+j].ModNeg();
   pointBatchY[i+j].ModAdd(&diffX);

   // Minus points
   deltaY.ModSub(&minusPoints[i+j].y, &startPointY);
   slope.ModMulK1(&deltaY, &deltaX[i+j]);
   slopeSq.ModSquareK1(&slope);

   pointBatchX[POINTS_BATCH_SIZE+i+j].Set(&startPointXNeg);
   pointBatchX[POINTS_BATCH_SIZE+i+j].ModAdd(&slopeSq);
   pointBatchX[POINTS_BATCH_SIZE+i+j].ModSub(&minusPoints[i+j].x);

   diffX.ModSub(&startPointX, &pointBatchX[POINTS_BATCH_SIZE+i+j]);
   diffX.ModMulK1(&slope);

   pointBatchY[POINTS_BATCH_SIZE+i+j].Set(&startPointY);
   pointBatchY[POINTS_BATCH_SIZE+i+j].ModNeg();
   pointBatchY[POINTS_BATCH_SIZE+i+j].ModAdd(&diffX);
   }
   }

   // Process keys in batches
   int localBatchCount = 0;
   for (int i = 0; i < fullBatchSize && localBatchCount < HASH_BATCH_SIZE; i++) {
   Point tempPoint;
   tempPoint.x.Set(&pointBatchX[i]);
   tempPoint.y.Set(&pointBatchY[i]);

   // Convert to compressed public key
   localPubKeys[localBatchCount][0] = tempPoint.y.IsEven() ? 0x02 : 0x03;
   for (int j = 0; j < 32; j++) {
   localPubKeys[localBatchCount][1 + j] = pointBatchX[i].GetByte(31 - j);
   }
   pointIndices[localBatchCount] = i;
   localBatchCount++;

   if (localBatchCount == HASH_BATCH_SIZE) {
   computeHash160BatchBinSingle(localBatchCount, localPubKeys, localHashResults);
   localComparedCount += HASH_BATCH_SIZE;

   __m256i target = _mm256_loadu_si256(reinterpret_cast<const __m256i*>(TARGET_HASH160_RAW.data()));

   for (int j = 0; j < HASH_BATCH_SIZE; j++) {
   __m256i result = _mm256_loadu_si256(reinterpret_cast<const __m256i*>(localHashResults[j]));

   int mask = _mm256_movemask_epi8(_mm256_cmpeq_epi8(result, target));
   const int HASH160_MASK = (1 << 20) - 1;

   if ((mask & HASH160_MASK) == HASH160_MASK) {
   bool fullMatch = true;
   for (int k = 0; k < 20; k++) {
   if (localHashResults[j][k] != TARGET_HASH160_RAW[k]) {
   fullMatch = false;
   break;
   }
   }

   if (fullMatch) {
   auto tEndTime = chrono::high_resolution_clock::now();
   globalElapsedTime = chrono::duration<double>(tEndTime - tStart).count();
   mkeysPerSec = (double)(globalComparedCount + localComparedCount) / globalElapsedTime / 1e6;

   Int foundKey; foundKey.Set(&currentKey);
   int idx = pointIndices[j];
   if (idx < POINTS_BATCH_SIZE) {
   Int offset; offset.SetInt32(idx);
   foundKey.Add(&offset);
   } else {
   Int offset; offset.SetInt32(idx - POINTS_BATCH_SIZE);
   foundKey.Sub(&offset);
   }

   string hexKey = foundKey.GetBase16();
   hexKey = string(64 - hexKey.length(), '0') + hexKey;

   lock_guard<mutex> lock(result_mutex);
   results.push(make_tuple(hexKey, load_128(), flip_count));
   stop_event.store(true);
   return;
   }
   }
   }

   increment_128();
   localBatchCount = 0;

   // Progress reporting
   __uint128_t current_total = load_128();
   if (current_total % REPORT_INTERVAL == 0 || count == end - 1) {
   auto now = chrono::high_resolution_clock::now();
   globalElapsedTime = chrono::duration<double>(now - tStart).count();

   globalComparedCount += localComparedCount;
   localComparedCount = 0;
   mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

   double progress = min(100.0, (double)current_total / total_combinations * 100.0);

   lock_guard<mutex> lock(progress_mutex);
   moveCursorTo(0, 9);
   cout << "Progress: " << fixed << setprecision(6) << progress << "%\n";
   cout << "Processed: " << to_string_128(current_total) << "\n";
   cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";
   cout << "Elapsed Time: " << formatElapsedTime(globalElapsedTime) << "\n";
   cout.flush();

   if (current_total >= total_combinations) {
   stop_event.store(true);
   break;
   }
   }
   }
   }

   if (!gen.next()) break;
   count++;
   }

   if (!stop_event.load() && load_128() >= total_combinations) {
   stop_event.store(true);
   }
}

void printUsage(const char* programName) {
   cout << "Usage: " << programName << " [options]\n";
   cout << "Options:\n";
   cout << " -p, --puzzle NUM Puzzle number to solve (default: 20)\n";
   cout << " -t, --threads NUM Number of CPU cores to use (default: all)\n";
   cout << " -f, --flips NUM Override default flip count for puzzle\n";
   cout << " -h, --help Show this help message\n";
   cout << "\nExample:\n";
   cout << " " << programName << " -p 38 -t 8 -f 21\n";
}

int main(int argc, char* argv[]) {
   signal(SIGINT, signalHandler);

   // Parse command line arguments
   int opt;
   int option_index = 0;
   static struct option long_options[] = {
   {"puzzle", required_argument, 0, 'p'},
   {"threads", required_argument, 0, 't'},
   {"flips", required_argument, 0, 'f'},
   {"help", no_argument, 0, 'h'},
   {0, 0, 0, 0}
   };

   while ((opt = getopt_long(argc, argv, "p:t:f:h", long_options, &option_index)) != -1) {
   switch (opt) {
   case 'p':
   PUZZLE_NUM = atoi(optarg);
   if (PUZZLE_NUM < 20 || PUZZLE_NUM > 68) {
   cerr << "Error: Puzzle number must be between 20 and 68\n";
   return 1;
   }
   break;
   case 't':
   WORKERS = atoi(optarg);
   if (WORKERS < 1) {
   cerr << "Error: Thread count must be at least 1\n";
   return 1;
   }
   break;
   case 'f':
   FLIP_COUNT = atoi(optarg);
   if (FLIP_COUNT < 1) {
   cerr << "Error: Flip count must be at least 1\n";
   return 1;
   }
   break;
   case 'h':
   printUsage(argv[0]);
   return 0;
   default:
   printUsage(argv[0]);
   return 1;
   }
   }

   tStart = chrono::high_resolution_clock::now();

   Secp256K1 secp;
   secp.Init();

   auto puzzle_it = PUZZLE_DATA.find(PUZZLE_NUM);
   if (puzzle_it == PUZZLE_DATA.end()) {
   cerr << "Error: Invalid puzzle number\n";
   return 1;
   }

   auto [DEFAULT_FLIP_COUNT, TARGET_HASH160_HEX, PRIVATE_KEY_DECIMAL] = puzzle_it->second;

   if (FLIP_COUNT == -1) FLIP_COUNT = DEFAULT_FLIP_COUNT;
   TARGET_HASH160 = TARGET_HASH160_HEX;

   // Convert target hash to bytes
   for (__uint128_t i = 0; i < 20; i++) {
   TARGET_HASH160_RAW[i] = stoul(TARGET_HASH160.substr(i * 2, 2), nullptr, 16);
   }

   // Set base key
   BASE_KEY.SetBase10(const_cast<char*>(PRIVATE_KEY_DECIMAL.c_str()));

   // Verify base key
   Int testKey;
   testKey.SetBase10(const_cast<char*>(PRIVATE_KEY_DECIMAL.c_str()));
   if (!testKey.IsEqual(&BASE_KEY)) {
   cerr << "Base key initialization failed!\n";
   return 1;
   }

   if (BASE_KEY.GetBitLength() > PUZZLE_NUM) {
   cerr << "Base key exceeds puzzle bit length!\n";
   return 1;
   }

   // Calculate total combinations
   total_combinations = CombinationGenerator::combinations_count(PUZZLE_NUM, FLIP_COUNT);

   // Format base key for display
   string paddedKey = BASE_KEY.GetBase16();
   size_t firstNonZero = paddedKey.find_first_not_of('0');
   paddedKey = paddedKey.substr(firstNonZero);
   paddedKey = "0x" + paddedKey;

   // Print initial header
   clearTerminal();
   cout << "=======================================\n";
   cout << "== Mutagen Puzzle Solver by Denevron ==\n";
   cout << "=======================================\n";
   cout << "Starting puzzle: " << PUZZLE_NUM << " (" << PUZZLE_NUM << "-bit)\n";
   cout << "Target HASH160: " << TARGET_HASH160.substr(0, 10) << "..." << TARGET_HASH160.substr(TARGET_HASH160.length()-10) << "\n";
   cout << "Base Key: " << paddedKey << "\n";
   cout << "Flip count: " << FLIP_COUNT << " ";
   if (FLIP_COUNT != DEFAULT_FLIP_COUNT) {
   cout << "(override, default was " << DEFAULT_FLIP_COUNT << ")";
   }
   cout << "\n";
   cout << "Total combinations for C(" << PUZZLE_NUM << "," << FLIP_COUNT << "): " << to_string_128(total_combinations) << "\n";
   cout << "Using: " << WORKERS << " threads\n";
   // Print empty lines for progress display
   cout << "Progress: 0.000000%\n";
   cout << "Processed: 0\n";
   cout << "Speed: 0.00 Mkeys/s\n";
   cout << "Elapsed Time: 00:00:00\n";

   g_threadPrivateKeys.resize(WORKERS, "0");
   vector<thread> threads;

   // Distribute work
   __uint128_t comb_per_thread = total_combinations / WORKERS;
   __uint128_t remainder = total_combinations % WORKERS;

   for (int i = 0; i < WORKERS; i++) {
   __uint128_t start = i * comb_per_thread + min((__uint128_t)i, remainder);
   __uint128_t end = start + comb_per_thread + (i < remainder ? 1 : 0);
   threads.emplace_back(worker, &secp, PUZZLE_NUM, FLIP_COUNT, i, start, end);
   }

   for (auto& t : threads) {
   if (t.joinable()) t.join();
   }

   if (!results.empty()) {
   auto [hex_key, checked, flips] = results.front();
   globalElapsedTime = chrono::duration<double>(chrono::high_resolution_clock::now() - tStart).count();
   mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

   string compactHex = hex_key;
   size_t firstNonZero = compactHex.find_first_not_of('0');
   compactHex = "0x" + compactHex.substr(firstNonZero);

   cout << "=======================================\n";
   cout << "=========== SOLUTION FOUND ============\n";
   cout << "=======================================\n";
   cout << "Private key: " << compactHex << "\n";
   cout << "Checked " << to_string_128(checked) << " combinations\n";
   cout << "Bit flips: " << flips << endl;
   cout << "Time: " << fixed << setprecision(2) << globalElapsedTime << " seconds ("
   << formatElapsedTime(globalElapsedTime) << ")\n";
   cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";

   // Save solution
   ofstream out("puzzle_" + to_string(PUZZLE_NUM) + "_solution.txt");
   if (out) {
   out << hex_key;
   out.close();
   cout << "Solution saved to puzzle_" << PUZZLE_NUM << "_solution.txt\n";
   } else {
   cerr << "Failed to save solution to file!\n";
   }
   } else {
   __uint128_t final_count = load_128();
   globalElapsedTime = chrono::duration<double>(chrono::high_resolution_clock::now() - tStart).count();
   mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

   cout << "\n\nNo solution found. Checked " << to_string_128(load_128()) << " combinations\n";
   cout << "Time: " << fixed << setprecision(2) << globalElapsedTime << " seconds ("
   << formatElapsedTime(globalElapsedTime) << ")\n";
   cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";
   }

   return 0;
}

No speed loss anymore(?)
Works on puzzle 68.
Most stable speed so far.

Error compiling!

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 21:44:32 UTC

Quote from: user_not_here on Today at 07:32:10 PM

Build on Commit b97c178 fixed for Cross-platform terminal functions ( NoMachine1 / Mutagen )

Key changes made:

   Added random number generation to the CombinationGenerator class with a thread-specific seed

   Modified the worker function to:
   Generate random combinations within its assigned range using a uniform distribution
   Use a thread-local random number generator with a unique seed
   Process combinations in random order rather than sequentially

   The CombinationGenerator now has:
   A random() method to generate random combinations
   Thread-safe random number generation with proper seeding
   Fisher-Yates shuffle for generating random combinations

   Each worker thread now:
   Gets its own range of combinations to check
   Processes them in random order within that range
   Uses a unique seed for its random number generator

Code:

No speed loss anymore(?)
Works on puzzle 68.

compiled okay!

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 20:49:50 UTC

Quote from: bibilgin on Today at 08:31:17 PM

Quote from: cctv5go on March 20, 2025, 03:46:28 AM

Do you want to exchange address prefixes?1MVDYgVaSN6iKJMdVvjz5EEm7Vb6Hvpygo
target:1MVDYgVaSN6iKKEsbzRUAYFrYJadLYZvvZ

1MVDYgVaSN6iKKEsbzRUAYFrYJadLYZvvZ
Are you sure it's in the 68 bit range? Do you have the hex?

Quote from: kTimesG on Today at 07:15:48 PM

OK, I have to finally admit it.

.....
But now I have another problem: I have so many prefixes, that computing the next ranges to scan takes a longer time than actually scanning the ranges. This mad spiral requires some GPUs to do the computation of the offsets for the GPU computation (of the offsets of the next GPU computation, because too many prefixes...), what do you think??

I'm gonna start swapping some sweet ranges soon with ya guys, tune up your telegrams if you've got your match-9 or longer address prefixes nicely lined up!

Am I seeing it wrong?

KTimesG -
Is it starting to search for prefixes? Is it doing calculations?

Guys, I was away for a while. I was out of town for a few weeks.
I came home today and I'm continuing to scan.

Welcome back Grin

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 17:11:35 UTC

Quote from: user_not_here on Today at 04:38:44 PM

speed is same as original code but this is random

got you, but as you say is crashing with high puzzle

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 01/04/2025, 16:32:03 UTC

Quote from: user_not_here on Today at 04:09:25 PM

1. Thread-Based Work Distribution
   Added: Thread management with std::thread
   Changed: Split work into N equal ranges (where N = worker count)
   Benefit: Evenly distributes workload across all CPU cores

2. Thread-Local Randomization
   Added: Per-thread Mersenne Twister RNG (mt19937_64)
   Changed: Fisher-Yates shuffle within each thread's range only
   Benefit: Eliminates contention while maintaining randomness

3. Range-Based Processing
   Added: Explicit start/end indices for each thread
   Removed: Global sequential counter
   Benefit: No shared state between threads

4. Progress Reporting Optimization
   Changed: Time-based (1 second) instead of count-based updates
   Added: Atomic timestamp for coordination
   Benefit: Reduces console I/O overhead

5. Result Handling
   Added: Thread-safe queue for results
   Changed: Early termination when solution found
   Benefit: Fast termination without losing results

Code:

#include <iostream>
#include <array>
#include <iomanip>
#include <fstream>
#include <sstream>
#include <vector>
#include <thread>
#include <atomic>
#include <chrono>
#include <queue>
#include <mutex>
#include <cstring>
#include <unordered_map>
#include <cmath>
#include <immintrin.h>
#include <omp.h>
#include <csignal>
#include <random>
#include <algorithm>
#include <getopt.h>

#ifdef _WIN32
   #include <windows.h>
#endif

// Include the required headers
#include "sha256_avx2.h"
#include "ripemd160_avx2.h"
#include "SECP256K1.h"
#include "Point.h"
#include "Int.h"
#include "IntGroup.h"

using namespace std;

// Cross-platform terminal functions
void initConsole() {
#ifdef _WIN32
   HANDLE hConsole = GetStdHandle(STD_OUTPUT_HANDLE);
   DWORD mode = 0;
   GetConsoleMode(hConsole, &mode);
   SetConsoleMode(hConsole, mode | ENABLE_VIRTUAL_TERMINAL_PROCESSING);
#endif
}

void clearTerminal() {
#ifdef _WIN32
   HANDLE hStdOut = GetStdHandle(STD_OUTPUT_HANDLE);
   COORD coord = {0, 0};
   DWORD count;
   CONSOLE_SCREEN_BUFFER_INFO csbi;
   GetConsoleScreenBufferInfo(hStdOut, &csbi);
   FillConsoleOutputCharacter(hStdOut, ' ', csbi.dwSize.X * csbi.dwSize.Y, coord, &count);
   SetConsoleCursorPosition(hStdOut, coord);
#else
   std::cout << "\033[2J\033[H";
#endif
   std::cout.flush();
}

void moveCursorTo(int x, int y) {
#ifdef _WIN32
   HANDLE hStdOut = GetStdHandle(STD_OUTPUT_HANDLE);
   COORD coord = {(SHORT)x, (SHORT)y};
   SetConsoleCursorPosition(hStdOut, coord);
#else
   std::cout << "\033[" << y << ";" << x << "H";
#endif
   std::cout.flush();
}

// Configuration
int PUZZLE_NUM = 20;
int WORKERS = omp_get_num_procs();
int FLIP_COUNT = -1;
static constexpr int POINTS_BATCH_SIZE = 256;
static constexpr int HASH_BATCH_SIZE = 8;

// Puzzle data
const unordered_map<int, tuple<int, string, string>> PUZZLE_DATA = {
   {20, {8, "b907c3a2a3b27789dfb509b730dd47703c272868", "357535"}},
   {21, {9, "29a78213caa9eea824acf08022ab9dfc83414f56", "863317"}},
   {22, {11, "7ff45303774ef7a52fffd8011981034b258cb86b", "1811764"}},
   {23, {12, "d0a79df189fe1ad5c306cc70497b358415da579e", "3007503"}},
   {24, {9, "0959e80121f36aea13b3bad361c15dac26189e2f", "5598802"}},
   {25, {12, "2f396b29b27324300d0c59b17c3abc1835bd3dbb", "14428676"}},
   {26, {14, "bfebb73562d4541b32a02ba664d140b5a574792f", "33185509"}},
   {27, {13, "0c7aaf6caa7e5424b63d317f0f8f1f9fa40d5560", "54538862"}},
   {28, {16, "1306b9e4ff56513a476841bac7ba48d69516b1da", "111949941"}},
   {29, {18, "5a416cc9148f4a377b672c8ae5d3287adaafadec", "227634408"}},
   {30, {16, "d39c4704664e1deb76c9331e637564c257d68a08", "400708894"}},
   {31, {13, "d805f6f251f7479ebd853b3d0f4b9b2656d92f1d", "1033162084"}},
   {32, {14, "9e42601eeaedc244e15f17375adb0e2cd08efdc9", "2102388551"}},
   {33, {15, "4e15e5189752d1eaf444dfd6bff399feb0443977", "3093472814"}},
   {34, {16, "f6d67d7983bf70450f295c9cb828daab265f1bfa", "7137437912"}},
   {35, {19, "f6d8ce225ffbdecec170f8298c3fc28ae686df25", "14133072157"}},
   {36, {14, "74b1e012be1521e5d8d75e745a26ced845ea3d37", "20112871792"}},
   {37, {23, "28c30fb11ed1da72e7c4f89c0164756e8a021d", "42387769980"}},
   {38, {21, "b190e2d40cfdeee2cee072954a2be89e7ba39364", "100251560595"}},
   {39, {23, "0b304f2a79a027270276533fe1ed4eff30910876", "146971536592"}},
   {40, {20, "95a156cd21b4a69de969eb6716864f4c8b82a82a", "323724968937"}},
   {41, {25, "d1562eb37357f9e6fc41cb2359f4d3eda4032329", "1003651412950"}},
   {42, {24, "8efb85f9c5b5db2d55973a04128dc7510075ae23", "1458252205147"}},
   {43, {19, "f92044c7924e5525c61207972c253c9fc9f086f7", "2895374552463"}},
   {44, {24, "80df54e1f612f2fc5bdc05c9d21a83aa8d20791e", "7409811047825"}},
   {45, {21, "f0225bfc68a6e17e87cd8b5e60ae3be18f120753", "15404761757071"}},
   {46, {24, "9a012260d01c5113df66c8a8438c9f7a1e3d5dac", "19996463086597"}},
   {47, {27, "f828005d41b0f4fed4c8dca3b06011072cfb07d4", "51408670348612"}},
   {48, {21, "8661cb56d9df0a61f01328b55af7e56a3fe7a2b2", "119666659114170"}},
   {49, {30, "0d2f533966c6578e1111978ca698f8add7fffdf3", "191206974700443"}},
   {50, {29, "de081b76f840e462fa2cdf360173dfaf4a976a47", "409118905032525"}},
   {51, {25, "ef6419cffd7fad7027994354eb8efae223c2dbe7", "611140496167764"}},
   {52, {27, "36af659edbe94453f6344e920d143f1778653ae7", "2058769515153876"}},
   {53, {26, "2f4870ef54fa4b048c1365d42594cc7d3d269551", "4216495639600700"}},
   {54, {30, "cb66763cf7fde659869ae7f06884d9a0f879a092", "6763683971478124"}},
   {55, {31, "db53d9bbd1f3a83b094eeca7dd970bd85b492fa2", "9974455244496707"}},
   {56, {31, "48214c5969ae9f43f75070cea1e2cb41d5bdcccd", "30045390491869460"}},
   {57, {33, "328660ef43f66abe2653fa178452a5dfc594c2a1", "44218742292676575"}},
   {58, {28, "8c2a6071f89c90c4dab5ab295d7729d1b54ea60f", "138245758910846492"}},
   {59, {30, "b14ed3146f5b2c9bde1703deae9ef33af8110210", "199976667976342049"}},
   {60, {31, "cdf8e5c7503a9d22642e3ecfc87817672787b9c5", "525070384258266191"}},
   {61, {25, "68133e19b2dfb9034edf9830a200cfdf38c90cbd", "1135041350219496382"}},
   {62, {35, "e26646db84b0602f32b34b5a62ca3cae1f91b779", "1425787542618654982"}},
   {63, {34, "ef58afb697b094423ce90721fbb19a359ef7c50e", "3908372542507822062"}},
   {64, {34, "3ee4133d991f52fdf6a25c9834e0745ac74248a4", "8993229949524469768"}},
   {65, {37, "52e763a7ddc1aa4fa811578c491c1bc7fd570137", "17799667357578236628"}},
   {66, {35, "20d45a6a762535700ce9e0b216e31994335db8a5", "30568377312064202855"}},
   {67, {31, "739437bb3dd6d1983e66629c5f08c70e52769371", "46346217550346335726"}},
   {68, {34, "e0b8a2baee1b77fc703455f39d51477451fc8cfc", "132656943602386256302"}}
};

// Global variables
vector<unsigned char> TARGET_HASH160_RAW(20);
string TARGET_HASH160;
Int BASE_KEY;
atomic<bool> stop_event(false);
mutex result_mutex;
queue<tuple<string, size_t, int>> results;
atomic<size_t> total_checked(0);
size_t total_combinations = 0;
vector<string> g_threadPrivateKeys;
mutex progress_mutex;

// Performance tracking
atomic<uint64_t> globalComparedCount(0);
atomic<uint64_t> localComparedCount(0);
double globalElapsedTime = 0.0;
double mkeysPerSec = 0.0;
chrono::time_point<chrono::high_resolution_clock> tStart;
atomic<chrono::time_point<chrono::high_resolution_clock>> lastReportTime(chrono::high_resolution_clock::now());

string formatElapsedTime(double seconds) {
   int hrs = static_cast<int>(seconds) / 3600;
   int mins = (static_cast<int>(seconds) % 3600) / 60;
   int secs = static_cast<int>(seconds) % 60;
   ostringstream oss;
   oss << setw(2) << setfill('0') << hrs << ":"
   << setw(2) << setfill('0') << mins << ":"
   << setw(2) << setfill('0') << secs;
   return oss.str();
}

void signalHandler(int signum) {
   stop_event.store(true);
   cout << "\nInterrupt received, shutting down...\n";
}

class CombinationGenerator {
   int n, k;
   vector<int> current;

public:
   CombinationGenerator(int n, int k) : n(n), k(k), current(k) {
   for (int i = 0; i < k; ++i) current[i] = i;
   }

   static size_t combinations_count(int n, int k) {
   if (k > n) return 0;
   if (k * 2 > n) k = n - k;
   if (k == 0) return 1;

   size_t result = n;
   for(int i = 2; i <= k; ++i) {
   result *= (n - i + 1);
   result /= i;
   }
   return result;
   }

   const vector<int>& get() const { return current; }

   bool next() {
   int i = k - 1;
   while (i >= 0 && current[i] == n - k + i) --i;
   if (i < 0) return false;

   ++current[i];
   for (int j = i + 1; j < k; ++j)
   current[j] = current[j-1] + 1;
   return true;
   }

   void unrank(size_t rank) {
   if (rank >= combinations_count(n, k)) {
   current.clear();
   return;
   }

   current.resize(k);
   size_t remaining_rank = rank;
   int a = n;
   int b = k;
   size_t x = (combinations_count(n, k) - 1) - rank;

   for (int i = 0; i < k; i++) {
   a = largest_a_where_comb_a_b_le_x(a, b, x);
   current[i] = (n - 1) - a;
   x -= combinations_count(a, b);
   b--;
   }
   }

private:
   int largest_a_where_comb_a_b_le_x(int a, int b, size_t x) const {
   while (a >= b && combinations_count(a, b) > x) {
   a--;
   }
   return a;
   }
};

inline void prepareShaBlock(const uint8_t* dataSrc, size_t dataLen, uint8_t* outBlock) {
   std::fill_n(outBlock, 64, 0);
   std::memcpy(outBlock, dataSrc, dataLen);
   outBlock[dataLen] = 0x80;
   const uint32_t bitLen = (uint32_t)(dataLen * 8);
   outBlock[60] = (uint8_t)((bitLen >> 24) & 0xFF);
   outBlock[61] = (uint8_t)((bitLen >> 16) & 0xFF);
   outBlock[62] = (uint8_t)((bitLen >> 8) & 0xFF);
   outBlock[63] = (uint8_t)( bitLen & 0xFF);
}

inline void prepareRipemdBlock(const uint8_t* dataSrc, uint8_t* outBlock) {
   std::fill_n(outBlock, 64, 0);
   std::memcpy(outBlock, dataSrc, 32);
   outBlock[32] = 0x80;
   const uint32_t bitLen = 256;
   outBlock[60] = (uint8_t)((bitLen >> 24) & 0xFF);
   outBlock[61] = (uint8_t)((bitLen >> 16) & 0xFF);
   outBlock[62] = (uint8_t)((bitLen >> 8) & 0xFF);
   outBlock[63] = (uint8_t)( bitLen & 0xFF);
}

static void computeHash160BatchBinSingle(int numKeys,
   uint8_t pubKeys[][33],
   uint8_t hashResults[][20])
{
   alignas(32) std::array<std::array<uint8_t, 64>, HASH_BATCH_SIZE> shaInputs;
   alignas(32) std::array<std::array<uint8_t, 32>, HASH_BATCH_SIZE> shaOutputs;
   alignas(32) std::array<std::array<uint8_t, 64>, HASH_BATCH_SIZE> ripemdInputs;
   alignas(32) std::array<std::array<uint8_t, 20>, HASH_BATCH_SIZE> ripemdOutputs;

   const size_t totalBatches = (numKeys + (HASH_BATCH_SIZE - 1)) / HASH_BATCH_SIZE;

   for (size_t batch = 0; batch < totalBatches; batch++) {
   const size_t batchCount = std::min<size_t>(HASH_BATCH_SIZE, numKeys - batch * HASH_BATCH_SIZE);

   // Prepare SHA-256 input blocks
   for (size_t i = 0; i < batchCount; i++) {
   prepareShaBlock(pubKeys[batch * HASH_BATCH_SIZE + i], 33, shaInputs[i].data());
   }

   if (batchCount < HASH_BATCH_SIZE) {
   static std::array<uint8_t, 64> shaPadding = {};
   prepareShaBlock(pubKeys[0], 33, shaPadding.data());
   for (size_t i = batchCount; i < HASH_BATCH_SIZE; i++) {
   std::memcpy(shaInputs[i].data(), shaPadding.data(), 64);
   }
   }

   const uint8_t* inPtr[HASH_BATCH_SIZE];
   uint8_t* outPtr[HASH_BATCH_SIZE];
   for (int i = 0; i < HASH_BATCH_SIZE; i++) {
   inPtr[i] = shaInputs[i].data();
   outPtr[i] = shaOutputs[i].data();
   }

   sha256avx2_8B(inPtr[0], inPtr[1], inPtr[2], inPtr[3],
   inPtr[4], inPtr[5], inPtr[6], inPtr[7],
   outPtr[0], outPtr[1], outPtr[2], outPtr[3],
   outPtr[4], outPtr[5], outPtr[6], outPtr[7]);

   for (size_t i = 0; i < batchCount; i++) {
   prepareRipemdBlock(shaOutputs[i].data(), ripemdInputs[i].data());
   }

   if (batchCount < HASH_BATCH_SIZE) {
   static std::array<uint8_t, 64> ripemdPadding = {};
   prepareRipemdBlock(shaOutputs[0].data(), ripemdPadding.data());
   for (size_t i = batchCount; i < HASH_BATCH_SIZE; i++) {
   std::memcpy(ripemdInputs[i].data(), ripemdPadding.data(), 64);
   }
   }

   for (int i = 0; i < HASH_BATCH_SIZE; i++) {
   inPtr[i] = ripemdInputs[i].data();
   outPtr[i] = ripemdOutputs[i].data();
   }

   ripemd160avx2::ripemd160avx2_32(
   (unsigned char*)inPtr[0], (unsigned char*)inPtr[1],
   (unsigned char*)inPtr[2], (unsigned char*)inPtr[3],
   (unsigned char*)inPtr[4], (unsigned char*)inPtr[5],
   (unsigned char*)inPtr[6], (unsigned char*)inPtr[7],
   outPtr[0], outPtr[1], outPtr[2], outPtr[3],
   outPtr[4], outPtr[5], outPtr[6], outPtr[7]
   );

   for (size_t i = 0; i < batchCount; i++) {
   std::memcpy(hashResults[batch * HASH_BATCH_SIZE + i], ripemdOutputs[i].data(), 20);
   }
   }
}

void worker(Secp256K1* secp, int bit_length, int flip_count, int threadId,
   size_t start, size_t end, mt19937_64& rng) {
   const int fullBatchSize = 2 * POINTS_BATCH_SIZE;
   uint8_t localPubKeys[HASH_BATCH_SIZE][33];
   uint8_t localHashResults[HASH_BATCH_SIZE][20];
   int pointIndices[HASH_BATCH_SIZE];

   // Precompute points
   vector<Point> plusPoints(POINTS_BATCH_SIZE);
   vector<Point> minusPoints(POINTS_BATCH_SIZE);
   for (int i = 0; i < POINTS_BATCH_SIZE; i++) {
   Int tmp; tmp.SetInt32(i);
   plusPoints[i] = secp->ComputePublicKey(&tmp);
   minusPoints[i] = plusPoints[i];
   minusPoints[i].y.ModNeg();
   }

   // Structure of Arrays
   vector<Int> deltaX(POINTS_BATCH_SIZE);
   IntGroup modGroup(POINTS_BATCH_SIZE);
   vector<Int> pointBatchX(fullBatchSize);
   vector<Int> pointBatchY(fullBatchSize);

   CombinationGenerator gen(bit_length, flip_count);

   // Create and shuffle indices within thread's range
   vector<size_t> indices;
   indices.reserve(end - start);

   for(size_t i = start; i < end; i++) {
   indices.push_back(i);
   }

   // Fisher-Yates shuffle only within this thread's range
   for(size_t i = indices.size() - 1; i > 0; --i) {
   size_t j = rng() % (i + 1);
   swap(indices[i], indices[j]);
   }

   size_t processed = 0;
   for (size_t rank : indices) {
   if(stop_event.load()) break;

   gen.unrank(rank);
   Int currentKey;
   currentKey.Set(&BASE_KEY);

   const vector<int>& flips = gen.get();

   // Apply flips
   for (int pos : flips) {
   Int mask;
   mask.SetInt32(1);
   mask.ShiftL(pos);

   Int temp;
   temp.Set(&currentKey);
   temp.ShiftR(pos);

   if (temp.IsEven()) {
   currentKey.Add(&mask);
   } else {
   currentKey.Sub(&mask);
   }
   }

   // Verify key length
   string keyStr = currentKey.GetBase16();
   keyStr = string(64 - keyStr.length(), '0') + keyStr;

   #pragma omp critical
   {
   g_threadPrivateKeys[threadId] = keyStr;
   }

   // Compute public key and process in batches
   Point startPoint = secp->ComputePublicKey(&currentKey);
   Int startPointX, startPointY, startPointXNeg;
   startPointX.Set(&startPoint.x);
   startPointY.Set(&startPoint.y);
   startPointXNeg.Set(&startPointX);
   startPointXNeg.ModNeg();

   // Compute deltaX values in batches of 4 (optimized)
   for (int i = 0; i < POINTS_BATCH_SIZE; i += 4) {
   deltaX[i].ModSub(&plusPoints[i].x, &startPointX);
   deltaX[i+1].ModSub(&plusPoints[i+1].x, &startPointX);
   deltaX[i+2].ModSub(&plusPoints[i+2].x, &startPointX);
   deltaX[i+3].ModSub(&plusPoints[i+3].x, &startPointX);
   }
   modGroup.Set(deltaX.data());
   modGroup.ModInv();

   // Process plus and minus points in batches
   for (int i = 0; i < POINTS_BATCH_SIZE; i += 4) {
   for (int j = 0; j < 4; j++) {
   // Plus points (0..255)
   Int deltaY; deltaY.ModSub(&plusPoints[i+j].y, &startPointY);
   Int slope; slope.ModMulK1(&deltaY, &deltaX[i+j]);
   Int slopeSq; slopeSq.ModSquareK1(&slope);

   pointBatchX[i+j].Set(&startPointXNeg);
   pointBatchX[i+j].ModAdd(&slopeSq);
   pointBatchX[i+j].ModSub(&plusPoints[i+j].x);

   Int diffX; diffX.ModSub(&startPointX, &pointBatchX[i+j]);
   diffX.ModMulK1(&slope);

   pointBatchY[i+j].Set(&startPointY);
   pointBatchY[i+j].ModNeg();
   pointBatchY[i+j].ModAdd(&diffX);

   // Minus points (256..511)
   deltaY.ModSub(&minusPoints[i+j].y, &startPointY);
   slope.ModMulK1(&deltaY, &deltaX[i+j]);
   slopeSq.ModSquareK1(&slope);

   pointBatchX[POINTS_BATCH_SIZE+i+j].Set(&startPointXNeg);
   pointBatchX[POINTS_BATCH_SIZE+i+j].ModAdd(&slopeSq);
   pointBatchX[POINTS_BATCH_SIZE+i+j].ModSub(&minusPoints[i+j].x);

   diffX.ModSub(&startPointX, &pointBatchX[POINTS_BATCH_SIZE+i+j]);
   diffX.ModMulK1(&slope);

   pointBatchY[POINTS_BATCH_SIZE+i+j].Set(&startPointY);
   pointBatchY[POINTS_BATCH_SIZE+i+j].ModNeg();
   pointBatchY[POINTS_BATCH_SIZE+i+j].ModAdd(&diffX);
   }
   }

   // Process keys in optimized batches
   int localBatchCount = 0;
   for (int i = 0; i < fullBatchSize && localBatchCount < HASH_BATCH_SIZE; i++) {
   Point tempPoint;
   tempPoint.x.Set(&pointBatchX[i]);
   tempPoint.y.Set(&pointBatchY[i]);

   // Convert to compressed public key
   localPubKeys[localBatchCount][0] = tempPoint.y.IsEven() ? 0x02 : 0x03;
   for (int j = 0; j < 32; j++) {
   localPubKeys[localBatchCount][1 + j] = pointBatchX[i].GetByte(31 - j);
   }
   pointIndices[localBatchCount] = i;
   localBatchCount++;

   if (localBatchCount == HASH_BATCH_SIZE) {
   computeHash160BatchBinSingle(localBatchCount, localPubKeys, localHashResults);

   localComparedCount += HASH_BATCH_SIZE;

   __m256i target = _mm256_loadu_si256(reinterpret_cast<const __m256i*>(TARGET_HASH160_RAW.data()));

   for (int j = 0; j < HASH_BATCH_SIZE; j++) {
   __m256i result = _mm256_loadu_si256(reinterpret_cast<const __m256i*>(localHashResults[j]));

   int mask = _mm256_movemask_epi8(_mm256_cmpeq_epi8(result, target));

   const int HASH160_MASK = (1 << 20) - 1;

   if ((mask & HASH160_MASK) == HASH160_MASK) {
   bool fullMatch = true;
   for (int k = 0; k < 20; k++) {
   if (localHashResults[j][k] != TARGET_HASH160_RAW[k]) {
   fullMatch = false;
   break;
   }
   }

   if (fullMatch) {
   auto tEndTime = chrono::high_resolution_clock::now();
   globalElapsedTime = chrono::duration<double>(tEndTime - tStart).count();
   mkeysPerSec = (double)(globalComparedCount + localComparedCount) / globalElapsedTime / 1e6;

   Int foundKey;
   foundKey.Set(&currentKey);
   int idx = pointIndices[j];
   if (idx < POINTS_BATCH_SIZE) {
   Int offset; offset.SetInt32(idx);
   foundKey.Add(&offset);
   } else {
   Int offset; offset.SetInt32(idx - POINTS_BATCH_SIZE);
   foundKey.Sub(&offset);
   }

   string hexKey = foundKey.GetBase16();
   hexKey = string(64 - hexKey.length(), '0') + hexKey;

   lock_guard<mutex> lock(result_mutex);
   results.push(make_tuple(hexKey, total_checked.load(), flip_count));
   stop_event.store(true);
   return;
   }
   }
   }

   // Count this as one combination checked
   processed++;
   if(processed % 1000 == 0) {
   total_checked += 1000;
   }
   localBatchCount = 0;

   // Progress reporting - once per second
   auto now = chrono::high_resolution_clock::now();
   if (chrono::duration<double>(now - lastReportTime.load()).count() >= 1.0 ||
   processed == indices.size()) {

   globalElapsedTime = chrono::duration<double>(now - tStart).count();
   globalComparedCount += localComparedCount;
   localComparedCount = 0;
   mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

   double progress = min(100.0, (double)total_checked / total_combinations * 100.0);

   lock_guard<mutex> lock(progress_mutex);
   moveCursorTo(0, 9);

   cout << "Progress: " << fixed << setprecision(6) << progress << "%\n";
   cout << "Processed: " << total_checked << " / " << total_combinations << "\n";
   cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";
   cout << "Elapsed Time: " << formatElapsedTime(globalElapsedTime) << "\n";
   cout.flush();

   lastReportTime.store(now);
   }
   }
   }
   }
   total_checked += (processed % 1000);
}

void printUsage(const char* programName) {
   cout << "Usage: " << programName << " [options]\n";
   cout << "Options:\n";
   cout << " -p, --puzzle NUM Puzzle number to solve (default: 20)\n";
   cout << " -t, --threads NUM Number of CPU cores to use (default: all)\n";
   cout << " -f, --flips NUM Override default flip count for puzzle\n";
   cout << " -h, --help Show this help message\n";
   cout << "\nExample:\n";
   cout << " " << programName << " -p 38 -t 8 -f 21\n";
}

int main(int argc, char* argv[]) {
   signal(SIGINT, signalHandler);
   initConsole();

   // Parse command line arguments
   int opt;
   int option_index = 0;
   static struct option long_options[] = {
   {"puzzle", required_argument, 0, 'p'},
   {"threads", required_argument, 0, 't'},
   {"flips", required_argument, 0, 'f'},
   {"help", no_argument, 0, 'h'},
   {0, 0, 0, 0}
   };

   while ((opt = getopt_long(argc, argv, "p:t:f:h", long_options, &option_index)) != -1) {
   if (opt == -1) break;

   switch (opt) {
   case 'p':
   PUZZLE_NUM = atoi(optarg);
   if (PUZZLE_NUM < 20 || PUZZLE_NUM > 68) {
   cerr << "Error: Puzzle number must be between 20 and 68\n";
   return 1;
   }
   break;
   case 't':
   WORKERS = atoi(optarg);
   if (WORKERS < 1) {
   cerr << "Error: Thread count must be at least 1\n";
   return 1;
   }
   break;
   case 'f':
   FLIP_COUNT = atoi(optarg);
   if (FLIP_COUNT < 1) {
   cerr << "Error: Flip count must be at least 1\n";
   return 1;
   }
   break;
   case 'h':
   printUsage(argv[0]);
   return 0;
   default:
   printUsage(argv[0]);
   return 1;
   }
   }

   // Initialize timing at the very start
   tStart = chrono::high_resolution_clock::now();

   Secp256K1 secp;
   secp.Init();

   auto puzzle_it = PUZZLE_DATA.find(PUZZLE_NUM);
   if (puzzle_it == PUZZLE_DATA.end()) {
   cerr << "Error: Invalid puzzle number\n";
   return 1;
   }

   auto [DEFAULT_FLIP_COUNT, TARGET_HASH160_HEX, PRIVATE_KEY_DECIMAL] = puzzle_it->second;

   // Use override flip count if provided, otherwise use puzzle default
   if (FLIP_COUNT == -1) {
   FLIP_COUNT = DEFAULT_FLIP_COUNT;
   }

   TARGET_HASH160 = TARGET_HASH160_HEX;

   // Convert target hash to bytes
   for (size_t i = 0; i < 20; i++) {
   TARGET_HASH160_RAW[i] = stoul(TARGET_HASH160.substr(i * 2, 2), nullptr, 16);
   }

   // Set base key from decimal string
   BASE_KEY.SetBase10(const_cast<char*>(PRIVATE_KEY_DECIMAL.c_str()));

   // Verify base key
   Int testKey;
   testKey.SetBase10(const_cast<char*>(PRIVATE_KEY_DECIMAL.c_str()));
   if (!testKey.IsEqual(&BASE_KEY)) {
   cerr << "Base key initialization failed!\n";
   return 1;
   }

   if (BASE_KEY.GetBitLength() > PUZZLE_NUM) {
   cerr << "Base key exceeds puzzle bit length!\n";
   return 1;
   }

   // Calculate total combinations
   total_combinations = CombinationGenerator::combinations_count(PUZZLE_NUM, FLIP_COUNT);

   // Format base key for display
   string paddedKey = BASE_KEY.GetBase16();
   size_t firstNonZero = paddedKey.find_first_not_of('0');
   paddedKey = paddedKey.substr(firstNonZero);
   // Add 0x prefix
   paddedKey = "0x" + paddedKey;

   clearTerminal();
   // Print initial header
   cout << "=======================================\n";
   cout << "== Mutagen Puzzle Solver by Denevron ==\n";
   cout << "=======================================\n";
   cout << "Starting puzzle: " << PUZZLE_NUM << " (" << PUZZLE_NUM << "-bit)\n";
   cout << "Target HASH160: " << TARGET_HASH160.substr(0, 10) << "..." << TARGET_HASH160.substr(TARGET_HASH160.length()-10) << "\n";
   cout << "Base Key: " << paddedKey << "\n";
   cout << "Flip count: " << FLIP_COUNT << " ";
   if (FLIP_COUNT != DEFAULT_FLIP_COUNT) {
   cout << "(override, default was " << DEFAULT_FLIP_COUNT << ")";
   }
   cout << "\n";
   cout << "Total combinations: " << total_combinations << "\n";
   cout << "Using: " << WORKERS << " threads\n";
   g_threadPrivateKeys.resize(WORKERS, "0");
   vector<thread> threads;

   // Initialize random number generators for each thread with different seeds
   vector<mt19937_64> rngs(WORKERS);
   random_device rd;
   for (int i = 0; i < WORKERS; i++) {
   rngs[i].seed(rd() + i); // Different seed for each thread
   }

   // Distribute work with precise ranges
   size_t comb_per_thread = total_combinations / WORKERS;
   size_t remainder = total_combinations % WORKERS;

   vector<size_t> thread_starts(WORKERS+1);
   for(int i = 0; i < WORKERS; i++) {
   thread_starts[i] = i * comb_per_thread + min((size_t)i, remainder);
   }
   thread_starts[WORKERS] = total_combinations;

   // Launch threads
   for(int i = 0; i < WORKERS; i++) {
   threads.emplace_back(worker, &secp, PUZZLE_NUM, FLIP_COUNT, i,
   thread_starts[i], thread_starts[i+1], ref(rngs[i]));
   }

   for (auto& t : threads) {
   if (t.joinable()) t.join();
   }

   if (!results.empty()) {
   auto [hex_key, checked, flips] = results.front();
   globalElapsedTime = chrono::duration<double>(chrono::high_resolution_clock::now() - tStart).count();
   mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

   string compactHex = hex_key;
   size_t firstNonZero = compactHex.find_first_not_of('0');
   compactHex = "0x" + compactHex.substr(firstNonZero);

   cout << "=======================================\n";
   cout << "=========== SOLUTION FOUND ============\n";
   cout << "=======================================\n";
   cout << "Private key: " << compactHex << "\n";
   cout << "Checked " << checked << " combinations\n";
   cout << "Bit flips: " << flips << endl;
   cout << "Time: " << fixed << setprecision(2) << globalElapsedTime << " seconds ("
   << formatElapsedTime(globalElapsedTime) << ")\n";
   cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";

   // Save solution
   ofstream out("puzzle_" + to_string(PUZZLE_NUM) + "_solution.txt");
   if (out) {
   out << hex_key;
   out.close();
   cout << "Solution saved to puzzle_" << PUZZLE_NUM << "_solution.txt\n";
   } else {
   cerr << "Failed to save solution to file!\n";
   }
   } else {
   globalElapsedTime = chrono::duration<double>(chrono::high_resolution_clock::now() - tStart).count();
   mkeysPerSec = (double)globalComparedCount / globalElapsedTime / 1e6;

   cout << "\n\nNo solution found after checking all " << total_combinations << " combinations\n";
   cout << "Time: " << fixed << setprecision(2) << globalElapsedTime << " seconds ("
   << formatElapsedTime(globalElapsedTime) << ")\n";
   cout << "Speed: " << fixed << setprecision(2) << mkeysPerSec << " Mkeys/s\n";

   // Verify puzzle parameters
   cout << "\nVerification:\n";
   cout << "Puzzle: " << PUZZLE_NUM << "\n";
   cout << "Base Key: " << paddedKey << "\n";
   cout << "Target Hash160: " << TARGET_HASH160 << "\n";
   cout << "Flip Count: " << FLIP_COUNT << "\n";
   }

   return 0;
}

Tested on low puzzles.
For puzzle 68 code still fails (out of memory).

speed per core?

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 31/03/2025, 21:37:10 UTC

Quote from: nomachine on Today at 08:54:01 PM

Quote from: user_not_here on Today at 08:37:03 PM

=======================================
== Mutagen Puzzle Solver by Denevron ==
=======================================
Searching Puzzle 25 (25-bit)
Base Key: 0000000000...0000DC2A04
Target HASH160: 2f396b29b2...1835bd3dbb
Predicted Flip Count: 12 bits
Total Combinations: 5200300
Using 8 workers...
Using AVX2 optimizations...
Progress: 11.345499% (590002/5200300)

Progress: 0.000001% (284400000/36565184442289825) [44.420601 Mkeys/s]

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 30/03/2025, 19:00:43 UTC

Quote from: Denevron on Today at 04:25:31 PM

Quote from: b0dre on Today at 04:16:52 PM

Quote from: nomachine on Today at 01:47:29 PM

Quote from: Denevron on Today at 12:29:24 PM

Quote from: nomachine on Today at 12:18:24 PM

Quote from: Denevron on Today at 12:04:50 PM

can be called a mutagen Grin

It's not that simple, but it speeds up the process, you've noticed it yourself)

you can add it to Cyclon as a new method, and there you just don’t calculate the puzzle range, but enter the key by which the bits will change, and it’s better not in random mode, but in sequential mode, because the random mode will loop and that’s it, and you will be forever, say, chasing in 30 bits, and you won’t find it, and it will be, say, in 33 bits

Code:

import time
import multiprocessing
import secp256k1 as ice
import random
from math import comb
import numpy as np

# Configuration
TARGET_ADDR = "1HsMJxNiV7TLxmoF6uJNkydxPFDog4NQum" # Puzzle 20
BASE_KEY = "000000000000000000000000000000000000000000000000000000000005749f" # Puzzle 19 Private Key
PUZZLE_NUM = 20
WORKERS = multiprocessing.cpu_count()

# Historical flip counts
FLIP_TABLE = {
20:8, 21:8, 22:9, 23:9, 24:10, 25:11, 26:12, 27:13, 28:14, 29:15,
30:16, 31:17, 32:18, 33:19, 34:20, 35:21, 36:22, 37:23, 38:24, 39:25,
40:26, 41:27, 42:28, 43:29, 44:30, 45:31, 46:32, 47:33, 48:34, 49:35,
50:36, 51:37, 52:38, 53:39, 54:40, 55:41, 56:42, 57:43, 58:44, 59:45,
60:46, 61:47, 62:48, 63:49, 64:50, 65:51, 66:52, 67:53
}

def predict_flips(puzzle_num):
"""Predict flip count using linear regression with bounds"""
if puzzle_num in FLIP_TABLE:
return FLIP_TABLE[puzzle_num]

# Linear regression
x = np.array(list(FLIP_TABLE.keys()))
y = np.array(list(FLIP_TABLE.values()))
coeffs = np.polyfit(x, y, 1)
predicted = round(coeffs[0] * puzzle_num + coeffs[1])

# Apply bounds (50-60% of bits)
bit_length = puzzle_num
lower = max(8, int(bit_length * 0.5))
upper = min(int(bit_length * 0.6), bit_length-5)
return min(max(predicted, lower), upper)

def mutate_key(base_int, flip_positions):
"""Ultra-fast bit flipping using XOR mask"""
flip_mask = sum(1 << bit for bit in flip_positions)
return base_int ^ flip_mask

def worker(base_int, bit_length, flip_count, results, stop_event):
checked = 0
bit_positions = list(range(bit_length))
start_time = time.time()

while not stop_event.is_set():
# Generate random flip positions
flip_pos = random.sample(bit_positions, flip_count)

# Create mutated key
priv_int = mutate_key(base_int, flip_pos)
checked += 1

# Generate address
addr = ice.privatekey_to_address(0, True, priv_int)

if addr == TARGET_ADDR:
hex_key = "%064x" % priv_int
actual_flips = bin(base_int ^ priv_int).count('1')
results.put((hex_key, checked, actual_flips))
stop_event.set()
return

# Progress update
if checked % 10000 == 0:
elapsed = time.time() - start_time
speed = checked/elapsed if elapsed > 0 else 0
print(f"Checked {checked:,} | {speed:,.0f} keys/sec", end='\r')

def parallel_search():
bit_length = PUZZLE_NUM
base_int = int(BASE_KEY, 16)
flip_count = predict_flips(PUZZLE_NUM)
total_combs = comb(bit_length, flip_count)

print(f"\nSearching Puzzle {PUZZLE_NUM} (256-bit)")
print(f"Base Key: {BASE_KEY}")
print(f"Target: {TARGET_ADDR}")
print(f"Predicted Flip Count: {flip_count} bits")
print(f"Total Possible Combinations: 2^{int(np.log2(total_combs)):,}")
print(f"Using {WORKERS} workers...\n")

manager = multiprocessing.Manager()
results = manager.Queue()
stop_event = manager.Event()

start_time = time.time()

with multiprocessing.Pool(WORKERS) as pool:
for _ in range(WORKERS):
pool.apply_async(worker, (base_int, bit_length, flip_count, results, stop_event))

try:
hex_key, checked, actual_flips = results.get(timeout=86400) # 24h timeout
stop_event.set()

elapsed = time.time() - start_time
print(f"\nFound in {elapsed/3600:.2f} hours")
print(f"Private Key: {hex_key}")
print(f"Actual Bit Flips: {actual_flips}")
print(f"Keys Checked: {checked:,}")
print(f"Speed: {checked/elapsed:,.0f} keys/sec")

solution_file = f"puzzle_{PUZZLE_NUM}_solution.txt"
with open(solution_file, "w") as f:
f.write(hex_key)
print(f"\nSolution saved to {solution_file}")
return hex_key

except:
print("\nKey not found in 24 hours - try adjusting flip count ±2")
return None
finally:
pool.terminate()

if __name__ == "__main__":
flip_count = predict_flips(PUZZLE_NUM) # First calculate flip_count
print(f"Predicted flip count for {PUZZLE_NUM}: {flip_count} bits")
solution = parallel_search()

Here is my Python version.
I still think this is useless. If someone convinces me otherwise. Grin

you don't need to use random, so you won't find a match, only if you guess the number of bits Cheesy

Code:

import time
import multiprocessing
import secp256k1 as ice
from math import comb
import numpy as np
from itertools import combinations # For sequential flip generation

# Configuration
TARGET_ADDR = "1HsMJxNiV7TLxmoF6uJNkydxPFDog4NQum" # Puzzle 20
BASE_KEY = "000000000000000000000000000000000000000000000000000000000005749f" # Puzzle 19 Private Key
PUZZLE_NUM = 20
WORKERS = multiprocessing.cpu_count()

# Historical flip counts (unchanged)
FLIP_TABLE = {
   20:8, 21:8, 22:9, 23:9, 24:10, 25:11, 26:12, 27:13, 28:14, 29:15,
   30:16, 31:17, 32:18, 33:19, 34:20, 35:21, 36:22, 37:23, 38:24, 39:25,
   40:26, 41:27, 42:28, 43:29, 44:30, 45:31, 46:32, 47:33, 48:34, 49:35,
   50:36, 51:37, 52:38, 53:39, 54:40, 55:41, 56:42, 57:43, 58:44, 59:45,
   60:46, 61:47, 62:48, 63:49, 64:50, 65:51, 66:52, 67:53
}

def predict_flips(puzzle_num):
   """Predict flip count using linear regression with bounds (unchanged)"""
   if puzzle_num in FLIP_TABLE:
   return FLIP_TABLE[puzzle_num]

   # Linear regression
   x = np.array(list(FLIP_TABLE.keys()))
   y = np.array(list(FLIP_TABLE.values()))
   coeffs = np.polyfit(x, y, 1)
   predicted = round(coeffs[0] * puzzle_num + coeffs[1])

   # Apply bounds (50-60% of bits)
   bit_length = puzzle_num
   lower = max(8, int(bit_length * 0.5))
   upper = min(int(bit_length * 0.6), bit_length-5)
   return min(max(predicted, lower), upper)

def mutate_key(base_int, flip_positions):
   """Ultra-fast bit flipping using XOR mask (unchanged)"""
   flip_mask = sum(1 << bit for bit in flip_positions)
   return base_int ^ flip_mask

def worker(base_int, bit_length, flip_count, results, stop_event, start_index, end_index):
   """Modified worker for sequential search"""
   checked = 0
   bit_positions = list(range(bit_length))
   start_time = time.time()

   # Generate all combinations in the assigned range
   all_combinations = combinations(bit_positions, flip_count)

   # Skip to the start index (for parallelization)
   for _ in range(start_index):
   next(all_combinations, None)

   # Iterate through combinations sequentially
   for flip_pos in all_combinations:
   if stop_event.is_set():
   return

   if checked >= (end_index - start_index):
   return

   priv_int = mutate_key(base_int, flip_pos)
   checked += 1

   # Generate address
   addr = ice.privatekey_to_address(0, True, priv_int)

   if addr == TARGET_ADDR:
   hex_key = "%064x" % priv_int
   actual_flips = bin(base_int ^ priv_int).count('1')
   results.put((hex_key, checked, actual_flips))
   stop_event.set()
   return

   # Progress update
   if checked % 10000 == 0:
   elapsed = time.time() - start_time
   speed = checked/elapsed if elapsed > 0 else 0
   print(f"Checked {checked:,} | {speed:,.0f} keys/sec", end='\r')

def parallel_search():
   bit_length = PUZZLE_NUM
   base_int = int(BASE_KEY, 16)
   flip_count = predict_flips(PUZZLE_NUM)
   total_combs = comb(bit_length, flip_count)

   print(f"\nSearching Puzzle {PUZZLE_NUM} (256-bit)")
   print(f"Base Key: {BASE_KEY}")
   print(f"Target: {TARGET_ADDR}")
   print(f"Predicted Flip Count: {flip_count} bits")
   print(f"Total Possible Combinations: 2^{int(np.log2(total_combs)):,}")
   print(f"Using {WORKERS} workers...\n")

   manager = multiprocessing.Manager()
   results = manager.Queue()
   stop_event = manager.Event()

   start_time = time.time()

   # Split work into chunks for each worker
   chunk_size = total_combs // WORKERS
   ranges = [(i * chunk_size, (i + 1) * chunk_size) for i in range(WORKERS)]

   with multiprocessing.Pool(WORKERS) as pool:
   for start, end in ranges:
   pool.apply_async(worker, (base_int, bit_length, flip_count, results, stop_event, start, end))

   try:
   hex_key, checked, actual_flips = results.get(timeout=86400) # 24h timeout
   stop_event.set()

   elapsed = time.time() - start_time
   print(f"\nFound in {elapsed/3600:.2f} hours")
   print(f"Private Key: {hex_key}")
   print(f"Actual Bit Flips: {actual_flips}")
   print(f"Keys Checked: {checked:,}")
   print(f"Speed: {checked/elapsed:,.0f} keys/sec")

   solution_file = f"puzzle_{PUZZLE_NUM}_solution.txt"
   with open(solution_file, "w") as f:
   f.write(hex_key)
   print(f"\nSolution saved to {solution_file}")
   return hex_key

   except:
   print("\nKey not found in 24 hours - try adjusting flip count ±2")
   return None
   finally:
   pool.terminate()

if __name__ == "__main__":
   flip_count = predict_flips(PUZZLE_NUM)
   print(f"Predicted flip count for {PUZZLE_NUM}: {flip_count} bits")
   solution = parallel_search()

Here is the script. I don't see any difference here. To solve puzzle 68 you need impossible speed.

21 is not found!

For 21 you need to put the key from the 20th puzzle, set hash160 from 21 and put 9 bits - then you will find) those bits that are in the nomachine example, this is just an example, this is not the exact number of bits that need to be changed) Grin

Found, but now 22 is not.

Post

Topic

Board Bitcoin Discussion

Re: Bitcoin puzzle transaction ~32 BTC prize to who solves it

b0dre

on 30/03/2025, 16:16:52 UTC

Quote from: nomachine on Today at 01:47:29 PM

Quote from: Denevron on Today at 12:29:24 PM

Quote from: nomachine on Today at 12:18:24 PM

Quote from: Denevron on Today at 12:04:50 PM

can be called a mutagen Grin

Code:

Here is my Python version.
I still think this is useless. If someone convinces me otherwise. Grin

you don't need to use random, so you won't find a match, only if you guess the number of bits Cheesy

Code:

Here is the script. I don't see any difference here. To solve puzzle 68 you need impossible speed.

21 is not found!