Board: Development & Technical Discussion
Topic: Re: Solving ECDLP with Kangaroos: Part 1 + 2 + RCKangaroo
by kTimesG on 08/02/2025, 21:59:16 UTC
Code:
./RCKangaroo
********************************************************************************
*                    RCKangaroo v3.0  (c) 2024 RetiredCoder                    *
********************************************************************************

This software is free and open-source: https://github.com/RetiredC
It demonstrates fast GPU implementation of SOTA Kangaroo method for solving ECDLP
Linux version
CUDA devices: 1, CUDA driver/runtime: 12.8/12.5
GPU 0: NVIDIA GeForce RTX 5090, 31.36 GB, 170 CUs, cap 12.0, PCI 33, L2 size: 98304 KB
Total GPUs for work: 1

BENCHMARK MODE

Solving point: Range 78 bits, DP 16, start...
SOTA method, estimated ops: 2^39.202, RAM for DPs: 0.547 GB. DP and GPU overheads not included!
Estimated DPs per kangaroo: 9.236.
GPU 0: allocated 3176 MB, 1044480 kangaroos. OldGpuMode: No
GPUs started...
BENCH: Speed: 9393 MKeys/s, Err: 0, DPs: 2848K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9382 MKeys/s, Err: 0, DPs: 4281K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9351 MKeys/s, Err: 0, DPs: 5713K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9340 MKeys/s, Err: 0, DPs: 7147K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9294 MKeys/s, Err: 0, DPs: 8565K/9646K, Time: 0d:00h:01m/0d:00h:01m
BENCH: Speed: 9294 MKeys/s, Err: 0, DPs: 9983K/9646K, Time: 0d:00h:01m/0d:00h:01m
Stopping work ...
Point solved, K: 1.345 (with DP and GPU overheads)

Points solved: 1, average K: 1.345 (with DP and GPU overheads)

Solving point: Range 78 bits, DP 16, start...
SOTA method, estimated ops: 2^39.202, RAM for DPs: 0.547 GB. DP and GPU overheads not included!
Estimated DPs per kangaroo: 9.236.
GPU 0: allocated 3176 MB, 1044480 kangaroos. OldGpuMode: No
GPUs started...
BENCH: Speed: 9294 MKeys/s, Err: 0, DPs: 1386K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9289 MKeys/s, Err: 0, DPs: 2805K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9294 MKeys/s, Err: 0, DPs: 4222K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9258 MKeys/s, Err: 0, DPs: 5638K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9258 MKeys/s, Err: 0, DPs: 7056K/9646K, Time: 0d:00h:00m/0d:00h:01m
BENCH: Speed: 9309 MKeys/s, Err: 0, DPs: 8474K/9646K, Time: 0d:00h:01m/0d:00h:01m
BENCH: Speed: 9289 MKeys/s, Err: 0, DPs: 9909K/9646K, Time: 0d:00h:01m/0d:00h:01m
Stopping work ...
Point solved, K: 1.320 (with DP and GPU overheads)

Points solved: 2, average K: 1.333 (with DP and GPU overheads)

...

Points solved: 7, average K: 1.734 (with DP and GPU overheads)

Hypothetical scenario: an RTX 5090 can do at least 13.0 G jumps/s at DP 32. Are there plans to improve RCKangaroo, or is 9.3 GKeys/s still a "very good" speed compared to an optimized version?

I am disappointed in the 5090 so far: I got at most a 20% speedup compared to the 4090.
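
For anyone who wants to cross-check the numbers in the log, here is a rough sketch in Python. It only reuses figures printed above (estimated ops, DP bits, kangaroo count, roughly 9.3 GKeys/s) plus the hypothetical 13.0 G jumps/s. The names are my own and not taken from the RCKangaroo source, and it assumes a point is distinguished with probability 2^-DP, ignoring the DP and GPU overheads.

```python
# Figures copied from the benchmark log above.
EST_OPS_LOG2 = 39.202        # "estimated ops: 2^39.202"
DP_BITS = 16                 # "DP 16"
KANGAROOS = 1_044_480        # "1044480 kangaroos"
MEASURED_SPEED = 9.3e9       # about 9300 MKeys/s from the BENCH lines
HYPOTHETICAL_SPEED = 13.0e9  # the hypothetical 13.0 G jumps/s


def expected_dps(est_ops_log2, dp_bits):
    # Assumption: each jump lands on a distinguished point
    # with probability 2^-dp_bits.
    return 2.0 ** (est_ops_log2 - dp_bits)


def summary():
    est_ops = 2.0 ** EST_OPS_LOG2
    dps = expected_dps(EST_OPS_LOG2, DP_BITS)
    return {
        "expected_dps": dps,
        "dps_per_kangaroo": dps / KANGAROOS,
        # Overheads are not included, same as the log's own estimate.
        "seconds_at_measured_speed": est_ops / MEASURED_SPEED,
        "hypothetical_over_measured": HYPOTHETICAL_SPEED / MEASURED_SPEED,
    }


if __name__ == "__main__":
    for name, value in summary().items():
        print(f"{name}: {value:.3f}")
```

The first two values land close to the 9646K DPs and 9.236 DPs per kangaroo printed in the log, the estimated run time comes out at roughly a minute, and the gap between 13.0 and 9.3 is about 40%.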