so it's also really, really slow. Probably less than 100 MK/s even on the best GPU that exists.
The speeds you see in KeyHunt(-CUDA) are rates of public key additions, not of deriving public keys from different "random" private keys.
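To make the difference concrete, here is a rough sketch in plain Python (3.8+ for `pow(x, -1, p)`). It is not how KeyHunt or any GPU tool is implemented; the helper names `point_add`, `scalar_mult` and `sequential_keys` are made up for illustration, and only the secp256k1 constants are standard. It just counts group operations: a "random" private key needs a full double-and-add, while stepping to the next consecutive key costs a single addition of G.

```python
# secp256k1 domain parameters (standard public constants).
P = 2**256 - 2**32 - 977
N = 0xFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFEBAAEDCE6AF48A03BBFD25E8CD0364141
G = (0x79BE667EF9DCBBAC55A06295CE870B07029BFCDB2DCE28D959F2815B16F81798,
     0x483ADA7726A3C4655DA4FBFC0E1108A8FD17B448A68554199C47D08FFB10D4B8)


def point_add(a, b):
    """Affine addition on y^2 = x^3 + 7 (mod P). None is the point at infinity."""
    if a is None:
        return b
    if b is None:
        return a
    (x1, y1), (x2, y2) = a, b
    if x1 == x2 and (y1 + y2) % P == 0:
        return None  # a == -b
    if a == b:
        lam = 3 * x1 * x1 * pow(2 * y1, -1, P) % P  # tangent slope (doubling)
    else:
        lam = (y2 - y1) * pow(x2 - x1, -1, P) % P   # chord slope
    x3 = (lam * lam - x1 - x2) % P
    y3 = (lam * (x1 - x3) - y1) % P
    return (x3, y3)


def scalar_mult(k, point=G):
    """Naive double-and-add. Returns (k*point, number of point_add calls)."""
    result, addend, ops = None, point, 0
    while k:
        if k & 1:
            result = point_add(result, addend)
            ops += 1
        addend = point_add(addend, addend)
        ops += 1
        k >>= 1
    return result, ops


def sequential_keys(start, count):
    """Public keys for start, start+1, ...: one scalar_mult, then one add per key."""
    pub, _ = scalar_mult(start)
    out = []
    for _ in range(count):
        out.append(pub)
        pub = point_add(pub, G)  # a single group operation per new key
    return out


if __name__ == "__main__":
    k = 0x1F3A5C7E9B2D4F6081A3C5E7092B4D6F8E1A3C5E7092B4D6F8E1A3C5E7092B4D
    _, ops = scalar_mult(k)
    print("group operations for one random 256-bit key:", ops)
    print("group operations per key when stepping by +G: 1")
```

Real tools use faster tricks than this naive loop (precomputed tables, batched modular inversion), so the true gap is different, but the asymmetry between "add G" and "multiply a fresh scalar" is the point.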
There is a script here with hashing very similar to cpuminer. I have achieved results up to 2× faster than VanitySearch or KeyHunt using _mm256 and AVX on the latest AMD processors. However, it is still slow, on the order of several thousand years.
https://bitcointalk.org/index.php?topic=5532654.msg65125574#msg65125574
I bet that Frozen guy and NoMachine are hiding something else. There must be something faster.
