Interesting, that a new version is FASTER on a top-tier GPU like 5090. I have seen 9Gkeys/s On 5090 with —grid 1024,512. But another version of 5090 is slower, speed around 8.2-8.3 Gkeys/s. It depends on power consuption limit.
But I will check speed differences between versions. 4060 speed is the same both version.
I just wanted to share my experience after testing both optimized tools.
In my comparison,
[VanitySearch/fixedpaul] runs noticeably faster than the
[CudaCyclone/Your Tool].
The difference in speed really stands out, and it makes a big impact on performance.
