Excellent job

please give us the compiled beta version for various benchmarks. it may helps you to get things done..
Compiled binary? Pffft. Then the next guy won't be able to speed up Bitcrack

I'll just put the source on GitHub once I'm find the optimal level of tuning.
Source code is useless for me as i am a noob in those programming Except little bit in python and can't able to compile too
Whether you're using Linux or Windows the compilation process is actually straightforward. On Linux you just open a terminal and run
make BUILD_CUDA=1, and I have an autodetection script that will detect the compute cap of the GPUs.
On Windows you go to Visual Studio > Build Solution, and I believe there's an environment variable for the CUDA compute cap you can set from the Properties, however I haven't compiled Windows stuff in a while so I need to recheck where it is.
In all seriousness though, it is better not to make a compiled binary because assembly for all of the CUDA compute caps have to be included in the binary and that takes a long time to make, longer than just building against one compute cap.
(Though I think I see your point about Windows compilation, VS
is a pain in the ass to download, so maybe there will be a compiled Windows version, but for Linux this is infeasible unless I find out how to make an AppImage).
I don't care if I use it later or earlier. There are many people in our community who can compile it in the Windows version. I will use it after that, I am very happy to see that People like you share their works and help each other. This is definitely the identity of a good community. I really appreciate all of you Those who carry out such a difficult tasks. hats off