I've added a few more performance tweaks and pushed the kHash overflow fix to GitHub.
I have not pushed the other performance tweaks because they hard-code 64-bit mode for some of the hash algorithms. For the sake of openness, here are the changes:
keccak.c:
#define SPH_KECCAK_64 1
#define SPH_KECCAK_UNROLL 0
#define SPH_KECCAK_NOCOPY 0
metis.c:
#define SPH_KECCAK_NOCOPY 0
#define SPH_64 1
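For anyone who wants the same effect without hard-coding 64-bit mode, here is a rough sketch of a guarded version. It is not what is in the repository. BUILD_IS_64BIT and keccak_64bit_enabled are hypothetical names made up for illustration, and the sketch assumes the hash sources test SPH_KECCAK_64 with #if rather than #ifdef, so that a value of 0 switches the 64-bit path off.

```c
/* Sketch only: pick the 64-bit Keccak path at compile time instead of
   hard-coding it. BUILD_IS_64BIT is a hypothetical helper macro. */
#if defined(_WIN64) || defined(__LP64__)
#define BUILD_IS_64BIT 1   /* 64-bit Windows, or an LP64 target (64-bit Linux, macOS) */
#else
#define BUILD_IS_64BIT 0   /* assume a 32-bit target otherwise */
#endif

/* Respect a value passed on the compiler command line; otherwise follow
   the detected target. */
#ifndef SPH_KECCAK_64
#define SPH_KECCAK_64 BUILD_IS_64BIT
#endif

/* These two tweaks do not depend on the word size, so they keep the
   values listed above unless overridden. */
#ifndef SPH_KECCAK_UNROLL
#define SPH_KECCAK_UNROLL 0
#endif
#ifndef SPH_KECCAK_NOCOPY
#define SPH_KECCAK_NOCOPY 0
#endif

/* Hypothetical helper so a program can report which path was compiled in. */
int keccak_64bit_enabled(void)
{
    return SPH_KECCAK_64;
}
```

On a 32-bit build this falls back to SPH_KECCAK_64 0 instead of failing, and the other two settings stay as posted. The same pattern could probably be applied to SPH_64 in metis.c, but I have not checked how that macro is used elsewhere.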
I've also compiled the code with Visual Studio 2010 using profile-guided optimization, and the Windows builds now regularly meet or exceed the jh00 speeds.
I've built two versions: one with the /favor:INTEL64 flag and one with /favor:AMD64. Pick whichever matches your processor.
Edit: AMD users! Try the favorAMD build first, then the favorBLEND build. Apparently /favor:AMD64 targets older AMD processors (Athlon, Opteron, and Sempron). See which works best for you!
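If you are not sure which vendor your processor reports, the CPUID vendor string will tell you. The sketch below reads it and maps it to a build name. It is only an illustration: cpu_vendor and suggest_build are hypothetical helpers, "favorINTEL64" is an assumed label for the Intel build, and the result is just a starting point, since the favorBLEND build may still be faster on a given chip.

```c
#include <string.h>

/* CPUID is only available on x86/x64; elsewhere the vendor stays empty. */
#if defined(_MSC_VER) && (defined(_M_X64) || defined(_M_IX86))
#include <intrin.h>
#define HAVE_CPUID 1
#elif defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
#include <cpuid.h>
#define HAVE_CPUID 1
#else
#define HAVE_CPUID 0
#endif

/* Fill out with the 12-character CPUID vendor string, or "" if unavailable. */
void cpu_vendor(char out[13])
{
    unsigned int r[4] = {0, 0, 0, 0};   /* EAX, EBX, ECX, EDX */
    memset(out, 0, 13);
#if HAVE_CPUID
#if defined(_MSC_VER)
    __cpuid((int *)r, 0);
#else
    __get_cpuid(0, &r[0], &r[1], &r[2], &r[3]);
#endif
    memcpy(out + 0, &r[1], 4);          /* EBX */
    memcpy(out + 4, &r[3], 4);          /* EDX */
    memcpy(out + 8, &r[2], 4);          /* ECX */
#else
    (void)r;
#endif
}

/* Map a vendor string to the build to try first. The Intel name is an
   assumed label; unknown vendors fall back to the blend build. */
const char *suggest_build(const char *vendor)
{
    if (strcmp(vendor, "GenuineIntel") == 0) return "favorINTEL64";
    if (strcmp(vendor, "AuthenticAMD") == 0) return "favorAMD";
    return "favorBLEND";
}
```

To use it, call cpu_vendor on a 13-byte buffer, pass the result to suggest_build, and then benchmark that build against favorBLEND.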
Download Links:
https://mega.co.nz/#F!YsZSHYKA!IC8LK_MBGwqC-gWOpO7zoQ
https://www.dropbox.com/sh/wtxvxvkirxax2vj/9P_Rxb9V1y (dropbox mirror)
Thanks! A 20Kh/s boost.