Can you add to your table what the changes are between each line?
Sure! I have updated my post accordingly.
Did you try on Windows?
I haven't, because I do not have a Windows rig, and I likely will not test this because I do not want to reimage or deal with Windows taking over my boot record.

See my diff in a previous post above for the changes I made. If you are capable of compiling this, I'd be very curious to see the results.
Must say I am a bit surprised by the 23 MHash/s. You should run a little longer to make sure everything is stable.
Configurations with the highest hashrates were stable enough to run in the sense that the program would not crash, however they were not stable enough to provide valid shares. For instance, 384 blocks x 768 threads @ 23213 khash/s attempted 27 shares, but only 16 were valid (less than half that of the 550x768 config).
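
As a rough way to compare configurations, the raw hashrate can be discounted by the fraction of shares that were valid. The sketch below is only an illustration: it assumes useful work scales linearly with the valid-share ratio, and the function name `effective_hashrate` is hypothetical, not part of the miner.

```python
def effective_hashrate(raw_khash_per_s, valid_shares, attempted_shares):
    """Estimate useful hashrate by scaling the raw rate by the valid-share ratio.

    Assumption: invalid shares represent wasted work in direct proportion,
    which is a simplification for a small sample of shares.
    """
    if attempted_shares <= 0:
        raise ValueError("attempted_shares must be positive")
    return raw_khash_per_s * valid_shares / attempted_shares


# Figures from the 384 blocks x 768 threads run above.
rate = effective_hashrate(23213, 16, 27)
print(f"{rate:.0f} khash/s effective")
```

With only 27 attempted shares the sample is small, so this number is a loose estimate and a longer run would be needed to trust it.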