Testing the code which posted was on github with the fixes for the pool (maxpool.1gh) and compiled on windows with my gtx660 it works.
For the moment 43 accepted 7 rejected
yoda
Can you
please post the binary? I also have a gtx660.

Thank you.
Yeah please share, im on gtx770