Has anyone played around with the launch config stuff for the TSIV version? I'm finding that 4x80 is far from optimal on certain systems. 6x60 gave me about a 25% boost on a GTX 770 and GTX 780, while on a GTX 860M, 4x40 more than tripled my performance (from 50 H/s to 170 H/s). It would be great to hear what others are seeing with the -l parameter.
Someone needs to come up with an autotune. Just sayin'...
NOTE: separate autotuning would be required for the 3 kernels of the algorithm.
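For what it's worth, here is a rough sketch of what a brute-force autotune could look like, written in Python just to show the idea. It is not how the miner works internally. Everything named here is an assumption for illustration: `measure` is a hypothetical callback that runs one kernel with a given blocks x threads config and returns the hashrate in H/s (raising RuntimeError if the launch fails), and the kernel names passed to `autotune_all` are placeholders. Since each of the 3 kernels would need its own tuning, the search is simply run once per kernel.

```python
from itertools import product


def autotune(measure, blocks_options, threads_options):
    """Try every blocks x threads combination and keep the fastest one.

    `measure(blocks, threads)` is a hypothetical benchmark callback that
    returns a hashrate in H/s, or raises RuntimeError if the launch
    config is not usable on this GPU.
    """
    best_config, best_rate = None, 0.0
    for blocks, threads in product(blocks_options, threads_options):
        try:
            rate = measure(blocks, threads)
        except RuntimeError:
            # Config rejected (e.g. too many resources requested): skip it.
            continue
        if rate > best_rate:
            best_config, best_rate = (blocks, threads), rate
    return best_config, best_rate


def autotune_all(kernels, blocks_options, threads_options):
    """Tune each kernel separately.

    `kernels` maps a (placeholder) kernel name to its own measure callback.
    Returns {name: ((blocks, threads), rate)}.
    """
    return {
        name: autotune(measure, blocks_options, threads_options)
        for name, measure in kernels.items()
    }
```

The winning pair for each kernel would then be formatted as BLOCKSxTHREADS, the same way the configs above are written (4x80, 6x60, 4x40). An exhaustive search like this is slow if the candidate lists are long, so in practice you would keep them short or stop early once the rate starts dropping.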