Submitted another speedup in quark. More is coming.
I will improve the bandwidth of the hashes, since half of the buffers are not used. Then the intensity can be increased.
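To put rough numbers on why freeing unused buffers would leave room for a higher intensity, here is a minimal back-of-the-envelope sketch. It is not taken from the miner's source. It assumes the thread count is 2^intensity, that each thread keeps one 64-byte (512-bit) hash per buffer, and that the buffer counts (4 versus 2) and the 512 MiB budget are purely hypothetical figures for illustration.

```python
# Rough estimate only: every constant here is an assumption, not miner code.
HASH_BYTES = 64  # assumed: one 512-bit intermediate hash per thread per buffer


def buffer_bytes(intensity: int, num_buffers: int) -> int:
    """Memory used by the hash buffers, assuming threads = 2**intensity."""
    threads = 1 << intensity
    return threads * HASH_BYTES * num_buffers


def max_intensity(budget_bytes: int, num_buffers: int) -> int:
    """Largest intensity whose buffers still fit in the budget (0 if none fit)."""
    intensity = 0
    while buffer_bytes(intensity + 1, num_buffers) <= budget_bytes:
        intensity += 1
    return intensity


if __name__ == "__main__":
    budget = 512 * 1024 * 1024  # hypothetical 512 MiB set aside for hash buffers
    for num_buffers in (4, 2):  # hypothetical: before and after dropping unused buffers
        print(num_buffers, "buffers -> max intensity", max_intensity(budget, num_buffers))
```

Under these assumptions, halving the number of buffers halves the footprint at any given intensity, so the same memory budget fits one extra intensity step (twice as many threads).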
UNUSED BUFFERS--
Do unused buffers have anything to do with the poor performance of the GTX 960/970 relative to the GTX 750ti on Lyra2? Or is it the memory controller? Or both? My 4GB GTX 960 is slower than a 2GB 750ti when mining Lyra2, which suggests the code is not tuned to make use of the card's capacity. Both cards are in a Win 7 x64 system.
DJM34, don't be afraid to chime in, you are the code master of Lyra2. --scryptr