... and you can get 27,000,000 on a high-end CPU.
Actually, some get more than 100 Mkeys/s.

By the way, my first tests also started at around 50 thousand keys/s.
I finished a first version for GPU and I get 250,000 keys per second for now.
Please don't start writing GPU code when you haven't even reached the maximum on CPU; shitty code on CPU will only lead to shitty code on GPU.