I think the fastest way to generate public keys on a CPU is to use the ice library.
Starting from a point P, I get the next 3.5 million points in 2 seconds.
That works out to about 1.75 million points per second, which is a terrible speed for generating the points of sequential keys on a CPU. A good implementation can reach at least 20 million pubKeys per second on a single thread, on a modest CPU.
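
To make the gap less mysterious: one common trick for sequential keys (an assumption on my part, nobody in this thread named a specific method) is to precompute a table of G, 2G, ..., nG once, then get P+G, P+2G, ..., P+nG with affine additions that all share a single modular inversion (Montgomery's batch inversion). Below is a rough pure-Python sketch of that idea on secp256k1. The names `batch_inverse`, `multiples_of_g` and `next_points` are made up for illustration, and Python itself will be nowhere near the speeds quoted above; it only shows where the savings come from.

```python
# Sketch only: sequential public keys via a precomputed table and batch inversion.
# Requires Python 3.8+ (pow with a negative exponent for modular inverse).

P = 2**256 - 2**32 - 977  # secp256k1 field prime
G = (0x79BE667EF9DCBBAC55A06295CE870B07029BFCDB2DCE28D959F2815B16F81798,
     0x483ADA7726A3C4655DA4FBFC0E1108A8FD17B448A68554199C47D08FFB10D4B8)


def add(p1, p2):
    """Plain affine addition/doubling (one inversion per call).
    The point at infinity is not handled in this sketch."""
    x1, y1 = p1
    x2, y2 = p2
    if x1 == x2:
        if (y1 + y2) % P == 0:
            raise ValueError("result is the point at infinity")
        lam = 3 * x1 * x1 * pow(2 * y1, -1, P) % P
    else:
        lam = (y2 - y1) * pow((x2 - x1) % P, -1, P) % P
    x3 = (lam * lam - x1 - x2) % P
    y3 = (lam * (x1 - x3) - y1) % P
    return (x3, y3)


def batch_inverse(values):
    """Invert every element mod P using a single modular inversion."""
    prefix = []
    acc = 1
    for v in values:
        prefix.append(acc)       # product of all earlier values
        acc = acc * v % P
    inv = pow(acc, -1, P)        # the only inversion; fails if any value is 0
    out = [0] * len(values)
    for i in range(len(values) - 1, -1, -1):
        out[i] = inv * prefix[i] % P
        inv = inv * values[i] % P
    return out


def multiples_of_g(n):
    """Return [1G, 2G, ..., nG]; computed once and reused for every batch."""
    table = [G]
    for _ in range(n - 1):
        table.append(add(table[-1], G))
    return table


def next_points(point, table):
    """Return [point + 1G, ..., point + nG].
    Assumes point is not +/- any table entry (no zero denominators)."""
    x1, y1 = point
    inverses = batch_inverse([(x2 - x1) % P for x2, _ in table])
    out = []
    for (x2, y2), inv in zip(table, inverses):
        lam = (y2 - y1) * inv % P
        x3 = (lam * lam - x1 - x2) % P
        out.append((x3, (lam * (x1 - x3) - y1) % P))
    return out
```

To walk a long range you would call `next_points` on the start point, then advance the start by nG (the last table entry) with one ordinary `add` and repeat. The cost per key drops to a handful of field multiplications plus a small share of one inversion, which is the kind of saving a tuned C or assembly implementation builds on.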