Yes, we need the optimisation desperately. On average I am getting 1.2-1.3mh over a longer period of time (from initial 1.4mh) versus 3mh+ Wolf is getting (he has improved over 2.2mh and I am not sure if he is sharing his private miner to a few privileged). That is less than half of the hash rate of what Wolf's private miner can do. This is by no mean a small amount! With the current exchange rate of 0.000024, we are all losing a lot when mining with the public miner.
Why is this thread full of crying people? This is really pathetic! Yes, it is a significant difference in performance, but it is not devastating. You just mine 50% of what he does with the same HW, this is still fine. Instead of crying and waiting for everything served on a silver plate, you can:
1) dedicate your own time to learn CUDA programming, run NvProfiler and do your own optimizations, like Wolf0 did
2) pay someone to do optimizations for you
3) buy twice more HW you have now