I bet you are wrong. Just check everything carefully and you will find a bug

I wouldn't believe it either, so that's fine. Everything is already checked, counted, and verified to be correct.
I verified it also for higher bit ranges, that jump on GPU.
Those figures are the total op count including DP overhead (complexity excludes DP overhead).
And I'm using more than 3 kangaroos. Pure math and skills, you should understand that.
But don't worry, I don't care anymore about this problem, so have fun with your experiment. I inserted a SHA hash in my older conjecture that this can be solved in ~1.0 sqrt, one day I might reveal what it contained.