Hi
I could possibly convert this code using my own secp256k1 library for CUDA (based on Jean Luc Pons's Kangaroo).
It can achieve up to 70M scalar multiplications per second on an RTX 3070.
A Python implementation with PyCUDA could be done easily.
But sorry for this trivial question:
What is the purpose of this script?
Regards
Fanch