There is no instant solution and it all boils down to the fact that you have to know C++ or some other compiled language like Rust........
Khm... Is there Python ECDSA (for secp256k1) library works with CUDA cores? I searched and didn't find.
You have to write your own.
I don't like the existing solution in C++ either.
Whether it is SECP256K1.cpp or OpenSSL
There are many calls through the various libraries. My goal is to do it all from one cpp file.