Well it seems that each A100 is about 2.5 - 3 times faster than V100 so I won't even try for now. I have to ask my professor and it's going to be a pain accessing it anyway.
But it's interesting how computing power is basically the only thing required. I guess is just a matter of time until someone with access to a supercomputer might give it a try...
...and then get arrested like this guy?
https://www.itnews.com.au/news/csiro-it-contractor-spared-jail-for-mining-monero-on-supercomputer-553535 
On a more serious note, the software will have to improve to a level where it's not experimental before people start investing their resources into it.
What was the speed of your version? Wasn't it a lot slower than Jean Luc's?
So with your version people can store 256 bits in the work files, right? Is that the only difference?
If so, I am not sure why people are using yours, especially if it is slower than the original. If it is faster, I still don't know why they would use it for the puzzle since it will require more RAM/storage space.
Please enlighten me NotATether...I truly can't remember the speed and the differences in your version.