Well boys, I honk I may have just fixed RTX 30xx compatibility


No debug switch used - speed is awful because I'm using tiny -t -p and -b settings. I didn't want to wait several minutes to see if it worked.
That being said, there's no optimization in this program either. I guess the optimization was too aggressive and it broke things. Maybe I will inch it up a few increments to see the highest that'll work without crashing.
Code and benchmarks tomorrow - I'm dead tired now.