I use nvidia smi to poll power draw and temps of my cards. If they go below 50w, write the rig name to a dropbox txt file and restart. Every hour, it checks any card goes above 80 degrees, if yes, write rig name to another dropbox txt file.
Crude solution requiring some coding but is free and works for me