Hi guys,
I have 2 exactly the same rig, one of them is stable as rock, the other is freezing 0.5 - 2 a day I don't know why. Please help me to find a soultion. I have tried what you wrote before but nothing changed. I set up the reboot script but in this situation it is not working. Thank you in advance!
http://ballai.hu/image1.jpghttp://ballai.hu/image2.pngThat seems like a hardware issue. When it comes to the watchdog with saying which one the problem is with it is usually the first one. Looking at the pictures I would double check your risers for each slot and maybe even re-mount the one in GPU slot 0. (not always in the physical to logical slots depending on how the motherboard booted the PCI-e lanes) If you are not sure which one is which, use the nvidia-smi and change the fan speed to 0 (when not mining) to find out which slot is which for the card.
What is your overclock on the second rig for all the GPUs?