Hello,
I do need a help. I'm running few rigs under ethOS on RX480 8GB. So my issue is, that on every rig in diffrent time I got problem "WATCHDOG: GPU 0 hangs in OpenCL call, exit" (different cards on different rigs and getting this error in different time in different rigs - from 60 minuts after start till few hours/days). So the problem is, that after getting message "Restarting OK, exit..." theoretically Claymore should restart, but I'm just getting "ssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssss" message all the time, while rig is on.
Does anybody know what issue it could be? For sure the matter of this message is raiser and overlocking (in some cases I'm gettin this problem with not overlocked cards)
I use Ubuntu 16.04.3 and I have this kind of problem, linked to overclocking and similar instability problems.
The problem is that it is locking something hardware, and the kernel cannot stop the thread, nor even shutdown properly.
I used
a USB watchdog, but recently i found that my Asrock H81 ProBTC
include an iTCO watchdog that is just to be configured with linux watchdog.
for restart I use the restart.sh with a shutdown -r (check thay you have a restart.sh that works, show trace to see if it is called).
if your kernel is just stuck while rebooting, try to configure the watchdog of your motheboard if there is on, or acquire a USB watchdog.
Maybe it is already configured under EthOS