I think I have the same issue (0.5.1). 6 x Vega 56 with timings run for maybe an hour and then crash.
Watchdog GPU 0: stuck in enqueue, reporting.
GPU 0: detected DEAD (03:00.0), will execute restart script watchdog.bat
[2019-06-12 11:45:51] GPU 5: detected DEAD (03:00.0), no restart script configured, will continue mining.
Bus 3? What about the others with the same problem?
H110 d3a mobo, bus 3 is the x16 slot on this mobo. igfx is turned on as primary display adapter.
It's always the same card/bus. I'll try swapping it with another Vega and see what happens.
BTW, where can I find some info about setting up a watchdog script (bat)? For example, how to send runtime commands to teamredminer, or how to get the ID of the running miner?
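Not an official answer, but here is a minimal watchdog.bat sketch using only standard Windows commands (tasklist, taskkill, timeout, start). It assumes the miner process is named teamredminer.exe and that you launch it from your own script; start_miner.bat is a hypothetical name for that script. It does not send runtime commands to the miner, it only finds the process and restarts it.

```bat
@echo off
:: watchdog.bat - minimal sketch, meant to be run when a GPU is reported DEAD.
:: Assumption: start_miner.bat is a hypothetical name for your own launch script.

:: Show the running miner, including its PID (the "id" of the running miner).
tasklist /FI "IMAGENAME eq teamredminer.exe"

:: Force-kill the miner, wait 10 seconds, then launch it again in a new window.
taskkill /F /IM teamredminer.exe
timeout /t 10 /nobreak >nul
start "" cmd /c start_miner.bat
```

If the driver itself is hung, killing the process may not be enough. In that case, replacing the last three commands with a reboot such as shutdown /r /f /t 30 is a common fallback.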
I got bored last night and wrote this, in case anyone wants a setup template:
@echo off
set ALGO=
set POOL=
set PORT=
set WALLET=
set PASSWORD=x
set DEVICES=
set INTENSITY=
:: !! optional: create logfile(s)? set LOG=YES
set LOG=YES
:: !! optional: reorder GPUs according to bus number. set REORDER=YES
:: !! NOTE! INTENSITY and DEVICES will then refer to the reordered GPU numbering.
set REORDER=
:: !! optional: works only for cryptonote algos and if pool allows! Otherwise leave blank. (doesn't work with Nicehash)
set RIGNAME=
set DIFFICULTY=
:: !! optional: name of mining pool. (for logfile purposes only. Can be left blank.)
set POOLNAME=
:: !! optional: pause for error message? set PAUSE=YES (Prevents the command window from closing if you have a problem launching the miner.)
set PAUSE=
:: --------------------Change below settings at own risk!--------------------------
set GPU_MAX_ALLOC_PERCENT=100
set GPU_SINGLE_ALLOC_PERCENT=100
set GPU_MAX_HEAP_SIZE=100
set GPU_USE_SYNC_OBJECTS=1
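:: NOTE: these offsets assume %date% looks like "Wed 06/12/2019" (US format). Adjust for other locales.
set CUR_YYYY=%date:~10,4%
set CUR_MM=%date:~4,2%
set CUR_DD=%date:~7,2%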
if defined PORT set PORT=:%PORT%
if defined DEVICES set DEVICES=-d %DEVICES%
if defined RIGNAME set RIGNAME=--rig_id %RIGNAME%
if defined DIFFICULTY set DIFFICULTY=.%DIFFICULTY%
if not exist LOGS\%POOLNAME% mkdir LOGS\%POOLNAME%
if "%LOG%"=="YES" set LOG=--log_file=LOGS\%POOLNAME%\LOG_%POOLNAME%_%CUR_YYYY%.%CUR_MM%.%CUR_DD%_%ALGO%.txt
if "%REORDER%"=="YES" set REORDER=--bus_reorder
@echo on
@if defined INTENSITY set INTENSITY=--cn_config=%INTENSITY%
teamredminer.exe -a %ALGO% -o %POOL%%PORT%%DIFFICULTY% -u %WALLET% -p %PASSWORD% %REORDER% %DEVICES% %INTENSITY% %LOG% %RIGNAME%
@if "%PAUSE%"=="YES" (
pause
) else (
exit
)
I actually just wanted some structure in my log files for troubleshooting these dead GPU issues, but ended up with this.
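Once the logs are collected under LOGS\, a quick way to check whether it is always the same GPU/bus that dies is to search them with findstr. This is a small sketch, assuming the log lines look like the "detected DEAD" lines quoted earlier in the thread and that it is run from the folder containing LOGS:

```bat
@echo off
:: List every "detected DEAD" line from all .txt logs under LOGS.
:: /S searches subfolders too, /C:"..." treats the text as one literal string.
:: Each match is prefixed with the name of the log file it came from.
findstr /S /C:"detected DEAD" LOGS\*.txt
pause
```

Since the file names contain the date and algo, the output shows at a glance when each dead GPU was reported and on which bus.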
