Is anybody having issues monitoring the health of their rig?
I am experiencing continual lock-ups in cast-xmr (and stak, for that matter) when I'm running hwinfo64 or gpu-z to monitor the health of the rig.
It's imperative to know temperatures on the hbm and monitor fan speeds.
4 GPU + single PSU is OK. On 8 GPU + 2 PSU the lock-ups are frequent (every hour or two).
Any other options?
hwinfo64 works fine with vega, no speed decrease or lockups. Just disable GPU I2C Support in settings.
I've been reading through the threads trying to glean a little knowledge and it's things like this that light me up.
I've got the latest stable hwinfo64 5.60-3280. When simply disabling GPU I2C support (that single check-box), my hash-rate drops. Any other options I should be adding/dropping? ... I think that it might be working now if I keep hwinfo64 open, reset my vegas, and then check temps. Is that what you mean?
Also, since we're talking temps, you said you set the target to 50. Is that via Wattman in Radeon Settings? For some reason I was under the impression that the temp targets in Wattman weren't effective and that it all had to be done manually. Yes, I'm talking LC here. I'm also interested in keeping fans down if possible. You said HBM temp 65-67 C. Would you say over 70 is dangerous?
I use OverdriveNtool to set target temp.
I currently clock HBM high (1150 MHz), so I keep memory temp below 65 C with proper fan settings.
As for the hwinfo64 issues you have ... well ... I currently use hw64_559_3270 ... haven't tried the most recent yet.
Make sure that other stuff isn't responsible for the hashrate drop. It can be Windows power saver options or other hardware-related utilities (like the nvidia utilities in Task Scheduler, in my case before).
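If you'd rather not watch the sensor window all day, here is a rough sketch of checking a logged CSV against the "keep memory temp below 65" rule mentioned above. It assumes your monitoring tool can write its readings to a CSV file. The column name below is made up for illustration, so check the header row of your own log and change it. The function name is mine too, not part of any tool.

```python
import csv
import io

# Assumption: this column name is hypothetical; use the header from your own log.
HBM_COLUMN = "GPU Memory Temperature [C]"
HBM_LIMIT_C = 65.0  # the "keep memory temp below 65" figure from this thread


def find_hot_samples(csv_text, column=HBM_COLUMN, limit=HBM_LIMIT_C):
    """Return (row_number, temperature) for every sample at or above limit."""
    hot = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for row_number, row in enumerate(reader, start=1):
        raw = (row.get(column) or "").strip()
        try:
            temp = float(raw)
        except ValueError:
            continue  # skip blank, missing, or non-numeric cells
        if temp >= limit:
            hot.append((row_number, temp))
    return hot


if __name__ == "__main__":
    # Made-up sample data, only to show the expected input shape.
    sample = (
        "Time,GPU Memory Temperature [C]\n"
        "12:00:00,62\n"
        "12:00:02,66\n"
        "12:00:04,\n"
        "12:00:06,71\n"
    )
    for row_number, temp in find_hot_samples(sample):
        print(f"sample {row_number}: HBM at {temp:.0f} C, at or above {HBM_LIMIT_C:.0f} C")
```

To use it on a real log, read the file into a string and pass it in, with `column` set to whatever your tool calls the HBM temperature. Checking a log after the fact also means nothing extra is polling the GPUs while you mine.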