Search content
Sort by

Showing 4 of 4 results by thebeardsman
Post
Topic
Board Mining support
Re: S17 Pro Issues
by
thebeardsman
on 30/01/2022, 19:51:42 UTC
I believe the problem is from flaky solder connections, these miners are plagued with them. I have found the problems tend to show up more often when cold, then when you warm them up thermal expansion can close the flaky connection enough for the miner to run. You may be able to limp along for a while, but in my experience, it is just a matter of time before it starts failing solidly and won't come up even when warm.

The same issues can cause the temperature sensor issues. Sometimes all the downstream sensors from the place where the issue is start having communication errors.

That would fit with my symptoms. I think I may have reached that point of no return today. Couldn't get it to stay hashing on all 3 boards, even with the ambient temp up in the 50F range, which is a new low for this machine.

I disabled the 2 hashboards that were reporting sensor issues and so far it's been running fine on the remaining board. I won't know if it's significantly more stable until it has a chance to run overnight, but I've accepted I will need repairs.

I see you're in the market for 17-series components, I don't suppose you offer repair services?
Post
Topic
Board Mining support
Re: Where to fix your Asic miners.
by
thebeardsman
on 30/01/2022, 19:09:58 UTC
Any recent feedback on D-Central or Myrig?
Post
Topic
Board Mining support
Re: S17 Pro Issues
by
thebeardsman
on 30/01/2022, 17:05:02 UTC
Update:

I turned it on long enough to get a log and this time it's reporting temp sensor errors from two boards:

Sun Jan 30 09:23:28 2022 daemon.err bosminer[1632]: Jan 30 16:23:28.484 ERROR bosminer_hal::sensor: Sensor hb3.11[ii_hwmon::tmp451::TMP451]: read failed: I2C error: general error Hashchip: no response for read_register(reg=0x1c) from chip One(11)

<snip>

Sun Jan 30 09:23:29 2022 daemon.err bosminer[1632]: Jan 30 16:23:29.918 ERROR bosminer_hal::sensor: Sensor hb2.8[ii_hwmon::tmp451::TMP451]: read failed: I2C error: general error Hashchip: no response for read_register(reg=0x1c) from chip One(Cool

<snip>

Sun Jan 30 09:23:30 2022 daemon.err bosminer[1632]: Jan 30 16:23:30.020 ERROR bosminer_hal::sensor: Sensor hb2.36[ii_hwmon::tmp451::TMP451]: read failed: I2C error: general error Hashchip: no response for read_register(reg=0x1c) from chip One(36)

<snip>

Sun Jan 30 09:25:03 2022 daemon.err bosminer[1632]: Jan 30 16:25:03.696 ERROR bosminer_hal::sensor: Sensor hb3.8[ii_hwmon::tmp451::TMP451]: read failed: I2C error: general error Hashchip: no response for read_register(reg=0x1c) from chip One(Cool
Sun Jan 30 09:25:03 2022 daemon.err bosminer[1632]: Jan 30 16:25:03.797 ERROR bosminer_hal::sensor: Sensor hb3.36[ii_hwmon::tmp451::TMP451]: read failed: I2C error: general error Hashchip: no response for read_register(reg=0x1c) from chip One(36)

Assuming I'm correct in interpreting the sensor IDs, looks like there are 2x sensors on hashboard 2, and 3x on hashboard 3 that aren't communicating.

Further assuming there's nothing to do about this myself if I don't trust my soldering skills in this context?
Post
Topic
Board Mining support
Topic OP
S17 Pro Issues
by
thebeardsman
on 30/01/2022, 16:20:38 UTC
Hello,

First post here but I've been lurking for a while.

I have one uppity S17 Pro that seems to be highly sensitive to low temperatures, much more so than the others. The main symptom is when incoming air gets cold (like 35F or below), it will essentially go into a reboot loop. It will boot up and start hashing for ~30-45 seconds, then it appears to lose all 3 boards and restart itself. Sometimes I can coax it back online but it's been gradually getting worse over the last couple months. Then, just a few days ago, I started seeing intermittent, slightly erratic chip temp readings from one board. This device's stability is now so low I'm at the point where I want to send it out for repair, but I thought I'd throw this out here in case it turns out to be something I can diag/repair myself.

For comparison, the other miners will run happily until incoming air gets into the teens (F). Then, they will typically reboot once and run happily again for 1-6 hours before they do it again, if they do it again. Yes, I'm sure you'll tell me that's bad for them, and I normally modulate the incoming cooling air temp but there's some diagnostic value in knowing that difference exists.

I'm running Braiins OS+, just installed the new 21.12.1 release. Didn't improve anything. Changing the power settings doesn't effect anything. If I look at the log, I see lots of "TX fifo on hashboard (n) is empty" where (n) is 1, 2, or 3. I can post more of the log if you'd like to read it. There's no mention of temp sensors or anything else. The fact that the "TX fifo" errors occur simultaneously on all three hashboards has me thinking it might be a control board issue?

Thank you for your time.