I prefer to use result.s0 and such anyways, but I don't use the offset feature as it only lowers 1 APU but raises GPR making the card run hotter. Anyways, I found time to fix it and get it working, all three outputs are good. I was mistaken with the output buffer since there is no (hi - 3) with vectors 3 as there is no result.w
So, now it correctly reports (lo - 1), (hi -1), or (lo - 3). Does anyone want to try it out?