I should have been more specific: I think there's a problem if you compile it yourself on 64-bit Linux. The supplied Windows binary is 32-bit.
Ubuntu 12.04 + x86_64, AMD GPUs -- works for me. No blocks yet, but it looks busy and the numbers are sane.