Post
Topic
Board Announcements (Altcoins)
Re: [ANN] [SKC] Skeincoin | Skein-SHA2 | CPU mining | GPU miner available
by
madjihad
on 03/01/2014, 01:07:01 UTC
Looks like you misunderstand the local memory, it is shared between threads in a workgroup, and you do not want all threads writing in this array simultaneously. The idea is to get rid of array and replace it wilt just 16 uint variables: AMD stores arrays in global memory when optimization cannot coerce them into registers (and optimization is not necessarily supersmart), and uint variables are always mapped to registers (then there may be register spilling if you don't have enough of them, but at least you get a warning).

Ah, big thanks for the clarification. Probably the time to read opencl documentation has come Smiley
And it's not the last my fail for today Embarrassed I've tested double Skein today and it was runing at 780MH/s on 5870 only due to avoiding last bitwise AND check (hash & 0xf0ffffff). So it's wrong data Sad And maybe your SHA256 implementation performs really well and there might be no reason to change it. Will start testing and search of the bottleneck from a scratch tomorrow...sorry for disinformation  Embarrassed