Post
Topic
Board Beginners & Help
Re: further improved phatk OpenCL kernel (> 2% increase) for Phoenix - 2011-07-01
by
Diapolo
on 04/07/2011, 04:32:05 UTC
Good, but this

> - added: "u t1W" variable, which is used in sharound2() to avoid double execution of t1W()

may actually hurt the performance, in theory. If you're using more registers, at least some GPUs may not be able to run as many threads concurrently as they used to, thus slowing things down.

That one was removed a few hours after I added it, don´t worry Smiley. You can safely remove "u t1W;" and replace "t1W = t1w(n);" with "t1 = t1W(n);" in sharound2.

Dia