Code now works on cards with compute mode 3.5 (Titan, GTX780).
Tested on GTX-780 :-) It is about 275 MH/s.
any hope for compute 2.1? also how does that rate vs your cgminer speed
UPDATE:Automatic GPU architecture chooser was added it works on any architecture from 1.0 to 3.5. The relevant cubin file should be in cmake-rpcminer. I added these files on github, so use them.