I am also working on a version of the BitCoin CUDA Client and will do all that I can to help this project. It will be opensourced as well, for the same reasons nelisky stated. I will test this first version out on my GTX 460(borrowing from a friend

) as soon as I can. Also, the pseudocode I've been writing up should be quite a bit faster than this currently is because it is incorporating FPGA styling and CUDA programming.
I should have something up pf my code up after the weekend.
Nice! I've wanted to get into FPGA's too, and bitcoin seems very fitting, but unfortunately time doesn't stretch that much.
BTW, the code I posted is slightly broken

If the cuda thread generates a valid block it will die, for where it reads in main.cpp:
if (DEBUG || hash <= hashTarget) {
pblock->nNonce = keep+i;
break;
}
it shoudl read
if (DEBUG || hash <= hashTarget) {
pblock->nNonce = tmp.block.nNonce = keep+i;
break;
}