Weird. The 12-18 is known to compile on CUDA 5.5. For this to work, I changed the outdated spinlock kernel to use compute_12, and I had to lower the shared memory use (by limiting the maximum number of warps to 12, I believe). No idea why it still uses compute_30 for you.
Maybe try running ./autogen.sh before configure and make?
Christian
Something is wrong with my system; it freezes when I run device query, for example.
As for the spinlock, maybe I messed up the sources.

I will download them again and retry.