I did a full repull of the files from charlie's github and it is working now, using only 1 of my 2 gpu's. I did uninstall the CUDA dev prog as well after seeing you did, and pulled the correct file it needed from github.
As of now, not seeing any difference in hash rate on a Fermi card, but I assume that's what was expected.