Hi, I've tested your version and it works really well. With the stock version I get about 4500 khash/s and now I get 4900 khash/s, that's almost a 10% increase, which is really nice.

I'm also trying to build bitcoin myself to tweak performance a bit, but haven't gotten past the default debug build yet. It would be nice if you could share your compilation settings?