Also makes me wonder why people don't post them for everyone to see. So, 3% or even 5% speed increase doesn't really matter all that much if you're just a regular guy with a couple of ATI cards, right? Or am I wrong?
PcChip - give me your best advice, should I keep my next optimization to myself or post it?

I can't answer that my friend, only you can.
If you're the type of person who loves tinkering with asm/opencl/hardeware code just for fun, and improving things for the community (which it seems like you are, and a redditor as well - highfive! [Did you go to a reddit meetup saturday?]) then it makes sense to continue posting them like you are.
If you're in bitcoin for profits, and want to use your intelligence to tweak the code to give yourself a (totally fair) advantage over the competition, then obviously it's not in your best interest to post it. Note that I'm not saying there's anything wrong with this scenario, because I can't even say which route I'd pick myself.
I can say however, that I very much appreciated that speedup you posted, as did many others, and I hope you get many more BTC than the five you currently had donated to you at
http://blockexplorer.com/address/15igh5HkCXwvvan4aiPYSYZwJZbHxGBYwB