1000 results:
374/364/175/64/19/1/2/1/0/0
(which is 1007 nonces = close to expected average of 1000)
I'm surprised so many have 0. If only we could test for a "dead" work unit and discard them.

Then you could increase all hashing performance by a massive 37% (based on that result)

... and yet this is something I looked into a long time ago (almost 2 years) early on when I first found out about bitcoin, but never completed my work on it ...
By the looks of those results I should get back to it one day and finish it ... but I doubt I'll bother since it probably won't yield anything

It started as a program to optimise hashing (and found all the GPU optimisations independently)
Those results look very close the theoretical values for the relevant Poisson distribution (lambda = 1). The first 11 theoretical values are: {367.88, 367.88, 183.94, 61.313, 15.328, 3.0657, 0.51094, 0.072992, 0.009124, 0.001014, 0.000101}. This makes sense, since mining is a Poisson process.
Something I learned: multiple nonces within one work unit occurs about 2/3 as often as a single nonce. So you should be prepared to handle multiple nonces.