Yeah, maybe not too suited to post it in this thread, or on Bitcointalk in general. Decide for yourself, perhaps you could adjust the query so the results are smaller in those cases? To only include the "extreme" cases for example.
There's another drawback: it requires much more scraping (and I already scrape a lot from Bitcointalk), so I'll only do this once.