Yeah, the struggle is real.
There is already a tool that simplifies this task: it scrapes a 1000+ Bitcointalk [ANN] thread into a single, highly readable HTML document for easier reading and analysis.
And I am working on a fork of this tool to improve it.
Sweet, and that tool sounds awesome. Thank you for the heads up!