The problem is that it is not just a few flips, it is much more than that,
as you can see, many people posted that crap on Telegram and Discord, and most of them have Strong consensus.
That means the consensus is not good enough and needs fixing.
Because people are crying too much about every bad flip they see. Bad flips happen, but I don't think it is that big an issue. Just take a look at my photo (it is actual data based on 13 validations, not on how many people are crying or how loudly they are crying).
Out of 288 flips (long and short) I was only 3.5 points short of the maximum. How much of that was my fault? Maybe 0, maybe 1, 2 or 3. Based on that, I assume that ~1% of flips are bad flips that passed.
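A rough sketch of that arithmetic, assuming each flip is worth one point (my assumption, the numbers are the ones above) and that every lost point was caused by a bad flip rather than by my own mistake. That makes it an upper bound, and the function name is just for illustration:

```python
def bad_flip_rate_upper_bound(points_lost: float, total_flips: int) -> float:
    """Share of flips that could have been bad, if every lost point came from a bad flip.

    Assumes one point per flip; this is an illustration, not the network's scoring rule.
    """
    if total_flips <= 0:
        raise ValueError("total_flips must be positive")
    return points_lost / total_flips


# Numbers from the post: 288 flips over 13 validations, 3.5 points short of the maximum.
rate = bad_flip_rate_upper_bound(points_lost=3.5, total_flips=288)
print(f"{rate:.2%}")  # roughly 1.2%
```

If some of those 3.5 points were my own fault, the real share of bad flips that passed is lower than this.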
After each of the last 8 validations I could have gone to Telegram and posted "ohh my flips were so great. I got 100%", but people do not do that. It is human nature to cry when something goes wrong.