if I remember well there is something called monitoring in IT. some stuff that sends alerts on threshold apparently...
Some even use it to have automatic actions when an alert is raised.
I had a DRK monitor in c-cex it failed
All the monitoring in the world wont help an issue like this if the proper checks arnt in place properly.