Here is a situation most engineering teams have lived through. An alert fires: API error rate is above 5%. The on-call engineer opens the dashboard. Error rate is elevated, confirmed. They look at CPU, memory, and database connection counts. All norma…