You are on-call for a service named “http-router”, a simple HTTP router service. It’s sole purpose in life is to take requests from a front-end web application and pass them to backend services named “order-service” and “return-service”. You are on call for the “http-router”, the other services are managed by other teams at the company.
While sitting on your sofa during a lovely holiday weekend and listening to music, you are interrupted by the sounds of your phone telling you that you’ve received a text message. It’s an alert saying “http-router: unable to reach order-service (STATUS: 500)”. It’s now time to turn off the gramophone and face a different kind of music.
For this scenario you have access to your laptop, and expected monitoring tool suite. This scenario occurs during a long weekend, in the evening