Skip to content

Instantly share code, notes, and snippets.

@jnewland
Created March 15, 2012 00:19
Show Gist options
  • Save jnewland/2040649 to your computer and use it in GitHub Desktop.
Save jnewland/2040649 to your computer and use it in GitHub Desktop.

The Ack Bar

Brainstorming an ideal landing page for indicating the status of working an Ops/Nagios/PagerDuty alert.

First, indicate your initial thoughts on the alert

I know, bro

For problems you're already aware of and are working on.

  • Missed scheduling downtime
  • Side effect of something else
  • Will resolve itself shortly

Responses of this type should make it obvious to the team that no action is required by anyone else.

Not an anticipated alert, but you intend and are able to respond and resolve. Responses of this type should notify others of your ETA to full availability so they can steal if they're more available.

Indicating this status should immediately take you to more informations about the alerts (graphs, etc) so you can begin to decide on a resolution path, dismiss the alert, or escalate.

FUCK

For alerts you can't handle this for some reason, either because they are above your head, or you're unable to get to the tools necessary at the time (laptop, VPN, etc)

Responses of this type should make it possible to escalate to the backup oncall person or page everyone. Reponses should also include a short reason for your panic.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment