Skip to content

Instantly share code, notes, and snippets.

@mweiden
Last active August 15, 2018 21:15
Show Gist options
  • Save mweiden/66787f1ac0edaf076a50abdbcdfb98ac to your computer and use it in GitHub Desktop.
Save mweiden/66787f1ac0edaf076a50abdbcdfb98ac to your computer and use it in GitHub Desktop.
A runbook template for HCA DCP

[System Name]: Run Book / System Operation Manual

Service or system overview

README.md link:

Service owner

Organization: Team: Engineering slack channel:

System characteristics

Throttling and shutdown

Tools

System configuration

Configuration management

Secrets management

System backup and restore

Backup requirements

Backup procedures

Restore procedures

Monitoring and alerting

Log aggregation

Metrics

Health checks

Operational tasks

Deployment

Recovery procedures by failure mode

[Failure Title #0]

Symptoms

Recovery procedure

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment