Topics | |
---|---|
Service Overview | What is it, who uses it, where does it fit in overall |
Technical Architecture | Overview, upstream dependencies, sub-components |
Development Process | Source control, external dependencies, build, test, tools |
Change Management / Deployment | Process, technology, cadence, gates, rollback |
Configuration Management | Process, technology, source control |
Demand Forecasting, Capacity Management | How do you shift load, or scale? How do you load test? Can you shed load? |
SLAs, SLI, SLOs, KPIs, etc. | What are your targets? Are you meeting them? |
Monitoring, Logging, Diagnostics, Tickets | How do you monitor, diagnose? How noisy? |
Incident Response, production playbook, disaster recovery, backup/restore | How do you respond to issues? What is your waste case plan? Do you use it regularly? |
Review of Past Outages, War Stories | What has gone wrong previously? How was it fixed? |
Created
June 14, 2017 08:10
-
-
Save dastergon/61f5f4c7994f29c515991419a3d7878c to your computer and use it in GitHub Desktop.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment