Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save dastergon/61f5f4c7994f29c515991419a3d7878c to your computer and use it in GitHub Desktop.
Save dastergon/61f5f4c7994f29c515991419a3d7878c to your computer and use it in GitHub Desktop.

SREcon16 Europe - Microsoft Azure SRE - Service Roast Sample

Topics
Service Overview What is it, who uses it, where does it fit in overall
Technical Architecture Overview, upstream dependencies, sub-components
Development Process Source control, external dependencies, build, test, tools
Change Management / Deployment Process, technology, cadence, gates, rollback
Configuration Management Process, technology, source control
Demand Forecasting, Capacity Management How do you shift load, or scale? How do you load test? Can you shed load?
SLAs, SLI, SLOs, KPIs, etc. What are your targets? Are you meeting them?
Monitoring, Logging, Diagnostics, Tickets How do you monitor, diagnose? How noisy?
Incident Response, production playbook, disaster recovery, backup/restore How do you respond to issues? What is your waste case plan? Do you use it regularly?
Review of Past Outages, War Stories What has gone wrong previously? How was it fixed?

Source: How to Improve Your Service by Roasting It

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment