Skip to content

Instantly share code, notes, and snippets.

Embed
What would you like to do?
Incident Post Mortem Template

Post-mortem (incident #1)

Date

2019-03-18

Status

Completed

Summary

Overview of what happened

Impact

Who was impacted and how much they were impacted

Resolution

Description of what solved the problem. If was a temporary fix, describe the long-term solution

Root Cause

Description of the root cause

Action Items

Action Item Issue Owner Status
Created an issue to integrate deployment with E2E testing github.com/project/issues/10 John DONE
Scheduled a meeting with the SRE Team to talk about the problem - Doe DONE

Timeline

2018-03-18 (all times UTC−03:00)

Time Description
09:20 Received an alert from Opsgenie of an increased error rate on the deployments system
09:24 Discovered several error logs on AWS Beanstalk regarding a service X due to a memory limit issue
09:35 Reviewed last commits on the deployment services and identified a new NPM package introduced yesterday

Extra References

Previous configuration

version: 2
jobs:
  build:
    docker:
      - image: circleci/<language>:<version TAG>
    steps:
      - checkout
      - run: <command>
  test:
    docker:
      - image: circleci/<language>:<version TAG>
    steps:
      - checkout
      - run: <command>
workflows:
  version: 2
  build_and_test:
    jobs:
      - build
      - test

New configuration

version: 2
jobs:
  build:
    docker:
      - image: circleci/<language>:<version TAG>
    steps:
      - checkout
      - run: <command>
  test:
    docker:
      - image: circleci/<language>:<version TAG>
    steps:
      - checkout
      - run: <command>
workflows:
  version: 2
  build_and_test:
    jobs:
      - build
      - test
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment