A postmortem is a written record of an incident. Among other things, it documents the incident’s impact, what caused it, the actions taken to mitigate or fix it, and how to prevent it from happening again.
A one-sentence summary of the incident.
The incident's impact on customers and revenue or reputation, if known. e.g. 'Users where unable to cretae bookings from 9:20 to 11:00'
What has caused the incident?
What happend to lead to the accident?
The actions that resolved the outage. e.g. 'Disabling feature X helped to mitigate the problem. Rolling pack to version Y resolved it'
How the problem was noticed? e.g. 'Recived messasge from Z in Digital Freight Teams channel'
A list of actions taken (with links to GitHub PRs/Releases) to mitigate or resolve the problem, and to prevent it from recurring
What went well? What went wrong? And what was sheer luck/magically fixed?
A detailed timeline of the events related to the incident
Additional graphs, screenshots, command outputs, requests info, etc.