Incident Response at PagerDuty

Home

This documentation covers parts of the PagerDuty Incident Response process. It is a cut-down version of our internal documentation used at PagerDuty for any major incidents and to prepare new employees for on-call responsibilities. It provides information not only on preparing for an incident, but also what to do during and after the incident. It is intended to be used by on-call practitioners and those involved in an operational incident response process (or those wishing to enact a formal incident response process). See the about page for more information on what this documentation is and why it exists.

Don't know where to start?

If you're new to incident response and don't yet have a formal process in your organization, we recommend looking at our Getting Started page for a quick list of things you can do to begin. Additionally, you can check out our Training Course page for a more detailed overview of our process.

Being On-Call#

If you've never been on-call before, you might be wondering what it's all about. These pages describe what the expectations of being on-call are, along with some resources to help you.

Before an Incident#

Reading material for things you probably want to know before an incident occurs. You likely don't want to be reading these during an actual incident.

During an Incident#

Information and processes during a major incident.

After an Incident#

Our followup-processes, how we make sure we don't repeat mistakes, and are always improving.

Crisis Response#

Incident response is about more than dealing with technical incidents. A crisis can happen at any time. Are you ready for it? The way you handle your worst day will leave lasting impressions about your brand and its perceived value in the eyes of your current and potential customers.

Training#

So you want to learn about incident response? You've come to the right place.

Training Guides#

Training Courses#

Additional Resources#

Useful material and resources from external parties that are relevant to incident response.