Uptime is critical for modern systems, but downtime and security incidents are inevitable. Your users’ experience depends on your ability to respond quickly, confidently, and consistently when things go awry.
In this course, learn how to handle unexpected crises in information systems from a DevOps perspective. Instructor Ernest Mueller steps through the overall incident response process, explaining how to define what constitutes an incident for your organization and select the tools you’ll need to mitigate these high-stakes problems. He also explains how to detect and report incidents, communicate with users and internal employees about issues, troubleshoot problems, and continuously improve your incident management process.
The incident response process
Detecting and reporting incidents
Communicating effectively about a problem
Best practices for diagnosis and repair
Cleaning up after an incident
Implementation challenges for incident response