AWS SRE Incident Response SimuLab

Difficulty: Beginner
Duration: Up to 1 hour
Students: 3
Real Scenarios: Step into challenges inspired by real-world scenarios.
AI Characters: Chat with life-like AI-powered characters to investigate and troubleshoot.
Think Like a Pro: Stay calm, ask the right questions, and focus on solutions.

Description

A well-intentioned security change has quietly broken the production application. Users can load the page, but core functionality is blocked. Step in as the on-call SRE to investigate the issue, restore service, and put safeguards in place to prevent it from happening again.

Assessed topics

  • Site Reliability Engineering (SRE)
  • Incident investigation and response
  • Observability and monitoring

Intended audience

  • Site Reliability Engineers
  • Cloud Engineers

Prerequisites

Familiarity with the following will be beneficial but is not required:

  • AWS Web Application Firewall (WAF)
  • AWS Application Load Balancer (ALB)
  • Amazon CloudWatch Metrics

Scenario Rules

  1. Open a Ticket to Start: Don't leave customers waiting! Move a ticket to “In Progress” and dive in.
  2. Ask & Explore: Chat and investigate to gather the information you need.
  3. Resolve & Submit: Confident in your fix? Mark tickets as “Done” and submit your lab.