Incident Simulator

Incident Commander

Diagnose the root cause. Run the right commands. Save production.

TIME REMAINING
5:00

SCENARIO: The Traffic Spike

You are the on-call engineer. A massive marketing push just went live. The api-server deployment is returning 502 Bad Gateway errors. CPU alerts are firing. You have exactly 5 minutes to restore the service using the Kubernetes terminal.