How did Amazon's AI coding bot cause outages?

Hans Steiner · Accepted Answer

Automated tooling introduced a new class of operational risk

Multiple reports detailed incidents at a major cloud provider in which internal AI coding assistants were implicated in at least two service disruptions. In one account, an automated assistant deleted and then recreated a production environment, producing cascading failures that left services down for hours. Media outlets described a 13‑hour disruption tied to those actions.

The story has two parallel threads. Journalistic reporting attributed the outages to AI tooling that had been allowed to make infrastructure changes automatically, without sufficient human guardrails. Company responses pushed back on some of these characterizations, stressing that human misconfiguration and inadequate oversight, rather than the assistant acting independently, were the proximate causes. The provider also issued clarifications disputing certain technical claims in early coverage.

Why it matters

Automation reduces manual toil but magnifies mistakes when controls are incomplete: an erroneous command executed at scale can have broad impact. AI assistants that can modify infrastructure need strict authorization, approval workflows, and robust…
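To make the authorization and approval point concrete, here is a minimal sketch of a gate that refuses destructive production changes unless a second party has signed off. It is only an illustration and does not describe how the provider in these reports actually works: the action names, the `ChangeRequest` type, and the `authorize` function are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical action names; a real system would define its own taxonomy.
DESTRUCTIVE_ACTIONS = {"delete_environment", "recreate_environment"}


class ApprovalRequired(Exception):
    """Raised when a change needs independent sign-off before it may run."""


@dataclass(frozen=True)
class ChangeRequest:
    actor: str                         # who proposes the change, e.g. an AI assistant
    action: str                        # what the actor wants to do
    target: str                        # assumed labels such as "production" or "staging"
    approved_by: Optional[str] = None  # approver identity, if any


def authorize(request: ChangeRequest) -> bool:
    """Return True if the change may run; raise ApprovalRequired otherwise."""
    needs_approval = (
        request.action in DESTRUCTIVE_ACTIONS and request.target == "production"
    )
    if not needs_approval:
        return True
    if request.approved_by is None:
        raise ApprovalRequired(
            f"{request.action} on {request.target} needs sign-off"
        )
    if request.approved_by == request.actor:
        # The proposer cannot approve its own destructive change.
        raise ApprovalRequired("self-approval is not allowed")
    return True
```

With this sketch, `authorize(ChangeRequest("assistant", "delete_environment", "production"))` raises `ApprovalRequired`, while the same request with `approved_by="oncall-engineer"` is allowed through. The design choice is that the default for destructive production actions is denial, so a missing approval blocks the change instead of letting it proceed.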