Amazon Cloud Unit Faces Multiple AI Tool Outages, FT Reports

February 20, 2026

7 min read

Share with:

In mid‑December 2025, Amazon’s cloud arm, Amazon Web Services (AWS), faced an unexpected tech snag that lasted more than half a day. According to a report from the Financial Times on 20 February 2026, at least two service interruption in December were linked to errors involving AWS’s own AI tools, including its internal coding assistant known as Kiro.

While AWS says these disruptions stemmed from user error, not a flaw in the AI itself, the incidents have sparked fresh questions about how far automated tools should be trusted inside critical cloud systems.

December 2025 Interruption Hit Amazon Cloud Unit, AWS

What Happened During the AI‑Linked Outages?

In mid‑December 2025, Amazon’s cloud division Amazon Web Services (AWS) suffered at least two system interruptions tied to its own AI tools, according to a Financial Times report on 20 February 2026. These reports, based on individuals familiar with the matter, reveal that AWS engineers allowed an internal AI coding agent to make system changes to a production system used by customers.

The autonomous AI, known as Kiro, chose to delete and recreate parts of the system environment, triggering an interruption that lasted about 13 hours. AWS representatives have disputed that these incidents represent inherent AI failures and instead described them as stemming from user error due to misconfigured access controls.

AWS said the issue affected only one region in mainland China and did not impact its broader cloud infrastructure, including compute, storage, database, or other AI services.

Why Did the AI Tool Cause an interruption?

The AI tool involved, Kiro, is an autonomous coding assistant developed by AWS that can take actions based on instructions without explicit real‑time human intervention. In the December interruption, engineers granted Kiro permissions equivalent to human developers, allowing it to make live system changes. According to internal sources, this lack of standard safety checks, such as peer code review or approvals, was a key factor in the incident.

🔸 AI CODING BOT CAUSED 13-HOUR AWS OUTAGE, SPARKS SAFETY DEBATE

According to the Financial Times, a December 2025 AWS outage was caused by Amazon’s internal AI coding agent, Kiro. The disruption lasted up to 13 hours after the autonomous bot, tasked with fixing a minor bug in…
— 𝗺𝗼𝗼𝗱𝗶𝘅 (@MoodixMarket) February 20, 2026

Financial Times reports that this was not a one-time event. In recent months, engineers linked at least two production outages to internal AI tools. Employees said these issues were “entirely foreseeable” due to weak oversight.

How Did AWS Explain the Interruption?

AWS publicly stressed that the December Interruption were limited and brief, and did not signal systemic failure across its global platforms. The company stated that:

The disruption was due to misconfigured access controls rather than an AI malfunction.
The affected system was a small part of AWS’s cloud services, impacting only one service in China.
There was no evidence that AI tools made more mistakes than conventional developer tools.

AWS clarified that AI agents in general still request authorization before taking action, and that in this case, the permissions given to the engineer were broader than intended, enabling the AI to enact changes normally requiring manual input.

How Does This Compare to Past AWS Interruption?

The December incident occurred amid heightened scrutiny of AWS reliability. In October 2025, AWS also experienced a major global outage unrelated to Kiro or other internal AI agents. This event disrupted thousands of popular apps and services, including Reddit, Roblox, and Snapchat, when infrastructure failures in a core region cascaded across multiple platforms.

BREAKING: Amazon Web Services

A massive AWS outage early this morning sent major websites and apps down for several hours

Tracking site Downdetector has received over 8 million reports around the globe pic.twitter.com/Fgtzt2hW1x
— Morning Brew ☕️ (@MorningBrew) October 20, 2025

That outage, lasting several hours, highlighted how dependent major parts of the internet are on AWS’s infrastructure and how a single failure point, such as DNS or database services, can affect a huge ecosystem of services and applications.

What are the Broader Industry Implications?

These AI‑linked interruptions raise questions about how companies balance the efficiency gains of autonomous AI tooling with risk controls. As cloud providers like AWS push deeper into AI integration, including systems that can operate for days without human input, there is a growing debate over how much autonomy is appropriate for live production environments.

While according to AWS Spokesperson:

In both cases, the issue was user error, not AI error. The December event was an extremely limited incident when AWS Cost Explorer in one of our two regions in Mainland China was affected for 13 hours. No code was deployed, and this event did not impact compute, storage, database, AI technologies, or any other of the hundreds of services we run. We are not aware of any related customer inquiries from this isolated interruption. The issue stemmed from a misconfigured role, the same issue that could occur with any developer tool or manual action. Kiro puts developers in control, users configure which actions Kiro can take, and by default, Kiro requests authorization before taking any action. Following the December incident, AWS implemented numerous safeguards, including mandatory peer review for production access, enhanced training on AI-assisted troubleshooting, and resource protection measures.

Industry experts suggest that modern cloud systems demand strong guardrails around autonomous actions, similar to established practices like peer reviews, approval gates, and strict access controls that apply to human developers. The December incidents at AWS serve as a real‑world test case for these emerging governance practices.

Are Customers and Employees Concerned?

According to internal reporting, some AWS employees have expressed concern about the pace of rollout for internal AI tools and whether current safeguards are sufficient. These concerns reflect broader skepticism in the tech industry about the best way to harness advanced AI systems without introducing undue operational risk.

Meanwhile, AWS continues to advocate for its AI tooling strategy, pointing to strong adoption rates among developers and asserting that proper configurations will minimize future disruptions.

How Should Companies Using AWS Prepare?

For developers and businesses that rely on AWS cloud services:

Plan for multi‑region deployments to avoid single‑region failures.
Implement strict access and authorization controls for automated tools.
Use external monitoring tools instead of relying solely on cloud provider status pages.
Consider failover and redundancy strategies as part of disaster planning.

AI tools are a valuable asset when used correctly, but they require careful governance. As toolchains incorporate intelligent automation, companies should evaluate these tools like any other in production, with high standards for accountability and safety.

Wrap Up

The December 2025 AWS limited supply shows that even leading cloud providers are not immune to risks from AI tools. While AWS calls it a user‑error issue, the incidents highlight the need for careful oversight, strict access controls, and clear governance when using autonomous AI in live systems.

Companies relying on cloud services should plan for redundancy, monitor automated tools, and balance efficiency with safety. As AI becomes more integrated into cloud infrastructure, these lessons will be essential for maintaining reliability and trust.

Frequently Asked Questions (FAQs)

What caused the recent AWS AI outage?

In December 2025, AWS faced an outage linked to its internal AI tool, Kiro. Engineers gave it permission to make system changes, which led to service disruption for hours.

Which apps were affected by the AWS outage?

The outage mainly affected one AWS region in China. Some customer services experienced delays, but most global apps and platforms continued running normally during December 2025.

Will AI make AWS interruptions more frequent?

Experts say AI tools can speed work but may add risks. Proper oversight and strict access controls are needed to prevent incidents like the December 2025.

Disclaimer:

The content shared by Meyka AI PTY LTD is solely for research and informational purposes. Meyka is not a financial advisory service, and the information provided should not be considered investment or trading advice.

What brings you to Meyka?

Pick what interests you most and we will get you started.

I'm here to read news

Find more articles like this one

I'm here to research stocks

Ask our AI about any stock

I'm here to track my Portfolio

Get daily updates and alerts (coming March 2026)

Meyka Newsletter

Get analyst ratings, AI forecasts, and market updates in your inbox every morning.

~15% average open rate and growing

Trusted by 10,000+ active investors

Free forever. No spam. Unsubscribe anytime.

Samsung Electronics Plans $73 Billion Investment to Dominate AI Chip MarketMar 19, 2026

Apple China Sales Jump 23% in Early 2026, Defying Smartphone Market SlumpMar 19, 2026

Meta Shuts Down Horizon Worlds as It Moves Further Away from the MetaverseMar 19, 2026