AWS outage hits major apps and services, resurfacing old questions about cloud redundancy

HALL of Tech
By -
0
AWS outage hits major apps and services, resurfacing old questions about cloud redundancy Todd Bishop
AWS experienced a widespread outage early Monday that disrupted major sites and services. (GeekWire File Photo / Todd Bishop)

Amazon Web Services is showing “significant signs of recovery” after a major outage early Monday that impacted sites and services including Facebook, Snapchat, Coinbase and Amazon itself — reviving concerns about the internet’s heavy reliance on the cloud giant.

The problems began shortly after midnight Pacific in Amazon’s Northern Virginia (US-EAST-1) region. In an update shortly after 2 a.m., AWS blamed a DNS resolution issue with DynamoDB, meaning the internet’s phone book failed to find the correct address for a database service used by thousands of apps to store and find data.

OpenAI’s ChatGPT was among the sites impacted, The Verge reported. Check-in kiosks went down at LaGuardia Airport, with lines starting to form earlier this morning, the New York Times reported. DownDetector showed problems for financial apps like Venmo and Robinhood, gaming services such as Roblox and Fortnite, the Signal messaging app, and productivity tools including Slack and Canva.

In an update at 3:35 a.m., Amazon confirmed that the core DNS issue was “fully mitigated,” reporting that most services had recovered and were operating normally.

However, AWS said it was still working through a backlog of requests for Lambda, its serverless computing platform. It also warned that some customers would see increased error rates when trying to launch new instances in its core cloud computing service, EC2.

US-EAST-1 is AWS’s oldest and largest cloud region, a popular nerve center for online services, which has made it an Achilles heel for the internet over the years. Major outages originating from this same region also caused widespread disruptions in 2017, 2021, and 2023.

The latest outage suggests that many sites have not adequately implemented the redundancy needed to quickly fall back to other regions or cloud providers in the event of AWS outages.

https://ift.tt/WK19Zcy October 20, 2025 at 10:58AM GeekWire
Tags:

Post a Comment

0Comments

Post a Comment (0)