Amazon investigating major cloud outage, GitHub and Heroku report issues
Amazon Web Services' central cloud computing platform, EC2, and web storage service, S3, suffer major outages
Update 10-08-2015 10.48: At 2.38am PDT, Amazon said it has taken multiple steps to reduce latencies and error rates for S3, but customers may continue to experience issues.
Update 10-08-2015 10.53: Amazon has reported further issues with CloudSearch, Elastic MapReduce clusters, CloudTrail events, Lambda APIs, and provisioning and scaling times for load balancers.
Update 10-08-2015 12.30: At 3.46am PDT, Amazon said the errors have been corrected and all services are operating normally. 'We identified the root cause and pursued multiple paths to recovery,' it said.
Amazon Web Services (AWS) is facing a major cloud outage this morning with both its Elastic Compute Cloud (ECC) and Simple Storage Service (S3) solutions experiencing a huge number of request errors in the U.S.
At 12.36am PDT, the company said it is investigating elevated errors for requests made to Amazon S2 in the US-STANDARD region.
Fifteen minutes later, it reported further request errors for the EC2 APIs and launch failures for new EC2 instances in the US-EAST-1 region.
It has since also reported elevated latencies affecting its SendEmail, SendRawEmail and SendSMTPEmail APIs in the US-EAST-1 region.
Just under 30 minutes after the problems started, Amazon said it was investigating the issues.
Then at 1.08am PDT, the company said: ‘We believe we have identified the root cause of the elevated error rates and latencies for requests to the US-STANDARD Region and are working to resolve the issue.’
At 1:52 AM PDT, it added: 'We are actively working on the recovery process, focusing on multiple steps in parallel. While we are in recovery, customers will continue to see elevated error rate and latencies.'
Cloud platform-as-a-service firm Heroku, which runs on AWS, has confirmed in the last hour that it is having issues with an upstream provider, resulting in build errors and customers not being able to push changes to apps.
Another AWS customer, hosting service GitHub, has also said it is investigating increased error rates on repository release downloads and issue image uploads, while numerous other customers have taken to Twitter to report outages.
Previous outages have affected websites like Vine and Airbnb, and highlight how many of the world's most popular web services rely on the health of AWS's data centres.
We've identified an issue with a provider and are continuing to monitor the situation.— GitHub Status (@githubstatus) August 10, 2015
Update: Issues with app builds https://t.co/LIhXGF9f2j— Heroku Status (@herokustatus) August 10, 2015
Identified: AWS EC2 service in N. Virgina is also experiencing launch failures. AWS has identified the root cau... http://t.co/mOXBl6HR90— Engine Yard Cloud (@eycloud) August 10, 2015
Sorry, our provider has issues which causes delays in cloud encoding. Kollab Transfer is a workaround until fixed. http://t.co/L3Bjmmuky8— Kollaborate (@KollaborateTV) August 10, 2015
Buckle your seatbelts, folks, AWS just changed a status icon to yellow. pic.twitter.com/YaSqWJWXsl— Nick Zadrozny (@nz_) August 10, 2015
Update: AWS S3 is experiencing performance issues that is affecting calls to our Push API. We're closely monitoring the situation.— Geckoboard (@geckoboard) August 10, 2015
Investigating: Performance issues on Amazon S3 http://t.co/MSmEyGGwMm. You may experience issues while uploading and downloading files.— Convo (@convo) August 10, 2015
We are seeing a higher error rate for downloading and viewing images due to the Amazon S3 outage. http://t.co/l5gyK0yHFp— Schedugram (@schedugram) August 10, 2015
[status] Investigating: Amazon's S3 service is currently experiencing issues which can cause errors uploading a... http://t.co/5Vc8QIu2G2— Atlassian HipChat (@HipChat) August 10, 2015
We have detected latency and increased error rates caused by an issue with an upstream provider. We are working to diagnose the problem.— Mapbox (@Mapbox) August 10, 2015
Status - We’re experiencing some storage issues from one of our providers, we’re currently serving your content from our EU region. 1/ #aws— Flat (@flat_io) August 10, 2015