Post Mortem: System Outage for DoiT Console App Date: April 24, 2023
Incident Summary: On April 24, 2023, the DoiT Console App experienced intermittent 404 errors and NO_BACKEND_SELECTED issues. This caused a significant disruption in the app's functionality, affecting the user experience.
Timeline (Eastern Time):
Root Cause: The root cause of the issue was a change rolled out to a subset of Google Front End (GFE) servers. This change allowed the GFE to occasionally select an App Engine target that didn't contain the correct service, resulting in 400 series errors and the no backend selected status.
Resolution and Recovery: Google's product engineers rolled back the change in the affected GFEs, which resolved the issue. The rollback was completed on April 25, 2023, at 3:56 AM Eastern Time.
Going forward, our team will continue to monitor the app's performance and promptly report any anomalies to ensure the best possible user experience for the DoiT Console App.