All Systems Operational
Website Operational
Platform Analysis Operational
Classic Analysis Operational
GitHub Operational
Past Incidents
Oct 10, 2016

No incidents reported today.

Oct 9, 2016

No incidents reported.

Oct 8, 2016

No incidents reported.

Oct 7, 2016

No incidents reported.

Oct 6, 2016
Resolved - Queue depth is back down and classic analysis wait times have returned to normal levels.
Oct 6, 23:42 EDT
Identified - We've identified a delay for classic analysis due to high queue depth. We're bringing additional capacity online to work through the backlog.
Oct 6, 23:24 EDT
Oct 5, 2016

No incidents reported.

Oct 4, 2016

No incidents reported.

Oct 3, 2016
Resolved - This incident has been resolved.
Oct 3, 11:48 EDT
Monitoring - We've worked through the backlog of service events, meaning PR updates are going out in a timely fashion again. Some other downstream queues still have a backlog to get through (e.g. emails, churn calculations), which we'll continue to monitor.
Oct 3, 10:54 EDT
Update - We identified a code change that was causing one of our jobs to run much slower than expected. After reverting this change, things are moving again. PR status updates will still be delayed until we've cleared the backlog of work.
Oct 3, 10:48 EDT
Identified - We've identified that pull request status updates may not be going out, as there's currently a large backup in one of our worker queues. We're investigating the cause now.
Oct 3, 10:32 EDT
Oct 2, 2016

No incidents reported.

Oct 1, 2016

No incidents reported.

Sep 30, 2016

No incidents reported.

Sep 29, 2016

No incidents reported.

Sep 28, 2016

No incidents reported.

Sep 27, 2016
Resolved - This incident has been resolved.
Sep 27, 14:15 EDT
Monitoring - We made a change to our churn calculation job to perform far less work. Calculation time has returned to normal.
Sep 27, 13:39 EDT
Identified - We've been alerted to a delay in calculating churn information. We're bringing on more workers to handle the increased traffic.
Sep 27, 11:13 EDT
Sep 26, 2016
Resolved - This incident has been resolved.
Sep 26, 09:51 EDT
Monitoring - We've made it through the backlog and the system is operating normally now.
Sep 26, 08:32 EDT
Identified - An incident has been reported in our AWS region that explains the network connectivity errors we were seeing. The system seems healthier now, but analysis is still delayed as we work through the backlog.
Sep 26, 08:23 EDT
Investigating - We're experiencing connection timeouts talking to 3rd-party services like GitHub, Stripe, etc. Users may experience errors on the website, a delay in analysis starting, or analysis errors related to the clone step. We're investigating.
Sep 26, 08:06 EDT