All Systems Operational
Website Operational
Zaps Operational
Instant Triggers Operational
Polling Triggers Operational
Searches & Writes Operational
Apps Operational
Developer Platform Operational
Past Incidents
Jun 10, 2017

No incidents reported today.

Jun 9, 2017

No incidents reported.

Jun 8, 2017

No incidents reported.

Jun 7, 2017

No incidents reported.

Jun 6, 2017

No incidents reported.

Jun 5, 2017

No incidents reported.

Jun 4, 2017

No incidents reported.

Jun 3, 2017
Resolved - We apologize for the delay in updating this issue, but we have good news!

After resolving the RabbitMQ outage yesterday, we started looking at all recovery options for any Tasks that did not run during the outage.

These fell into two groups: first, Tasks stuck in pending (about 11k Tasks were in this state), and second, webhooks from instant Zaps that were completely missing (about 139k of these).

After verifying and shipping a few performance improvements to our recovery mechanisms, we're happy to say that we've completed recovery of Tasks impacted during the 15-minute outage yesterday (between 2017-06-02 19:07:00 UTC and 2017-06-02 19:22:00 UTC). If you are still missing Tasks from that window, please contact support via [email protected].

We'll be making some changes to how our RabbitMQ cluster behaves, along with further speed improvements to our recovery mechanisms, both to reduce the impact of future outages and to recover any affected Tasks faster.
Jun 3, 11:53 PDT
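The outage window above is given in UTC, while the update timestamps on this page are in PDT. As a rough sketch for checking whether a Task timestamp of your own falls inside that window, something like the following could be used. The function name `in_outage_window`, the fixed `PDT` offset, and the choice of inclusive bounds are assumptions made for illustration, not anything taken from our systems.

```python
from datetime import datetime, timedelta, timezone

# Outage window as stated in the incident report (UTC).
OUTAGE_START = datetime(2017, 6, 2, 19, 7, 0, tzinfo=timezone.utc)
OUTAGE_END = datetime(2017, 6, 2, 19, 22, 0, tzinfo=timezone.utc)

# PDT is UTC-7; a fixed offset is enough for this single date (assumption:
# no need for a full time zone database here).
PDT = timezone(timedelta(hours=-7), "PDT")


def in_outage_window(ts: datetime) -> bool:
    """Return True if a timezone-aware timestamp falls inside the window.

    Both bounds are treated as inclusive, which is an assumption.
    """
    if ts.tzinfo is None:
        raise ValueError("timestamp must be timezone-aware")
    return OUTAGE_START <= ts <= OUTAGE_END


# The first "Investigating" update was posted at 12:17 PDT on Jun 2.
first_update = datetime(2017, 6, 2, 12, 17, tzinfo=PDT)
print(in_outage_window(first_update))
```

Aware datetimes compare correctly across time zones, so a PDT timestamp can be checked against the UTC bounds without converting it first.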
Update - Everything looks resolved. We're doing some final investigation into possible lost Tasks and recovery options.

The hard outage lasted about 15 minutes, and the ramp-up to recovery lasted about an hour or so.
Jun 2, 13:58 PDT
Monitoring - Things look stable again. We're ramping Zap speed back up to normal, and everything should be fine shortly. We'll have more information on possibly lost Tasks and recovery efforts after we get everything running smoothly again.
Jun 2, 12:58 PDT
Identified - The issue has been identified and a fix is being implemented.
Jun 2, 12:32 PDT
Update - We have identified the outage. It is isolated to a single RabbitMQ node responsible for queueing tasks. We've temporarily paused Tasks while we resolve the outage, and are working to bring back all Tasks. More info soon.
Jun 2, 12:32 PDT
Investigating - Tasks are not running while we look into a possible queueing outage. More info soon.
Jun 2, 12:17 PDT
Jun 1, 2017
Resolved - Slack Zaps are now triggering as normal. Any Zaps using Slack triggers that were created between 8:00-10:30 UTC may still experience this issue, which can be rectified by recreating the Zap.
Jun 1, 04:16 PDT
Investigating - We are currently investigating an issue whereby some Slack Zaps are failing to trigger. We'll update this incident when we know more.
Jun 1, 02:25 PDT
Resolved - AWeber reports their outage has been resolved (https://status.aweber.com/incidents/lrg7xc04ggmr).
Jun 1, 02:23 PDT
Identified - We are monitoring an outage at AWeber (https://status.aweber.com/incidents/lrg7xc04ggmr) that is causing errors for Zaps.
May 30, 08:19 PDT
May 31, 2017

No incidents reported.

May 29, 2017

No incidents reported.

May 28, 2017

No incidents reported.

May 27, 2017

No incidents reported.