Multiple services are affected, service degradation
Incident History
Mar 5, 19:30 UTC
Resolved - On Mar 5, 2026, between 16:24 UTC and 19:30 UTC, Actions was degraded. During this time, 95% of workflow runs failed to start within 5 minutes with an average delay of 30 minutes and 10% workflow runs failed with an infrastructure error. This was due to Redis infrastructure updates that were being rolled out to production to improve our resiliency. These changes introduced a set of incorrect configuration change into our Redis load balancer causing internal traffic to be routed to an incorrect host leading to two incidents.
We mitigated this incident by correcting the misconfigured load balancer. Actions jobs were running successfully starting at 17:24 UTC. The remaining time until we closed the incident was burning through the queue of jobs.
We immediately rolled back the updates that were a contributing factor and have frozen all changes in this area until we have completed follow-up work from this. We are working to improve our automation to ensure incorrect configuration changes are not able to propagate through our infrastructure. We are also working on improved alerting to catch misconfigured load balancers before it becomes an incident. Additionally, we are updating the Redis client configuration in Actions to improve resiliency to brief cache interruptions.
Mar 5, 19:17 UTC
Update - Webhooks is operating normally.
Mar 5, 19:05 UTC
Update - Actions is operating normally.
Mar 5, 18:59 UTC
Update - Actions is now fully recovered.
Mar 5, 18:15 UTC
Update - The queue of requested Actions jobs continues to make progress. Job delays are now approximately 6 minutes and continuing to decrease.
Mar 5, 17:48 UTC
Update - We are back to queueing Actions workflow runs at nominal rates and we are monitoring the clearing of queued runs during the incident.
Mar 5, 17:25 UTC
Update - We have applied mitigations for connection failures across backend resources and we are observing a recovery in queueing Actions workflow runs.
Mar 5, 16:52 UTC
Update - We are observing delays in queuing Actions workflow runs. We’re still investigating the causes of these delays.
Mar 5, 16:47 UTC
Update - Webhooks is experiencing degraded availability. We are continuing to investigate.
Mar 5, 16:41 UTC
Update - Actions is experiencing degraded availability. We are continuing to investigate.
Mar 5, 16:35 UTC
Investigating - We are investigating reports of degraded performance for Actions