Increased error rate
Incident Report for LaunchDarkly
Resolved
This incident has been resolved. We are in the process of conducting a root cause analysis and will be sharing more details shortly.
Posted Jul 07, 2022 - 09:04 PDT
Monitoring
All services are operational and error rates have subsided. We’re continuing to monitor our systems.
Posted Jul 06, 2022 - 09:14 PDT
Update
We are applying recovery steps, and the health of the database is improving. The API is still degraded, and code references and saved dashboards are disabled, but all other services are operational.
Posted Jul 06, 2022 - 08:06 PDT
Identified
We are experiencing issues in our primary DB cluster again, leading to increased API errors and some UI unavailability.
Posted Jul 06, 2022 - 07:26 PDT
Monitoring
We have finished the recovery steps and we are monitoring the results.
Posted Jul 05, 2022 - 20:23 PDT
Update
We are continuing in applying recovery steps and some services are recovering. We will continue to monitor the impact on all downstream services.
Posted Jul 05, 2022 - 19:03 PDT
Update
We are continuing in applying recovery steps and some services are recovering. Audit log has recovered and we will continue to monitor the impact on all downstream services.
Posted Jul 05, 2022 - 18:15 PDT
Update
We are continuing in applying recovery steps and some services are recovering. We will continue to monitor the impact on all downstream services.
Posted Jul 05, 2022 - 18:02 PDT
Update
We are continuing in applying recovery steps and some services are recovering. Experimental results have recovered and are no longer delayed. We will continue to monitor the impact on all downstream services.
Posted Jul 05, 2022 - 17:05 PDT
Update
We are continuing in applying recovery steps and some services are recovering. We will continue to monitor the impact on all downstream services.
Posted Jul 05, 2022 - 15:57 PDT
Update
We are continuing in applying recovery steps and some services are recovering. We will continue to monitor the impact on all downstream services.
Posted Jul 05, 2022 - 15:15 PDT
Update
We are still in the process of applying recovery steps and continuing to monitor the impact on all downstream services.
Posted Jul 05, 2022 - 14:02 PDT
Update
We are beginning to see our primary DB recover, but some of the recovery steps are still in progress. The web UI is available but we are still seeing an impact on downstream services
Posted Jul 05, 2022 - 13:05 PDT
Update
We are still working to restore the availability of our primary database cluster. We’re still seeing a significant impact across all our services. Experimentation results are delayed currently but we do not anticipate any data loss.
Posted Jul 05, 2022 - 12:05 PDT
Update
We’re still seeing a significant impact across all our services. Error rates for flag delivery (streaming and polling) are elevated, but oscillating between 1% and 10% failure rates. Most SDK clients connecting to the service should still be receiving correct flag variations.
Posted Jul 05, 2022 - 11:01 PDT
Update
We are continuing to work on a fix for this issue.
Posted Jul 05, 2022 - 10:51 PDT
Identified
Our primary database cluster is unhealthy. This is causing widespread availability issues with our site, APIs and other downstream services.

We do not anticipate any data loss and are working to restore service as quickly as possible
Posted Jul 05, 2022 - 10:03 PDT
Update
We are still investigating the issue
Posted Jul 05, 2022 - 09:06 PDT
Update
We are continuing to investigate this issue.
Posted Jul 05, 2022 - 07:23 PDT
Investigating
We are currently investigating increased error rates across LaunchDarkly services.
Posted Jul 05, 2022 - 07:15 PDT
This incident affected: Flag Delivery Network (Server-side streaming API, Client-side streaming API, Polling API), Feature management (core functionality) (Feature workflows, Flag targeting), Feature management (non-core functionality) (Audit log), and Account management, Experimentation, Integrations and webhooks.