Delayed AWS, GCP, Azure, SaaS integrations Metrics and Logs

criticalDatadogMonitoringOct 14, 2025 17:14Duration: 2h 5m
api
API Issue

Summary

This incident has been resolved.

Impact

critical

Timeline

Oct 14, 2025 17:14

[investigating] We are investigating increased latency processing AWS, GCP and Azure Metrics. As a result of this issue, some users may see delays or gaps in graphs that contain these metrics. To prevent spurious alerts, we have temporarily disabled monitors based on this data.

via statuspage
+14m
Oct 14, 2025 17:29

[identified] The issue has been identified and a fix is being implemented.

via statuspage
+14m
Oct 14, 2025 17:43

[monitoring] A fix has been implemented, and we are monitoring the results.

via statuspage
+1h 2m
Oct 14, 2025 18:44

[monitoring] Data flow has been restored for new incoming data. We are currently backfilling historical data.

via statuspage
+35m
Oct 14, 2025 19:19

[resolved] This incident has been resolved.

via statuspage

Lessons Learned

Datadog has experienced 33 incidents in the past year. This frequency suggests systemic reliability challenges that may warrant additional monitoring.

📊Incidents related to api have occurred 186 times across all providers in the past year. This is one of the most common failure categories in cloud infrastructure.

💡This incident is categorized as: API Issue. Consider implementing preventive measures specific to this failure category.