Metrics data ingestion delayed and monitor evaluations degraded

highDatadogMonitoringDec 9, 2025 20:16Duration: 3h 29m
api
API Issue

Summary

This incident has been resolved.

Impact

major

Timeline

Oct 28, 2025 18:32

[identified] The issue has been identified and a fix is being implemented.

via statuspage
+3h 29m
Oct 28, 2025 22:01

[resolved] This incident has been resolved.

via statuspage
+1006h 15m
Dec 9, 2025 20:16

[investigating] We’re currently monitoring an issue causing delays in distribution metric processing in our US1 region.

via statuspage
+20m
Dec 9, 2025 20:36

[identified] The issue has been identified and a fix is being implemented.

via statuspage
+24m
Dec 9, 2025 21:00

[monitoring] Live distribution metrics are available and being evaluated for all monitors. Gaps in graphs from the beginning of the incident are in the process of being backfilled.

via statuspage
+8m
Dec 9, 2025 21:08

[resolved] This incident has been resolved. Live data is being processed normally and gaps in distribution metrics on graphs will be backfilled within the next hour.

via statuspage

Lessons Learned

Datadog has experienced 33 incidents in the past year. This frequency suggests systemic reliability challenges that may warrant additional monitoring.

📊Incidents related to api have occurred 186 times across all providers in the past year. This is one of the most common failure categories in cloud infrastructure.

💡This incident is categorized as: API Issue. Consider implementing preventive measures specific to this failure category.