Delayed APM metric ingestion

highDatadogMonitoringDec 12, 2025 18:37Duration: 3h 15m
api
API Issue

Summary

All impact related to APM metrics has been resolved. A separate incident has been created to track the remaining impact in live process data.

Impact

major

Timeline

Dec 12, 2025 18:37

[investigating] We are currently investigating lag in ingesting apm metrics, which affects monitor evaluation.

via statuspage
+18m
Dec 12, 2025 18:55

[investigating] We are continuing to investigate this issue.

via statuspage
+31m
Dec 12, 2025 19:26

[investigating] We are currently investigating lag in ingesting apm and process metrics, which affects monitor evaluation and in some cases led to incorrect monitor alerts.

via statuspage
+1h 27m
Dec 12, 2025 20:53

[identified] We have identified the issue affecting ingestion delays in apm and process metrics and are working on recovery

via statuspage
+59m
Dec 12, 2025 21:52

[resolved] All impact related to APM metrics has been resolved. A separate incident has been created to track the remaining impact in live process data.

via statuspage

Lessons Learned

Datadog has experienced 33 incidents in the past year. This frequency suggests systemic reliability challenges that may warrant additional monitoring.

📊Incidents related to api have occurred 186 times across all providers in the past year. This is one of the most common failure categories in cloud infrastructure.

💡This incident is categorized as: API Issue. Consider implementing preventive measures specific to this failure category.