Braze Scales Observability & Customer Support with Datadog
Client
Braze
Industry
Technology & Customer Engagement
AI Tech Solution
AI-Powered Observability & Customer Support
Solution Provider
Datadog
Challenge
Braze, a leading customer engagement platform, processes over 8 billion API requests daily and delivers more than 2.7 billion messages per day. As the company rapidly scaled, its observability and customer support processes struggled to keep up with the growing complexity of its infrastructure. Engineering teams relied on multiple, disconnected tools for performance monitoring, making it difficult to track infrastructure provisioning, forecast future resource needs, and diagnose performance-related issues. Additionally, technical support tickets were frequently escalated directly to Product and DevOps teams, increasing MTTR and delaying customer issue resolution.
Solution
Braze deployed Datadog as a unified observability platform to provide full visibility across infrastructure, application performance, and customer interactions. With Datadog’s application performance monitoring (APM), infrastructure monitoring, and logging solutions, Braze established a standardized observability framework for all eight product teams. Engineers could now track performance metrics, identify service dependencies, and troubleshoot issues efficiently. The Global Services and Support team leveraged Datadog’s real-time dashboards and alerting capabilities to triage customer support tickets faster. Integration with Slack and PagerDuty enabled immediate access to incident data, improving response times. Datadog's template variables and filtering tools allowed Customer Success Managers (CSMs) to analyze customer-specific metrics in real time, empowering them to resolve technical issues without escalation.
Results
By adopting Datadog as its centralized observability platform, Braze significantly improved engineering efficiency, customer support operations, and system reliability. Processing time for email rendering services was reduced by 90 percent, optimizing messaging infrastructure and improving service delivery. All eight product teams now operate on a single, standardized monitoring platform, ensuring full visibility into application performance. Customer support teams gained the ability to diagnose issues independently, reducing engineering team reliance and accelerating ticket resolution times. This led to faster incident response, improved service level adherence, and higher customer satisfaction. With a scalable observability framework, Braze is now positioned to confidently support its growing customer base while maintaining high availability and performance standards.
Read Full Case Story
ITOpsAI Hub
A living library of AI insights, frameworks, and case studies curated to spotlight what’s working, what’s evolving, and how to lead through it.