Braze Scales Observability & Customer Support with Datadog

Client

Braze

Industry

Technology & Customer Engagement

AI Tech Solution

AI-Powered Observability & Customer Support

Solution Provider

Datadog

Challenge

Braze, a leading customer engagement platform, processes over 8 billion API requests daily and delivers more than 2.7 billion messages per day. As the company rapidly scaled, its observability and customer support processes struggled to keep up with the growing complexity of its infrastructure. Engineering teams relied on multiple, disconnected tools for performance monitoring, making it difficult to track infrastructure provisioning, forecast future resource needs, and diagnose performance-related issues. Additionally, technical support tickets were frequently escalated directly to Product and DevOps teams, increasing MTTR and delaying customer issue resolution.

Solution

Braze deployed Datadog as a unified observability platform to provide full visibility across infrastructure, application performance, and customer interactions. With Datadog’s application performance monitoring (APM), infrastructure monitoring, and logging solutions, Braze established a standardized observability framework for all eight product teams. Engineers could now track performance metrics, identify service dependencies, and troubleshoot issues efficiently. The Global Services and Support team leveraged Datadog’s real-time dashboards and alerting capabilities to triage customer support tickets faster. Integration with Slack and PagerDuty enabled immediate access to incident data, improving response times. Datadog's template variables and filtering tools allowed Customer Success Managers (CSMs) to analyze customer-specific metrics in real time, empowering them to resolve technical issues without escalation.

Results

By adopting Datadog as its centralized observability platform, Braze significantly improved engineering efficiency, customer support operations, and system reliability. Processing time for email rendering services was reduced by 90 percent, optimizing messaging infrastructure and improving service delivery. All eight product teams now operate on a single, standardized monitoring platform, ensuring full visibility into application performance. Customer support teams gained the ability to diagnose issues independently, reducing engineering team reliance and accelerating ticket resolution times. This led to faster incident response, improved service level adherence, and higher customer satisfaction. With a scalable observability framework, Braze is now positioned to confidently support its growing customer base while maintaining high availability and performance standards.
Read Full Case Story

ITOpsAI Hub

A living library of AI insights, frameworks, and case studies curated to spotlight what’s working, what’s evolving, and how to lead through it.

What you’ll find in AI Blogs & Insights:

  • Practical guides on AIOps, orchestration, and AI implementation
  • Use case breakdowns, frameworks, and tool comparisons
  • Deep dives on how AI impacts IT strategy and operations

Many AI tools symbols in a vertical row. colors purple and blue.

What You'll Find in Resources:

  • Curated reports, research, and strategic frameworks from top AI sources
  • Execution guides on governance, infrastructure, and data strategy
  • Trusted insights to help you scale AI with clarity and confidence

AI Brain on a circuit board. Colors purple, blue

What You'll Find in Case Studies:

  • Vetted examples of how companies are using AI to automate and scale
  • Measurable outcomes from infrastructure, IT, and business transformation
  • Strategic insights on execution, orchestration, and enterprise adoption