Back to blog

Getting the Most Out of the Least With Datadog APM

Cut through duplicate Datadog dashboards clutter with template variables.

X

min read

September 8, 2022

Nick Vecellio

Something I find quite frequently within Datadog is an overabundance of dashboards. That’s not to say that having a lot of dashboards is a bad thing; sometimes it’s necessary! What I’m talking about is a lot of dashboards that are duplicated, but don't need to be. Most times this is an easy fix - we take a solid dashboard and use template variables to make them a bit more dynamic. Instead of having a dashboard for dev, staging, and prod, we make one that has an $env variable that we can just swap and have the graphs change accordingly.

But…there’s one spot where this isn’t going to work as well, and that’s when it comes to APM metrics. In the metric summary page, if we take a look for trace.* metrics, we’ll see something like this:

And if we were to look a little closer…

We’re looking at the metrics sent by the Datadog Tracer Agent. These are used to make the basic APM service visualizations that you have access to via the service summary page. On closer inspection of these metric namespaces, we’ll see that after trace we have the name of the submitting service, and after the service name we have all sorts of different breakdowns of the metric values (like duration, errors, etc). If we’re running an environment that is solely based around a few technologies, it may be totally acceptable to create a dashboard using these metrics.

However, what happens when we’re in an environment that has numerous types of caches, databases, and microservices written across a number of languages? We will need to create a dashboard for each type of service to view JUST basic HTTP metrics, meaning we need to sort through hundreds of metrics, determine what they are and what they mean, and then lay them out on a dashboard. In large environments, this is very difficult to maintain, and creates a higher barrier to entry for monitoring a new service. So how do we fix this?

Datadog offers APM Retention Filters; by enabling this globally or for certain services, Datadog performs an analysis of APM data and allows us a universal dashboard widget that allows us to perform calculations that are (effectively) service agnostic (As a note - Retention Filters have a certain amount of retained traces that are included in your subscription, but additional charges will apply once that threshold is crossed).

Before enabling Retention Filters, our dashboard panels would look something like this:

After Retention Filters….

Furthermore, this panel gives us a significant number of options to filter, arrange, and calculate our data.

By adding a simple split by Resource on this panel, we can now see the average duration of requests against every different resource name presented by this application. While this is a simple example of the use case, the power here comes from not having to know or care about the name of the metric (is it trace.http.request.duration? Or is it trace.rails.request.duration?), which allows you to easily set variables for $service and $env and dynamically switch between different services and environments.

The end result can be a singular dashboard that shows you everything that you would care about at a high level for any application - request latencies, errors, http error codes, and so on - without having to create and maintain multiple bespoke dashboards for every individual application. You can share this dashboard among any application team and have them get the same information as everyone else! The cherry on top, though? Any new app now has a dashboard with no additional effort or turnaround time, reducing your engineering time and streamlining the go-live process.

We don’t believe in hoarding knowledge

We go further and faster when we collaborate. Geek out with our team of engineers on our learnings, insights, and best practices.

Blog Posts

Datadog

Blog

Implementing Datadog Cloud Security Posture Management

Best practices for implementing Datadog Cloud Security Posture Management without the noise

Datadog

Blog

RapDev’s DASH 2026 Highlights & Takeaways

A look at RapDev's biggest DASH yet: five consecutive Partner of the Year wins and much more

View all posts

Resources

Datadog

Video

Migrate from Legacy Tools to Datadog Seamlessly

Learn how to modernize your observability stack with a migration approach built for speed and stability

Datadog

Video

Building a Modern Incident Response Model with Datadog and RapDev

Learn how to unify Datadog and ServiceNow for faster, AI-powered incident response

View all resources

Datadog Expertise

Datadog

Featured

RapDev & Datadog Overview

Datadog

Featured

Deploying Monitoring as Code with oneZero

Datadog

Featured

Reducing Costs & Noise with a Splunk-to-Datadog Migration

Datadog

Featured

Transforming Security Operations with Managed SOC Expertise

Datadog

Featured

Operationalize AI with Datadog Bits AI SRE

Datadog

Featured

Transforming Observability Operations with RapDev’s Managed Datadog Expertise

Datadog

Featured

How Wawa Maximizes Observability ROI with Datadog

Datadog

Featured

Datadog Observability Maturity Assessment Workshop with RapDev

Datadog

Featured

Implementing Centralized Monitoring & Incident Management at BCG

ServiceNow

Featured

ServiceNow Overview

ServiceNow

Featured

ServiceNow Agentic AI Implementation

ServiceNow

Featured

Reclaiming Visibility & Uptime with ServiceNow

ServiceNow

Featured

ADT’s Onboarding & Automation Journey with ITOM, ITAM, & ITSM

ServiceNow

Featured

Scaling Hardware Asset Lifecycles with Self-Service HAM

ServiceNow

Featured

Re-Engineering Envision’s SPM to Align Strategy & Execution

ServiceNow

Featured

Re-Engineering Envision’s SPM to Align Strategy & Execution

ServiceNow

Featured

CI/CD Automation & Cutting Investigation Time for Northern Trust

ServiceNow

Featured

Improving CMDB Data Quality for a National Healthcare Provider

ServiceNow

Featured

Smarter Vulnerability Response in 12 Weeks at Sallie Mae

ServiceNow

Featured

The Journey to Automate Everything for a Global Insurance Company

Best Practices

Datadog

July 22, 2026

Implementing Datadog Cloud Security Posture Management

Datadog

July 8, 2026

Four Ways to Secure Your Datadog Organization Settings

RapDev

May 19, 2026

Building Internal Tools That Stick

Agentic AI Posts

ServiceNow

July 30, 2026

Not Every ServiceNow AI Problem Needs an AI Agent

ServiceNow

July 14, 2026

An Agentic Self-Healing Incident Pipeline Powered by Now Assist

Datadog

RapDev