Before joining Splunk, I had the opportunity to lead observability operations for several Fortune 500 companies. During that time, we shifted from traditional monitoring vendors of the time to more agile, best-of-breed, often niche observability solutions. This shift gave our teams flexibility to move fast and instrument what mattered, via self-service.
One of those tools (not to be named) became our core metrics engine. Self-service adoption took off. Developers and SREs had visibility like never before.
But then the telemetry sprawl hit!
Our metric volume exploded. Cardinality was through the roof. And worse, we had no insight into which metrics were actually useful. When we asked for help, the vendor’s response was an “opportunity” to renew our annual contract early. We knew we had a problem, but we had no way to clearly identify it — let alone fix it — without risking critical coverage.
That’s exactly the kind of problem Splunk Observability Cloud’s Metrics Management capabilities are built to solve.
In modern observability, more data isn’t always better. As SRE, DevOps, and platform teams adopt self-service instrumentation, metric volume tends to grow unchecked. Add in the growing complexity of today’s IT environments — distributed architectures, ephemeral infrastructure, and multi-cloud sprawl — and it’s easy for telemetry to become overwhelming.
The result? Noise, confusion, and surprise overages that are tough to trace back to specific sources.Most organizations struggle to answer basic questions like:
Splunk Observability Cloud’s Metrics Management capabilities helps you answer these questions and take back control. And the best part? You can do it all centrally, without editing collector configs or backend systems.
With Metrics Management, you can:
Capability | What it Does | How to Use it / When It Helps | Category |
---|---|---|---|
Usage Analytics | Centralized and filterable view of metrics across your entire deployment, allowing you to perform criteria-based deep dives to optimize | Identify unused, redundant, or high-cardinality metrics and zero in on optimization opportunities | SEE |
Metric Profile View | Deep dive into a specific metric’s context. This includes dimensions, data sources (tokens), associated charts and alerts | Pinpoint ownership, analyze cardinality drivers, and assess potential blast radius before making changes to metrics | UNDERSTAND |
Metrics Pipeline Management (MPM) | You’ve found the opportunity and clearly understand what to do. Pipeline Management provides a point-and-click UI to drop, archive, or aggregate with confidence | Execute metric cleanup centrally without touching collector configs. | ACT |
In this article we’ll double-click into each of the capabilities with a quick overview and some practical guidance on how you might leverage it.
The Usage Analytics view shows:
This is your source of truth for what’s being ingested, used, or just wasting space.
Here is a detailed overview of the fields in the table
Field | Description |
---|---|
Metric name | The name of the metric. |
Billing class | Class of metric for billing purposes (host, billing, or custom). To learn more about billing classes, see Metric categories. |
Utilization | Whether the metric is used. “Unused” indicates that the metric is producing MTS, but these values aren’t utilized in Splunk Observability Cloud. |
Utility score | Indicates how much the metric is used. A high utility score means higher usage. |
Metric time series (MTS) | The average number of MTS associated with this metric, measured per hour. |
Percentage of total | How much of your total usage plan this metric utilizes. |
Pro Tips:
Clicking into a metric, within the “Usage Analytics” dashboard, opens the Metric Profile, where you’ll find:
The Metric Profile view allows you to better understand what is contributing to the MTS count (often dimensions) and where the metric is being leveraged in your deployment. This allows you to make data driven optimizations, such as leveraging Metrics Pipeline management to archive, aggregate, and/or drop metrics.
Here is a detailed overview of the fields in the table
Field | Description |
---|---|
Dimensions | Displays the dimension name of each metric sorted by average hourly MTS count. High-cardinality dimensions appear at the top of the list. |
Tokens | Displays the token name and ID for each metric, sorted by the number of metric time series associated with the token. |
Charts | Displays the charts and dashboards associated with each of your metrics, as well as the user who last updated the chart and the time they updated it. |
Detectors | Displays the detectors associated with each of your metrics, as well as the user who last updated the detector and the time they updated it. |
Pro Tips:
Again, this is the context you need to make confident, data-driven decisions about what to optimize or keep.
Once you know what needs cleanup, Metrics Pipeline Management (MPM) gives you the tools to do it easily. Quickly start by clicking the blue “Create Rule” button from the metric profile page.
With MPM’s point-and-click interface, you can:
In the screenshot, a simple rule reduces raw MTS by 66% just by removing an overly verbose dimension.
Dramatically reduce MTS volume while preserving insight. In MPM, use aggregation rules to group and roll up MTS by meaningful dimensions — like region or service — and drop noisy dimensions, such as container_id. Keep the insight, quiet the noise.
Pro Tips:
This is where savings happen. MPM gives you the ability to optimize metrics on your terms, with full visibility and control.
Metric sprawl can sneak up on even the most mature teams, especially when SREs, DevOps, and platform teams fully embrace self-service. What starts as healthy adoption can quickly turn into a tangle of unused data, rising bills, and unclear ownership.
And with the ongoing explosion of tools, services, and telemetry sources across increasingly complex environments, there’s only more data coming. The challenge isn’t just about scale, it’s about (re)gaining control before things .
Splunk’s Metrics Management gives you the tools to fight back. These capabilities provide visibility into what’s being collected, clarity on what matters, and a simple interface to take action when things get out of hand or when opportunities for optimization arise.
Ready to Get Started? To determine your current metrics utilization, understand how/if they are being used, and centrally optimize them via pipeline management, follow these simple steps!
Need additional help? Check out the official docs or connect with your Splunk account team. We're happy to guide you through it.
If you're facing growing costs, unclear metric usage, and no easy way to optimize, you're not alone. Start your 14-day free trial of Splunk Observability Cloud today and experience how easy it is to take back control of your metrics.
Looking for a platform that delivers these must-have features? Splunk Observability Cloud is a leading modern observability platform that supports end-to-end visibility and enables self-service observability across the enterprise.
Check out this Splunk Tech Talk that shows these concepts in action:
The world’s leading organizations rely on Splunk, a Cisco company, to continuously strengthen digital resilience with our unified security and observability platform, powered by industry-leading AI.
Our customers trust Splunk’s award-winning security and observability solutions to secure and improve the reliability of their complex digital environments, at any scale.