What is cardinality in the context of metrics monitoring?

Cardinality refers to the number of unique combinations of metric names and their associated dimensions or labels in a monitoring system.

Why is high cardinality a problem in observability systems?

High cardinality can lead to increased storage requirements, slower query performance, and higher costs in observability systems.

How does cardinality impact metrics storage and performance?

As cardinality increases, the number of unique time series grows, which can strain storage resources and degrade query performance.

What are some strategies to manage high cardinality in metrics?

Strategies include limiting the number of dimensions, aggregating data, filtering unnecessary labels, and using sampling techniques.

What are examples of high cardinality labels?

Examples include user IDs, session IDs, or any label that can have a large number of unique values.

Learn

April 07, 2025

4 Minute Read

Cardinality Metrics for Monitoring and Observability: Why High Cardinality is Important

By Chrissy Kidd

Key takeaways

Cardinality refers to the uniqueness of data within a dataset: In observability, high-cardinality metrics — such as unique user IDs or session IDs — provide granular insights but can increase complexity and resource usage in monitoring systems.
Balancing cardinality is key to effective monitoring: While high-cardinality data offers detailed visibility, excessive levels can lead to performance bottlenecks. Organizations should focus on capturing only the most meaningful metrics.
Managing cardinality improves observability and scalability: By optimizing data collection and storage strategies, teams can gain actionable insights without compromising system performance or cost-efficiency.

The shift to cloud-native infrastructure has transformed how we collect, store, and analyze metrics — bringing massive increases in data volume and cardinality.

In this article, we explore the concept of cardinality and its growing importance in modern monitoring and observability systems, especially in cloud-native environments.

What is cardinality?

Cardinality refers to the number of unique values in a dataset or field. A field or dataset may be either high cardinality or low cardinality:

A field containing user IDs or transaction types may have thousands of distinct entries — this is high cardinality.
In contrast, a field like "status" with values like "success" or "error" would have low cardinality because there’s only a few distinct values, repeated.

In the context of observability, cardinality determines how finely we can slice and analyze our metrics. High cardinality provides rich insight but introduces challenges around scalability, performance, and cost. (More on these challenges later.)

See how low vs. high cardinality compares:

This infographic illustrates the difference between high and low cardinality in monitoring metrics. High cardinality involves many unique values — such as user IDs or transaction types — enabling granular insights but increasing system load. Low cardinality metrics, like server status or boolean flags, offer broader trends with less overhead. Understanding the trade-offs is key to designing efficient observability systems.

Cardinality metrics

In monitoring and observability systems, the term “cardinality metrics” refers to metrics that track the number of unique combinations of a metric name and its associated labels or dimensions.

Here’s an example: A metric might be something like http_requests_total. Labels/dimensions are additional data points that give context, like:

method=GET
status=200
region=us-west

Each unique combination of these label values creates a new time series. The more unique combinations, the higher the cardinality.

So, let’s say a simple app has 10 HTTP methods, 50 status codes, and 20 regions, you could have 10,000 unique time series, just for a single metrics.

Why cardinality matters for monitoring and observability

High cardinality fields enable granular tracking of individual records or events. In monitoring, high cardinality helps to:

Differentiate between multiple instances of a service.
Track performance metrics per user or region.
Detect anomalies that broad aggregations might miss.

Observability takes things further — combining logs, metrics, and traces to deliver a complete picture of system behavior.

High cardinality is critical for debugging distributed systems because the higher the detail, the easier it is for engineers to trace individual user requests across microservices, identify anomalies within specific regions or clusters, and analyze vast datasets without losing crucial details.

We can sum up how high cardinality is so valuable, because it delivers:

Granular insights of individual services, endpoints, users, or transactions. This allows for precise anomaly detection and root cause analysis.
Improved filtering and querying thanks to high-dimension data based on specific attributes — region, customer ID, API version, for example — speeds up problem identification and diagnosis.
Enhanced customization and flexibility means teams can create tailored dashboards, alerts, and reports using unique combinations of metric dimensions, leading to more meaningful insights.

Challenges of high cardinality

Despite its benefits, high cardinality introduces significant operational challenges:

Scalability. Each unique combination of metric and label results in a new time series. This can quickly escalate to millions of series, overwhelming traditional time series databases (TSDBs).
Storage overhead. High cardinality metrics consume more storage, especially as time series grow with constant inserts. Efficient retention, aggregation, and compression strategies become essential.
Query performance. Complex queries across high-cardinality data are slower and more expensive to compute, especially when joins or long time ranges are involved.

How modern systems handle these challenges

Modern observability platforms like Splunk Observability Cloud are built to support high cardinality at scale. These platforms:

Index all metric tags equally.
Enable fast queries across millions of time series.
Provide real-time alerting without performance trade-offs.

Platforms like Splunk handle high cardinality and high dimensionality data, allowing users to freely query and filter on any attribute regardless of its cardinality.

Real world use cases

Workload monitoring. Moving beyond system metrics like CPU or memory, workload monitoring focuses on application-specific metrics such as:

Request latency
Error rates
Throughput

These metrics are often high cardinality due to their association with specific services, endpoints, or user actions.

(Related reading: common SRE metrics and monitoring for RED rate, errors, and duration.)

E-commerce. In an online store, each customer and order has unique identifiers. High-cardinality metrics allow monitoring by user ID or order ID, enabling use cases like:

Fraud detection
Regional trend analysis
Issue isolation by payment provider or shipping region

Without this level of detail, troubleshooting and incident resolution would be limited to general trends, making it harder to identify the root cause.

Together, all of this makes a better customer experience.

Wrapping up

Cardinality metrics are essential to modern monitoring and observability. They empower teams to diagnose issues with precision and maintain performance across complex, cloud-native architectures. But with great detail comes great responsibility — managing high cardinality requires thoughtful system design, efficient tooling, and disciplined metric instrumentation.

Understanding and harnessing cardinality can make the difference between reactive firefighting and proactive performance management.

See an error or have a suggestion? Please let us know by emailing splunkblogs@cisco.com.

This posting does not necessarily represent Splunk's position, strategies or opinion.

Observability Topics

Chrissy Kidd

Chrissy Kidd is a technology writer, editor, and speaker. The managing editor for Splunk Learn, Chrissy has covered a variety of tech topics, including cybersecurity, software development, and sustainable technology. She's particularly interested in how tech intersects with our daily lives.

Learn 7 Min Read

The Digital Resilience Guide: 7 Steps To Building Digital Resilience

When faced with major changes and cyberattacks, digital resilience ensures our systems can bounce back. See more, react fast. With digital resilience.

Learn 11 Min Read

Storage Monitoring: A Brief Introduction

In this blog post, we'll take a look at the basics for Storage Monitoring, best practices, definitions, and more.

Learn 3 Min Read

Advanced Encryption Standard & AES Rijndael Explained

Learn all about AES Rijndael, today's go-to algorithm that won a NIST competition for ensuring data confidentiality — and it does much more than that!

About Splunk

The world’s leading organizations rely on Splunk, a Cisco company, to continuously strengthen digital resilience with our unified security and observability platform, powered by industry-leading AI.

Our customers trust Splunk’s award-winning security and observability solutions to secure and improve the reliability of their complex digital environments, at any scale.

Learn more about Splunk

Subscribe to our blog

Get the latest articles from Splunk straight to your inbox.

Connect with Splunk on X

Follow @Splunk

Connect with Splunk on Instagram