Skip to main content

use case

SLO/SLI monitoring

Keep tabs on important key performance indicators (KPIs) with business workflows.

slo-sli-use-case-dashboard-hero

challenge

Software complexity makes tracking performance challenging

In a cloud-native environment, lots of moving parts can make service-level objective (SLO) compliance difficult to track. Monitoring service-level indicators (SLIs) across different systems is time consuming without a single source of truth.

solution

Centralized SLO/SLI monitoring capabilities

reduce-time-to-detect reduce-time-to-detect

Explore visually

Look at logs and filter for the results that matter most with visualizations.

reduce-time-to-detect reduce-time-to-detect

Get smart alerting

Access context from logs for fast and accurate alerting and exploration.

Measure impact of every change Measure impact of every change

Measure every change’s impact

Correlate and understand how infrastructure health and application performance impact your digital customer experience.

protect

Manage your cloud KPIs

Streamline monitoring your cloud KPIs with out-of-the-box dashboards and configuration wizards. Group together relevant end-to-end traces and transactions based on any microservice, tag or initiating operation.

Stay on top of SLI degradation

Use NoSample full-fidelity tracing to track the performance of business workflows. Get alerts if a threshold is hit, a sudden change occurs or a historical anomaly arises. No traces are sampled. No issue goes undetected.

apm-dashboard-featured
rba

Accelerate troubleshooting

Easily see the impact radius and dependencies across your distributed environment with service maps. Get to root causes at a glance with tag spotlight.

ProductS

Fast-track performance measurement

Deepen your visibility and tap into the most valuable data and context for monitoring SLI/SLO.

RELATED USE CASES

More that you can do with Splunk

fast-flexible-service-excellence fast-flexible-service-excellence

DevOps

Observe all your data to instantly understand and optimize modern infrastructure and applications.

Learn More
application-modernization application-modernization

Application modernization

Measure the impact of software changes, understand service health and consistently deliver great customer experiences.

Learn More
incident-response incident-response

Incident response

Make on-call less stressful, send the right alerts, to the right people, at the right time for improved collaboration and issue resolution.

Learn More

integrations

Splunk App for Content Packs

Your one-stop shop for prepackaged content, out-of-the-box searches and dashboards for common IT infrastructure monitoring sources.

Learn more about service-level indicators and service-level objectives

Service-level objectives (SLOs) are reliability targets for technology products and services. Service-level indicators (SLIs) are the measurements that are compared against those reliability targets. Both SLIs and SLOs are vital to the broader service-level management ecosystem of any organization. 


Each SLI should be measured over a set time period and benchmarked against a corresponding SLO to set a minimum threshold for these targets.

Think of an SLO as a goal or ideal condition for a service to reach, such as 99.999% uptime. A mark on a dashboard would indicate the SLO at 99.999%. In this case, the SLI would be the needle on the dial that varies over a period of time. If that needle (the SLI) dips below the mark (the SLO), the system alerts the site reliability engineer to take action.

SLOs are widely used to measure the performance and reliability of various SaaS products, such as web hosting uptime, cloud storage availability and the latency or responsiveness of various cloud application hosting services.

Here are some typical SLI applications you’re likely to encounter:

  • Server uptime
  • Server latency
  • Error rate
  • Throughput/performance
  • Utilization
  • Data freshness

SLIs can also be used to gauge human performance. For example: service desk responsiveness and escalation level.

Get started

Work smarter. Maximize performance. Investigate and respond in seconds.