observability

Splunk AI Observability

Observe and optimize the performance, quality, cost, and security of your entire AI stack.

Free trial Try Splunk Observability Cloud free for 14 days.

Take a guided tour Got 5 minutes? Get a quick look at how it works.

Cisco acquired Galileo Technologies, Inc.

Galileo, an AI observability leader, will help us ensure AI is more reliable, trustworthy, safe, and observable.

Learn more

Build trust and reliability across the AI stack

Evaluate, improve, and safeguard agents throughout the agent development lifecycle (ADLC)

Observe and optimize the performance, quality, security, and cost of LLM and agentic applications

Pinpoint the root cause of degraded agent and model performance, quality, and behavior. Trace and map agent workflows from request to response. Evaluate performance — latency, errors, token usage — alongside quality and security metrics like hallucinations, bias, prompt injection, and PII leakage to safeguard and improve user experiences.

Observe and optimize the health, availability, and usage of AI infrastructure

Track and visualize operational and tokenomics metrics like GPU and memory utilization, power consumption, and time-to-first-token for model providers, vector databases, and other AI components. This helps you better manage costs, understand business impact, and proactively alert on bottlenecks and spikes that lead to performance degradation and inefficiency.

Ensure AI performs as intended and at the right cost

Explore the documentation

Out-of-the-box quality evaluations

Evaluate model and agent performance, quality, and behavior in real time

Track and measure output quality with scores that detect issues like hallucinations, bias, relevance, toxicity, sentiment, and other out-of-the-box evaluators.

Token usage and cost

Control costs and optimize resources across the AI stack

Track token usage, operational expenses, GPU and memory utilization, and other tokenomics and GPU-related metrics from specific requests, models, agents, workflows, and other AI infrastructure components over time.

Built-in guardrails and controls

Pinpoint and mitigate AI security risks in real time

Safeguard and improve models with security, privacy, and safety guardrails for PII, PHI, and PCI leakage, tool misuse, and prompt injection. Comply with AI security standards to confidently build and deploy trustworthy AI applications and systems.

Agent performance analysis

Observe agent performance

Track the requests, errors, latency, token usage, and quality scores of individual agents over time to establish baselines, detect outliers, and make data-driven decisions.

pd-o-ai-olly-features-performance-analysis

Agent workflow analysis

Visualize the sequence of steps, dependencies, and handoffs

See the associated tool calls, models, and retrieval steps of an agent workflow — from request to response — to investigate failures.

Interaction-centric trace views

Get trace-level visibility with AI details, tags, and span details

View inputs, outputs, and system prompts alongside quality metrics that are associated with each step of the agent workflow for end-to-end root cause.

ai integrations

Integrations to observe the entire AI stack with Splunk

View all integrations

RESOURCES

Explore more from Splunk

Monitor LLM and agent performance with AI Agent Monitoring in Splunk Observability Cloud

Read the blog

Related capabilities

Splunk Application Performance Monitoring

Full-fidelity tracing and always-on profiling to enhance app performance.

Learn more

Splunk Infrastructure Monitoring

Real-time monitoring of cloud, hybrid, and on-prem environments.

Learn more

Splunk AppDynamics

Observe hybrid and on-prem applications across every environment.

Learn more

Splunk AI SRE

Agentic AI across the entire incident lifecycle.

Learn more

Get started with Splunk

Discover Splunk products and build digital resilience for the agentic AI era.

Request a demo

Explore free trials

See Splunk Cloud Platform in action

See why Splunk is an 11-time Leader in the Gartner® Magic Quadrant™ for SIEM

See how Splunk is a 3-time Leader in the Gartner® Magic Quadrant™ for Observability Platforms

Support

observability

Splunk AI Observability

Cisco acquired Galileo Technologies, Inc.

Build trust and reliability across the AI stack

Observe and optimize the performance, quality, security, and cost of LLM and agentic applications

Observe and optimize the health, availability, and usage of AI infrastructure