Announcing Splunk Federated Search for Amazon S3 Now Generally Available in Splunk Cloud Platform

Splunk is pleased to announce the general availability of Federated Search for Amazon S3, a new capability that allows customers to search data from their Amazon S3 buckets directly from Splunk Cloud Platform without the need to ingest it.

Enterprises rely heavily on cloud object storage services as the de facto destination for their new data to leverage the cost, compliance, security, scalability and manageability benefits that cloud platforms can offer. Amazon S3 is one of the largest services available today, with over 280 Trillion objects all over the world. However, one of the biggest concerns when using cloud storage solutions is data movement, since it can introduce latency and egress costs when attempting to search that data.

To address this challenge, Splunk users can now search data at rest within their Amazon S3 buckets directly from their Splunk Cloud Platform stack, ideal for investigations that require as-needed access to historical, archival, or low-value data. What’s more, you can still run SPL searches, create dashboards, reports, and correlate data between Amazon S3 and Splunk.

It is important to note that data that requires real-time search performance and high access frequency should still be accessed using Splunk Search on indexed data.

Federated Search for Amazon S3 is supported via an integration with AWS Glue Data Catalog, which provides the schema and metadata necessary to read compatible datasets from Amazon S3. AWS Glue Data Catalog tables provide the necessary schema that Splunk Cloud Platform needs to make sense of the data stored in Amazon S3. This also allows Splunk to search popular data formats such as JSON, CSV, Parquet, XML, ORC and more!

In turn, this integration enables Splunk Admins and users to benefit from the following use cases:

  1. Perform forensic investigations directly on historical data stored in Amazon S3 at rest.
  2. Run large statistics searches over historical data in Amazon S3.
  3. Leverage Amazon S3 as part of their data tiering strategy to store data outside of retention period.

Federated Search for Amazon S3 is available for Splunk Cloud Platform stacks hosted on AWS running on version 9.0.2305. Access to Federated Search for Amazon S3 requires a Data Scan Units license for your Splunk Cloud Platform stack. Contact your Splunk sales representative to learn more about this.

For more about Federated Search for Amazon S3, check out the documentation and release notes, dig into our validated architectures, and tune into our webinar on how to seamlessly search your data with Splunk and AWS.

Related Articles

Smarter Root Cause Analysis: Determining Causality from your ITSI KPIs
Platform
2 Minute Read

Smarter Root Cause Analysis: Determining Causality from your ITSI KPIs

Root cause analysis can be a difficult challenge when you are troubleshooting complex IT systems. In this blog, we are going to take you through how you can perform root cause analysis on your IT Service Intelligence (ITSI) episodes using machine learning, or more specifically causal inference.
Smarter ITSI Episodes Powered by Community Detection Algorithms
Platform
6 Minute Read

Smarter ITSI Episodes Powered by Community Detection Algorithms

In this blog we are going to describe how you can create a notable event policy in IT Service Intelligence (ITSI) that is able to group your events using labels generated by unsupervised machine learning in the Smart ITSI Insights App for Splunk – and don’t worry you don’t have to be a data scientist to read this blog!
Making Smarter Predictions in ITSI
Platform
3 Minute Read

Making Smarter Predictions in ITSI

As we are trying to commoditize machine learning through our MLTK smart workflows, this article outlines another example of an MLTK smart workflow, designed to help improve the usability of the predictive capabilities in ITSI.
Detecting Credit Card Fraud Using SMLE
Platform
4 Minute Read

Detecting Credit Card Fraud Using SMLE

In this blog post, we’ll explore an ML-powered solution using the Splunk Machine Learning Environment to detect fraudulent credit card transactions in real time. Using out-of-the-box Splunk capabilities, we’ll walk you through how to ingest and transform log data, train a predictive model using open source algorithms, and predict fraud in real-time against transaction events.
Splunk AR: Admin AR Web App
Platform
2 Minute Read

Splunk AR: Admin AR Web App

Check out how the Splunk AR web app allows administrators to manage their entire AR experience at scale and all in one unified place.
Get to Know Splunk Machine Learning Environment (SMLE)
Platform
5 Minute Read

Get to Know Splunk Machine Learning Environment (SMLE)

An introduction to SMLE Labs and a showcase of the various ML capabilities at a high level by walking you through the environment, step-by-step.
Walkthrough to Set Up the Deep Learning Toolkit for Splunk with Amazon EKS
Platform
6 Minute Read

Walkthrough to Set Up the Deep Learning Toolkit for Splunk with Amazon EKS

Splunk DLTK supports Docker as well as Kubernetes and OpenShift as container environments. In this article, we will go through the setup for using DLTK 3.3 and Amazon EKS as a kubernetes environment.
Advanced Painting with Data: Choropleth SVG
Platform
5 Minute Read

Advanced Painting with Data: Choropleth SVG

Curious about some more advanced use cases with Choropleth SVG in Splunk? Take a look at this blog to find out about animations, custom gauges, and why emojis matter!
Splunk Cloud Self-Service: Announcing The New Admin Config Service API
Platform
3 Minute Read

Splunk Cloud Self-Service: Announcing The New Admin Config Service API

The Admin Config Service is a set of modern REST APIs that will empower Splunk Cloud admins with a simple, yet powerful set of self-service capabilities.