SPLUNK FOR GOOD

Updated RSS Input (Java Version)

Last year, I put out a Java version of a RSS Input program that was based on included open source to parse RSS. It used the beta version of the Splunk Java SDK to check for duplicates to make sure in a reasonable time period the same RSS link wasn’t already indexed into Splunk. With the GA release of the Splunk Java SDK, I updated the contents on Splunkbase to include the GA Splunk Java SDK jar file and also used a more efficient way to check for a duplicate entry. You can download the distribution on Splunkbase.

To recap, the distribution uses a scripted input to index the contents of configurable RSS feeds every configurable N seconds. You can then use Splunk to look at historical RSS headlines and correlate events with other things that you might be indexing. You can also use the included workflow action to read the article if it is still available from the originator.

We should thank the genius of Aaron Swartz for helping create the RSS protocol. RIP Aaron.

Nimish Doshi
Posted by

Nimish Doshi

Nimish is Director, Technical Advisory for Industry Solutions providing strategic, prescriptive, and technical perspectives to Splunk's largest customers, particularly in the Financial Services Industry. He has been an active author of Splunk blog entries and Splunkbase apps for a number of years.

TAGS
Show All Tags
Show Less Tags