Can You Predict Which Blogs Someone Will Like?

Big data may be the new big buzzword, but we’re also seeing an uptick in mentions of data science and data scientists. As a recent Fortune article, “The hot tech gig of 2022: Data scientist,” indicated, by the end of the decade 50 billion devices will be emitting information nonstop, and data scientists will help to manage, and make sense of, all that data.

That is why we at Splunk are hoping to address at least part of the problem data scientists face: collecting, correlating and searching the data. Splunk can help data scientists spend less time on up-front data parsing, so they can get to the fun (and valuable) part: searching, asking questions of, modeling and trending their data.

Margit Zwemer (@MPZwemer), Data Scientist and Evil Genius in Residence at Kaggle, was excited to use Splunk for her data science endeavors: “I’m always excited to use something new, and with Splunk it’s amazing how I can get in and start looking at the data right away. It’s a huge time savings.”

So we decided to partner with Kaggle to expose Splunk to data scientists. This morning at GigaOm Structure (#structureconf), a conference aimed at managing large-scale cloud infrastructures, GigaOm, Kaggle and Splunk announced the GigaOm Data Challenge, which uses data from WordPress.com. The goal of the competition is to predict which people will “like” which blog posts from across 90k active blogs on WordPress.com. (They host 74 million WordPress sites globally, over 16% of all domains on the web.) The winning solutions may be used by WordPress.com in a recommendation engine, but winning solutions must be open-sourced, so anyone could use them to solve a similar problem using similar data in a similar domain.
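To make the prediction task concrete, here is a minimal Python sketch of one naive baseline: recommend posts that were liked by people who share likes with you. Everything in it is an assumption for illustration. The `LIKES` toy data, the `recommend` function and the scoring rule are hypothetical, and none of it reflects the actual contest data format or any winning approach.

```python
from collections import Counter, defaultdict

# Hypothetical toy data: each (user, post) pair means "user liked post".
# This is NOT the contest's data format, just an illustration.
LIKES = [
    ("ann", "p1"), ("ann", "p2"),
    ("bob", "p1"), ("bob", "p2"), ("bob", "p3"),
    ("cat", "p2"), ("cat", "p3"),
    ("dan", "p4"),
]


def recommend(likes, user, k=3):
    """Return up to k posts the user has not liked yet.

    Each candidate post is scored by summing, over every other user who
    liked it, the number of liked posts that user shares with `user`.
    """
    liked_by = defaultdict(set)
    for u, p in likes:
        liked_by[u].add(p)

    seen = liked_by.get(user, set())
    scores = Counter()
    for other, posts in liked_by.items():
        if other == user:
            continue
        overlap = len(seen & posts)
        if overlap == 0:
            continue  # no shared taste, so ignore this user
        for p in posts - seen:
            scores[p] += overlap

    # Highest score first; ties broken by post id so output is stable.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [p for p, _ in ranked][:k]


if __name__ == "__main__":
    print(recommend(LIKES, "ann"))
```

A real entry would need far more than this (post content, timing, blog metadata), but a co-like count is a common first thing to try before reaching for a heavier model.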

For Splunk’s part, we’re making the Splunk platform available for the GigaOm Data Challenge. We believe that understanding the connections and trends in your data can open up all sorts of new discoveries (#datajourney), and we are excited to work with data scientists across the globe to uncover the patterns and intelligence in massive data sets across different domains. There’s a prize earmarked solely for the best use of Splunk, so we hope to see many folks participating in the contest!

Tools to get you started with Splunk:

Predicting likes on WordPress is just another way you can use Splunk to answer the question: Where will your data take you?

The opportunity is there. Do you know where your data will take you? Join us at .conf2012, Splunk’s 3rd Annual Worldwide Users’ Conference, to learn where your data can take you: http://www.splunk.com/goto/conf

Posted by Erin Sweeney
