Whether the mission is citizen service, public safety and protection, disaster recovery or any other public service undertaking, government agencies increasingly rely on IT—essentially a complex network of geographically dispersed heterogeneous devices, systems and applications. Maximizing their availability and performance is critical for mission success. This is all the more important in the digital age, where citizens expect service levels and interactions that rival commercial industry and timing matters.
This essentially warrants that the systems supporting these interactions are highly available and performing. If there is an outage, agencies need to recover quickly—not in hours or weeks, but in seconds and minutes. But for all their technological investments, the Achilles heel with government IT seems to be lack of end-to-end visibility across these systems, as reported in a government-wide survey, "Challenges & Trends in Public Sector IT Operations." Simply put, agencies don’t have the situational awareness across all their systems and assets, and in many cases, disparate deployments—cloud versus on-premises, for example—lead to blind spots resulting in their inability to quickly pinpoint issues and troubleshoot. And with increasingly mission critical services relying on this intricate IT infrastructure, imminent outage predictions (even if only 30 minutes in advance) can be a boon to IT operations staff.
So, here is a quick exercise. Do you know or have you checked how long it took to get systems back online in your organization after the last outage? According to our survey, it takes an average of 44 hours and involves 12 full-time resources! That’s a little over 13 man weeks—about a quarter of a year! Can you afford to spend that much time troubleshooting and fixing IT outages? Beyond just wasteful time spent by IT, what about the cost of downtime to the mission or the program?
That's why innovative organizations are turning to Splunk. For organizations that need to ensure success of their missions, the Splunk platform delivers real-time monitoring, troubleshooting and imminent outage prediction capabilities to maximize availability and performance of mission critical systems and applications that support their initiatives.
The National Ignition Facility (NIF) at the Lawrence Livermore National Labs (LLNL) is a case in point. It houses the world’s largest laser with a core mission of stewardship of our country’s nuclear arsenal. Splunk sits at the heart of the NIF’s Integrated Computer Control System (ICCS), which manages more than 66,000 control points to power NIF’s massive laser facility, ensuring the health of all devices in addition to their complex IT infrastructure including maximizing system uptime and performance. In addition, leveraging Splunk's machine learning capabilities, NIF engineers can detect when these sensors begin to decay, and perform predictive maintenance, avoiding unscheduled downtime.
We sleep better at night knowing that our country’s nuclear arsenal is in the very capable hands of these brilliant NIF engineers. And Splunk for added protection! Don't you?
You can hear directly from Phillip Adams, CTO and Chief Architect at the NIF at our webinar "Real-Time IT Monitoring: Your Mission Depends on It" on Tuesday, November 13. He will be joined by Jonathan Fair from InfoTeK (contractor to Joint Service Provider) and John Israel from August Schell where they will discuss how real-time monitoring is helping them keep their missions on track and how Splunk is helping them get it done. If mission success is important to you, I think you'll find this very well worth your time.
Until next time,