TalkTalk Uses Splunk to Detect Problems Early and Improve Network Performance
Providing landline, broadband, fiber, TV and mobile services to UK consumers and businesses, TalkTalk aims to use data and automation to become the country’s most recommended communications provider. To meet growing consumer and business demand for bandwidth, TalkTalk has set a strategic goal of simplifying operations across its business. Since deploying Splunk technology, TalkTalk has seen benefits including:
- Improved network performance, increased reliability and happier customers due to early detection of systems problems
- A more positive customer experience and lower costs through proactive monitoring
- A better understanding of customer behavior with Splunk dashboards
SPLUNK USE CASES
Challenges
- Legacy systems made it difficult to gather sufficient data on network performance
- Limited ability to identify problems quickly and effectively
- Lack of pertinent, timely information on service outages and affected customers
Business Impact
- Created a more complete picture of the network by combining performance metrics
- Drastically reduced cases of underperformance across more than 5,000 exchanges
- Increased network reliability and strengthened brand reputation while providing opportunities for future cost savings
- Improved customer experience and uptime through the ability to detect network outages almost instantly and pinpoint which customers are affected
Data Sources
- DNS resolvers
- JDSU revelations
- LDAP logs
- Network Index
- OSS logging
- RADIUS Accounting
- RADIUS Authentication
- VIAVI test heads
After suspecting that its domain name system (DNS) was underperforming, TalkTalk turned to Splunk to bring data to questions and decisions around its existing system. Thanks to Splunk technology, TalkTalk identified the root problem, which enabled the team to present a data-supported business case to replace its entire DNS platform. This success unleashed new possibilities for the TalkTalk team, raising the question of what other problems Splunk could help solve. “We started to feed data into Splunk and then, just like magic, we now have this great world of performance driven largely by Splunk,” says Paul Emmett, head of network operations.
Having operated a network operations center (NOC) for years, TalkTalk monitored network services in a traditional way, yielding what the company termed “binary measures of network performance.” With Splunk Enterprise in place, TalkTalk united these metrics to gain a better understanding of service performance as a whole. For Matt Wood, the company’s head of labs, the Splunk platform’s ease of use, rich functionality and wide availability of apps stood out the most. “It’s very easy to use. You can just dive in,” he says.
Improved network reliability, happier customers
Before deploying Splunk, TalkTalk struggled with thousands of cases of underperformance across its roughly 5,000 exchanges. Once Splunk software was installed, the communications provider began identifying problems much faster, resulting in far fewer of these “red exchange” incidents. “This week, I recorded one red exchange incident against fiber and about six against copper services,” Emmett says. “So we’ve gone from several thousand issues to what we can count on both hands.”
With improved network performance and reliability, TalkTalk can further strengthen its brand reputation while meeting customers’ rising bandwidth expectations. TalkTalk believes these optimized systems will also yield future cost savings since many customers select their broadband services based on metrics like connectivity and reliability.
A strategic response to service outages
Because the team lacked access to pertinent, timely information, service outages were once very costly and time-consuming for TalkTalk. With Splunk, however, the company now detects an outage within seconds or even milliseconds, compared with minutes previously, and can precisely pinpoint which customers are affected.
Building on this, the company is implementing a project designed to make outage data rapidly available to its CRM systems. With greater visibility into an outage’s cause and status, the TalkTalk team can better serve, inform and help any affected customer who calls. The project also aims to optimize time and resources with a more effective system that dispatches engineers and replacement routers only when necessary.
“It’s about spotting where we have a flawed process, then using the Splunk platform to provide us with a list of affected customers so we can fix the problems using robotic process automation (RPA). Splunk helps us tactically fix processes because it gives us access to the data.”
Paul Emmett, Head of Network Operations, TalkTalk
Turning data into action, daily
Given its marked success across the business, Splunk has become an integral part of TalkTalk’s operations, with Wood and Emmett relying on its technology every day. “We do 60 million network performance tests per day to look at the performance of broadband services across the UK, and Splunk detects problems instantly,” Emmett says. Splunk’s ability to bring data to every question and action has led the TalkTalk team to explore additional use cases, such as using Splunk within IT operations to detect problems faster and more effectively.