Know Your Customer Again Revisited

At the end of last year, I wrote about using Splunk to monitor the Know Your Customer (KYC) use case that is a regulation in most Financial Services Institutions in many countries. The last part of the regulation states that continuous monitoring of your customers in terms of their interactions and transactions needs to take place.

In any bank, there are many types of transactions covering various things such as core banking, ATM, wire transfers, credit card use, payments, etc. Every application involved in these activities produces its own time services log data that is used for troubleshooting, security tracking, and analytics. Let’s revisit the example I presented last year. Suppose we are only monitoring a core banking feature for deposits and withdrawals for each customer. The simplest possible representation of this can be given with this example table, which is from the previous KYC blog.

Timestamp
AccountID
amount
11/2/2022 5:06:30
123
50
11/2/2022 5:06:30
456
6345
11/2/2022 5:06:30
123
53
11/2/2022 5:06:30
456
4353
11/14/2022 9:46:30
123
51
11/14/2022 9:46:30
456
6345

What was suggested last year is to use the Splunk stats command to find the average amount per account ID for every entry and then find any account that is more than N standard deviations from the average of the entity itself. For instance, if account ID 123 usually has an amount around 50 and then suddenly transacts 10,000, this would be an outlier that would easily be found. Yes, we can do this exercise on paper, but with a million accounts and monitoring each account separately requires continuous monitoring. We can then collect the outliers per account in a risk index and score them accordingly for further analysis.

Easily Operationalizing the Approach

Everything I suggested above still applies, but we recognize that not everyone knows Splunk Processing Language (SPL) or how to effectively collect this generated data per entity into a Splunk index.

Fortunately, Splunkers Rupert Truman and Josh Cowling created a free Splunkbase application called the Splunk App for Behavior Profiling, which can automate the KYC use case as long as we have the data for each functional banking domain. To continue the discussion with our example, let’s use their app, which is web driven. The only SPL I’ll use is to search for all events for a given sourcetype. My data is fictitious and several years old, but it still illustrates the point.

In the web page, after searching for the data within a time range, we pick a field to group by, which in this case is the unique customer name and the field that is going to be monitored for outliers, which is the amount field here. Sample results for the search and fields in question are shown automatically by the web page to continue.

Next, we pick a statistical function for the amount field (average) and split it by each unique customer. We can also do the average in time span buckets such as every hour or day.

Finally, we save this as a rule to collect the data to find the average amount per customer over a given time period as a scheduled search.

After the data is automatically collected within a summary index, we can use the web interface workflow for the indicators to score for standard deviation outliers, which go to a scoring index to stack rank them. This automation can be done for each functional domain in the FSI world such as ATM, credit cards, payments, wire transfers, etc., which makes continuous monitoring an easier task. The app also provides screens to drill down and investigate any particular entity, which is the customer in our case. There is even a review section to mark if an entity’s risk scores have been reviewed making this useful for compliance checks for review.

This part of KYC is set up and ready to go thanks to this app.

Machine Learning

Rupert and Josh’s app also has screens for using machine learning (e.g., probability density function) to find outliers within all entities, without having to learn in depth data science. The question may be asked, why not use machine learning to find out anomalies within the set of transactions for each customer? This is a matter of practicality because the way machine learning typically works is that it builds a model for a dataset to apply for future data. Building a million models for a million customers is probably an overkill. A more maintainable approach would be to cluster each customer by a segment such as transaction amounts. Some customers will be clustered as average amounts around 50. Others may be clustered with 500. Some may even be clustered with 500,000 as their typical amounts. Now, one can build a model per cluster and find outliers per cluster rather than for individual customers. This makes it scale better and an order of magnitude more manageable.

Conclusion

The KYC use case is an important banking regulation and continuous monitoring is the most vital part of it. What was discussed was an easier approach to operationalize monitoring each customer’s transactions, and hence their behavior for outliers. The Splunk App for Behavior Profiling can be used for a variety of FSI use cases where one is looking for anomalies within any set of entities or for each entity against themselves as well.

Related Articles

How Splunk is Helping Shape the Future of Higher Education IT by Tackling EDUCAUSE 2026 Top Issues
Industries
3 Minute Read

How Splunk is Helping Shape the Future of Higher Education IT by Tackling EDUCAUSE 2026 Top Issues

Dive into how Splunk aligns with key priorities highlighted at EDUCAUSE 2025.
Enhancing Government Resilience: How AI and Automation Empower Public Sector Missions
Industries
3 Minute Read

Enhancing Government Resilience: How AI and Automation Empower Public Sector Missions

Splunk helps government agencies boost security and efficiency with powerful, mission-ready AI and automation.
Solving Manual Mayhem in Telecom with Agentic AI
Industries
3 Minute Read

Solving Manual Mayhem in Telecom with Agentic AI

Agentic AI cuts downtime, improves security, and boosts customer experience, and with unified data from Splunk and Cisco, teams can build more resilient operations.
Upgrading to Splunk Enterprise 10.0 and Splunk Cloud Platform 10.0: Key Resources for Public Sector Customers
Industries
2 Minute Read

Upgrading to Splunk Enterprise 10.0 and Splunk Cloud Platform 10.0: Key Resources for Public Sector Customers

Splunk Enterprise 10.0 and Splunk Cloud Platform 10.0 deliver the most secure, stable, and modernized platform for a digitally resilient and compliance-ready future.
Building the Next Generation of Defenders: From the Classroom to the SOC of the Future
Industries
3 Minute Read

Building the Next Generation of Defenders: From the Classroom to the SOC of the Future

Resilience in the AI era doesn’t just happen – it's built one student, one SOC, and one organisation at a time.
Analytics That Work: 3 Approaches for the Future of Contact Centers
Industries
3 Minute Read

Analytics That Work: 3 Approaches for the Future of Contact Centers

Splunker Khalid Ali explains how unified, real-time intelligence connects data, empowers agents, and builds lasting customer loyalty.
Observability + Security: Real-Time Digital Resilience for SLED
Industries
1 Minute Read

Observability + Security: Real-Time Digital Resilience for SLED

Cisco and Splunk are helping public sector organizations build digital resilience.
Digital Resilience for State and Local Governments (Part Two)
Industries
3 Minute Read

Digital Resilience for State and Local Governments (Part Two)

Discover how collaboration—powered by shared data platforms like Splunk—can enhance incident response and overall digital resilience.
Reflections from SIBOS 2025: How will advances in technology (and especially AI) change the financial services industry over the next 5 years?
Industries
2 Minute Read

Reflections from SIBOS 2025: How will advances in technology (and especially AI) change the financial services industry over the next 5 years?

Discover key insights from SIBOS 2025 on how AI, collaboration, and data will reshape financial services over the next 5 years—prepare for rapid change and exciting opportunities ahead.