Entity resolution: the secret agent of Continuous Controls Monitoring
We often talk about what Continuous Controls Monitoring is and why organisations need to get visibility into their security posture. But we rarely talk about how it actually works. There is an open data-science secret to automating security measurement: entity resolution.Much like MI6 have James Bond, CCM’s secret agent is entity resolution. Clunky metaphor aside, let’s get into entity resolution and its application in CCM.
What is entity resolution?
Entity resolution is the data science process that CCM relies on to be able to take data from all kinds of sources and aggregate it into useful security metrics. At a high level, it works out what information is about the same thing and how many distinct things are represented in all that data.Here’s a quick rundown of what CCM does: CCM ingests data from various disparate sources in your organisation and uses entity resolution to create a unified, context-rich view. The process cleans, normalises, aggregates, de-duplicates, correlates and unifies data from all these sources. But what value does this bring? Entity resolution provides links and enrichment across the ‘entities’ in an organisation – that is to say: people, endpoints, servers, accounts, databases, applications, and more. It is the ‘secret agent’ that combines all the fragments of data from all your siloed data sources. It gives the ability to understand every asset, the status of all controls relating to that asset, and how those assets relate to each other and the business. This view is fundamental for any analysis about your assets. Without it, it would be challenging to even determine with confidence how many endpoints you have. This allows automated, real-time security metrics that provide a view of the overall security posture of the organisation. Entity resolution is not something unique to Panaseer or Continuous Controls Monitoring, though. It is a process that has applications across marketing, finance, investigation and more. However, it is relatively new to apply entity resolution to understanding cybersecurity posture. In the context of CCM, it takes data from multiple sources across the environment to create the richest possible view of each entity.An example of entity resolution
To explain the concept to a layperson, we like to use what we call the ‘James Bond’ example (hence the clunky metaphor). Say we have a number of data sources providing different pieces of information about people. We want to get all the information that we can about one of the people in those records – James Bond. All of the data sources provide records about him, but they refer to him in different ways. These data sources might include:- The MI6 HR database
- The registry of international spies
- Goldfinger’s database of enemies
- James Bond
- Bond, James
- 007
- james.bond@mi6.com (his work email)
- +447007007 (his work phone)