Skip to main content

Simulation is our telescope to unveiling the fraud in the financial universe

Two years ago, I once said: “Simulation is my telescope to explore the universe of financial fraud” just a few days ago, this comparison came back to my mind when the first ever picture of a black hole was achieved through a coordinated effort of more than 200 scientists and observatories all around the globe.



“I can think of myself as an astronomer, eager to learn more about the universe, but without a proper tool or telescope, it would be hard to learn more than I can see with my eyes. Simulation is my telescope to explore the universe of financial fraud.” Dr Edgar Lopez-Rojas (interview at Kaggle 2017).

Some years ago, I published a synthetic dataset in Kaggle that contained a generated scenario from a real dataset sample for a Mobile Money Payments system in an African country. The PaySim dataset was the first synthetic transactional dataset shared openly and released in a known data scientist community platform. This fact led Kaggle to name the PaySim dataset as the “dataset of the week”, and during a short interview, I compared the use of simulation with the development of a telescope. Today, two years after, the PaySim dataset has acquired great popularity, with more than 120k views so far, and has been widely used by several students for their thesis, practitioners, enthusiasts and recognised institutions such as Databricks and IBM-MIT Watson lab.

The war against crime requires similar effort and coordinated actions from all parties to significantly improve and move the odds to win on our side. I believe Simulation is the key to unveiling a new universe of hidden fraud in our financial services.

The modern world and new technologies possess challenges that limit our efficiency and efficacy against criminal activities. But at the same time bring new opportunities for the soldiers against fraud.

Using simulation and generated synthetic datasets, researchers and financial services can communicate results under the strict confidentiality agreements that represent the handling of personal data and the new GDPR regulations. Regulators have the task of funding and incentivising the efforts of academia and the financial sector. At the same time, use simulation models to create better policies that are up to the task of efficiently reducing crime profit.

Galileo didn’t build the Everest Horizon Virtual Telescope on his first attempt, but he laid the foundation for others to incrementally build more and better telescopes every day. Simulation has been among us for several years. Today, we have the technology, the computational power and the knowledge to use it as a powerful weapon in the fight against financial fraud. 

Comments

Popular posts from this blog

Synthetic Data: The Future of Data Privacy and Security

 In today's data-driven world, companies and organisations across industries constantly collect, analyse, and utilise data to inform their decisions and strategies. However, using real-world data can present various challenges, particularly in data privacy and security. This is where synthetic data comes in as an innovative solution that balances data privacy and data utility. In this blog post, we'll explore the importance of synthetic data and why it should be considered a best practice in various industries, focusing on personal identifiable data and GDPR. What is Synthetic Data? Synthetic data refers to artificially generated data that mimics the statistical properties of real-world data. It can be created using various techniques such as generative adversarial networks (GANs), agent-based simulations (ABM), variational autoencoders (VAEs), and other machine learning algorithms. The resulting synthetic data can be used in place of real-world data in various applications, su

Financial Crime Vaccines: A Consortium Approach to Tackle Financial Crime Using Trustworthy AI and Synthetic Data

 Financial crime continues to pose a significant threat to the global financial system, and traditional methods of detecting and preventing these crimes are often inadequate. In recent years, the use of artificial intelligence (AI) has emerged as a promising solution to this problem, but it requires high-quality data to be effective. This is where financial crime vaccines come into play. Financial crime vaccines use synthetic data to train AI systems, creating a more robust and effective defence against financial crime. Using synthetic data, financial institutions can train AI models to identify and prevent financial crime without exposing sensitive customer information. This makes financial crime vaccines a safe and secure way to fight financial crime. But the creation of a financial crime vaccine is a complex task requiring multiple financial institutions' collaboration. This is why a consortium is necessary to develop and deploy financial crime vaccines successfully. Financial c

Financial Synthetic Data is the New Oil for FinCrime Analytics

Financial Data is significantly constrained by customer privacy regulations such as GDPR, which hampers the possibility of collaboration between different stakeholders in financial problems such as optimising Anti-Money Laundering (AML) tools and reducing financial crime. Solutions based on Machine Learning (ML) are on the way, but unfortunately, the quality data required to train the models is unavailable. The three most significant drawbacks of using ML for AML are lack of 'labelled data', the 'imbalance class' of misbehaviour financial activity and finally, the evolving threat of finCrime that makes 'training datasets obsolete'. These drawbacks are derived from the unknown ‘hidden crime’ problem. We address this problem by creating digital synthetic twins of financial data significantly enriched for advanced solutions based on machine learning. We add to our models the known crime dynamics specially tailored to match undesirable scenarios in our financial ins