
Money laundering is a profound global problem. Nonetheless, there is little scientific literature on statistical and machine learning methods for anti-money laundering. In this paper, we focus on anti-money laundering in banks and provide an introduction and review of the literature. We propose a unifying terminology with two central elements: (i) client risk profiling and (ii) suspicious behavior flagging. We find that client risk profiling is characterized by diagnostics, i.e., efforts to find and explain risk factors. On the other hand, suspicious behavior flagging is characterized by non-disclosed features and hand-crafted risk indices. Finally, we discuss directions for future research. One major challenge is the need for more public data sets. This may potentially be addressed by synthetic data generation. Other possible research directions include semi-supervised and deep learning, interpretability, and fairness of the results.
Accepted for publication in IEEE Access, vol. 11, pp. 8889-8903, doi:10.1109/ACCESS.2023.3239549
FOS: Computer and information sciences, Anti-money laundering, Computer Science - Machine Learning, machine learning, literature review, Statistics - Machine Learning, know-your-client, Machine Learning (stat.ML), Electrical engineering. Electronics. Nuclear engineering, TK1-9971, Machine Learning (cs.LG)
FOS: Computer and information sciences, Anti-money laundering, Computer Science - Machine Learning, machine learning, literature review, Statistics - Machine Learning, know-your-client, Machine Learning (stat.ML), Electrical engineering. Electronics. Nuclear engineering, TK1-9971, Machine Learning (cs.LG)
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 7 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
