
arXiv: 1908.00734
The detection of fraud in accounting data is a long-standing challenge in financial statement audits. Nowadays, the majority of applied techniques refer to handcrafted rules derived from known fraud scenarios. While fairly successful, these rules exhibit the drawback that they often fail to generalize beyond known fraud scenarios and fraudsters gradually find ways to circumvent them. In contrast, more advanced approaches inspired by the recent success of deep learning often lack seamless interpretability of the detected results. To overcome this challenge, we propose the application of adversarial autoencoder networks. We demonstrate that such artificial neural networks are capable of learning a semantic meaningful representation of real-world journal entries. The learned representation provides a holistic view on a given set of journal entries and significantly improves the interpretability of detected accounting anomalies. We show that such a representation combined with the networks reconstruction error can be utilized as an unsupervised and highly adaptive anomaly assessment. Experiments on two datasets and initial feedback received by forensic accountants underpinned the effectiveness of the approach.
11 pages, 9 figures, 2nd KDD Workshop on Anomaly Detection in Finance, August 05, 2019, Anchorage, Alaska
FOS: Computer and information sciences, FOS: Economics and business, Computer Science - Machine Learning, deep learning, accounting, artificial intelligence, machine learning, Statistical Finance (q-fin.ST), Statistics - Machine Learning, Quantitative Finance - Statistical Finance, computer science, Machine Learning (stat.ML), Machine Learning (cs.LG)
FOS: Computer and information sciences, FOS: Economics and business, Computer Science - Machine Learning, deep learning, accounting, artificial intelligence, machine learning, Statistical Finance (q-fin.ST), Statistics - Machine Learning, Quantitative Finance - Statistical Finance, computer science, Machine Learning (stat.ML), Machine Learning (cs.LG)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
