Title Identifying irregular financial operations using accountant comments and natural language processing techniques
Authors Rudžionis, Vytautas ; Lopata, Audrius ; Gudas, Saulius ; Butleris, Rimantas ; Veitaitė, Ilona ; Dilijonas, Darius ; Grišius, Evaldas ; Zwitserloot, Maarten ; Rudzioniene, Kristina
DOI 10.3390/app12178558
Is Part of Applied sciences. Basel : MDPI. 2022, vol. 12, iss. 17, art. no. 8558, p. 1-15. ISSN 2076-3417
Keywords [eng] natural language processing ; semantic similarity ; cosine similarity ; parsing ; outliers detection
Abstract [eng] Featured Application: The paper applies natural language processing techniques to comments left by accountants in order to identify potentially irregular financial operations. Finding atypical financial operations is a complicated task. The difficulties arise not only from the sophisticated actions of fraudsters but also from the large number of financial operations performed by business companies, especially large ones. It is highly desirable to have a tool that significantly narrows down the set of potentially irregular operations. This paper presents an implementation of NLP-based algorithms to identify irregular financial operations using comments left by accountants. The comments are free-form and usually very short remarks that accountants write for their own reference. Content analysis using cosine similarity showed that the type of operation can very likely be identified from accountants' comments. Further analysis of comment content and financial data showed that the number of potentially suspicious operations can be expected to fall significantly: analysis of more than half a million financial records of Dutch companies identified 0.3% of operations as potentially suspicious. This could make human financial auditing an easier and more robust task.
Published Basel : MDPI
Type Journal article
Language English
Publication date 2022
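Note [eng] The abstract mentions cosine similarity over accountant comments as the way to identify the type of operation. The following is a minimal sketch of that general idea, not the authors' implementation: it assumes simple bag-of-words term counts, and the reference comments, the operation types, the `classify` helper, and the 0.3 threshold are all hypothetical choices made for illustration only.

```python
import math
import re
from collections import Counter
from typing import List, Optional, Tuple


def tokenize(comment: str) -> List[str]:
    """Lowercase a comment and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", comment.lower())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two term-count vectors (0.0 if either is empty)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical reference comments per operation type (not taken from the paper).
REFERENCE_COMMENTS = {
    "salary": "monthly salary payment employee",
    "rent": "office rent payment monthly",
    "tax": "vat tax payment quarter",
}


def classify(comment: str, threshold: float = 0.3) -> Tuple[Optional[str], float]:
    """Return the most similar operation type, or None if no score reaches the threshold."""
    vec = Counter(tokenize(comment))
    best_type, best_score = None, 0.0
    for op_type, reference in REFERENCE_COMMENTS.items():
        score = cosine_similarity(vec, Counter(tokenize(reference)))
        if score > best_score:
            best_type, best_score = op_type, score
    if best_score < threshold:
        return None, best_score
    return best_type, best_score
```

Under these assumptions, a comment that matches no reference type closely enough comes back as `None`, which is one simple way such a comment could be set aside for closer review.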