Mining corporate annual reports for intelligent detection of financial statement fraud - A comparative study of machine learning methods
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216275%3A25410%2F17%3A39910599" target="_blank" >RIV/00216275:25410/17:39910599 - isvavai.cz</a>
Result on the web
<a href="http://www.sciencedirect.com/science/article/pii/S0950705117302022" target="_blank" >http://www.sciencedirect.com/science/article/pii/S0950705117302022</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1016/j.knosys.2017.05.001" target="_blank" >10.1016/j.knosys.2017.05.001</a>
Alternative languages
Result language
angličtina
Original language name
Mining corporate annual reports for intelligent detection of financial statement fraud - A comparative study of machine learning methods
Original language description
Financial statement fraud has been serious concern for investors, audit firms, government regulators, and other capital market stakeholders. Intelligent financial statement fraud detection systems have therefore been developed to support decision-making of the stakeholders. Fraudulent misrepresentation of financial statements in managerial comments has been noticed in recent studies. As such, the purpose of this study was to examine whether an improved financial fraud detection system could be developed by combining specific features derived from financial information and managerial comments in corporate annual reports. To develop this system, we employed both intelligent feature selection and classification using a wide range of machine learning methods. We found that ensemble methods outperformed the remaining methods in terms of true positive rate (fraudulent firms correctly classified as fraudulent). In contrast, Bayesian belief networks (BBN) performed best on non-fraudulent firms (true negative rate). This finding is important because interpretable "green flag" values (for which fraud is likely absent) could be derived, providing potential decision support to auditors during client selection or audit planning. We also observe that both financial statements and text in annual reports can be utilised to detect non-fraudulent firms. However, non-annual report data (analysts' forecasts of revenues and earnings) are necessary to detect fraudulent firms. This finding has important implications for selecting variables when developing early warning systems of financial statement fraud.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/GA16-19590S" target="_blank" >GA16-19590S: Topic and sentiment analysis of multiple textual sources for corporate financial decision-making</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2017
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Knowledge-Based Systems
ISSN
0950-7051
e-ISSN
—
Volume of the periodical
128
Issue of the periodical within the volume
July
Country of publishing house
NL - THE KINGDOM OF THE NETHERLANDS
Number of pages
14
Pages from-to
139-152
UT code for WoS article
000403633500012
EID of the result in the Scopus database
—