Using machine learning to detect misstatements

Springer Science and Business Media LLC - Tập 26 - Trang 468-519 - 2020
Jeremy Bertomeu1, Edwige Cheynel1, Eric Floyd2, Wenqiang Pan3
1Olin Business School, Washington University, Saint Louis, USA
2Rady School of Management, University of California, San Diego, La Jolla, USA
3Columbia Business School, Columbia University, New York City, USA

Tóm tắt

Machine learning offers empirical methods to sift through accounting datasets with a large number of variables and limited a priori knowledge about functional forms. In this study, we show that these methods help detect and interpret patterns present in ongoing accounting misstatements. We use a wide set of variables from accounting, capital markets, governance, and auditing datasets to detect material misstatements. A primary insight of our analysis is that accounting variables, while they do not detect misstatements well on their own, become important with suitable interactions with audit and market variables. We also analyze differences between misstatements and irregularities, compare algorithms, examine one-year- and two-year-ahead predictions and interpret groups at greater risk of misstatements.

Tài liệu tham khảo

Abbasi, A., Albrecht, C., Vance, A., & Hansen, J. (2012). Metafraud: a meta-learning framework for detecting financial fraud. Mis Quarterly, 36(4), 1293–1327. Avramov, D., Chordia, T., Jostova, G., & Philipov, A. (2009). Credit ratings and the cross-section of stock returns. Journal of Financial Markets, 12 (3), 469–499. Bao, Y., Ke, B., Li, B., Julia Yu, Y., & Zhang, J. (2020). Detecting accounting fraud in publicly traded us firms using a machine learning approach. Journal of Accounting Research, 58(1), 199–235. Barton, J., & Simko, P.J. (2002). The balance sheet as an earnings management constraint. The Accounting Review, 77(s-1), 1–27. Beneish, M.D. (1999). The detection of earnings manipulation. Financial Analysts Journal, 55(5), 24–36. Bertomeu, J., & Marinovic, I. (2015). A Theory of hard and soft information. The Accounting Review, 91(1), 1–20. Blackburne, T., Kepler, J., Quinn, P., & Taylor, D. (2020). Undisclosed sec investigations. Forthcoming Management Science. Cheffers, M., Whalen, D., & Usvyatsky, O. (2010). 2009 financial restatements: A nine year comparison. Audit Analytics Sales (February). Cheynel, E., & Levine, C. (2020). Public disclosures and information asymmetry: A theory of the mosaic. The Accounting Review, 95(1), 79–99. Dechow, P.M., & Dichev, I.D. (2002). The quality of accruals and earnings: The role of accrual estimation errors. The Accounting Review, 77(s-1), 35–59. Dechow, P.M., Ge, W., Larson, C.R., & Sloan, R.G. (2011). Predicting material accounting misstatements. Contemporary Accounting Research, 28(1), 17–82. DeFond, M.L., Raghunandan, K., & Subramanyam, K.R. (2002). Do non–audit service fees impair auditor independence? evidence from going concern audit opinions. Journal of Accounting Research, 40(4), 1247–1274. Deng, H. (2018). Interpreting tree ensembles with inttrees. International Journal of Data Science and Analytics, pp 1–11. Ding, K., Lev, B., Peng, X., Sun, T., & Vasarhelyi, M.A. (2020). Machine learning improves accounting estimates. Review of Accounting Studies, pp 1–37. Dutta, I., Dutta, S., & Raahemi, B. (2017). Detecting financial restatements using data mining techniques. Expert Systems with Applications, 90, 374–393. Ettredge, M.L., Sun, L., Lee, P., & Anandarajan, A.A. (2008). Is earnings fraud associated with high deferred tax and/or book minus tax levels?. Auditing: A Journal of Practice & Theory, 27(1), 1–33. Fanning, K.M., & Cogger, K.O. (1998). Neural network detection of management fraud using published financial data. Intelligent Systems in Accounting, Finance & Management, 7(1), 21–41. Fawcett, T. (2006). An introduction to roc analysis. Pattern Recognition Letters, 27(8), 861– 874. Frankel, R.M., Johnson, M.F., & Nelson, K.K. (2002). The relation between auditors’ fees for nonaudit services and earnings management. The Accounting Review, 77(s-1), 71–105. Friedman, J.H. (2001). Greedy function approximation: a gradient boosting machine. Annals of Statistics, pp 1189–1232. Friedman, J., Hastie, T., & Tibshirani, R. (2001). The Elements of Statistical Learning Vol. 1. New York: Springer series in statistics. Garfinkel, J.A. (2009). Measuring investors’ opinion divergence. Journal of Accounting Research, 47(5), 1317–1348. Glosten, L.R., & Milgrom, P.R. (1985). Bid, ask and transaction prices in a specialist market with heterogeneously informed traders. Journal of Financial Economics, 14(1), 71–100. Green, B.P., & Choi, J.H. (1997). Assessing the risk of management fraud through neural network technology. Auditing, A Journal of Practice and Theory, 16, 14–28. Guelman, L. (2012). Gradient boosting trees for auto insurance loss cost modeling and prediction. Expert Systems with Applications, 39(3), 3659–3667. Gupta, R., & Gill, N.S. (2012). A solution for preventing fraudulent financial reporting using descriptive data mining techniques. International Journal of Computer Applications. Hribar, P., Kravet, T., & Wilson, R. (2014). A New measure of accounting quality. Review of Accounting Studies, 19(1), 506–538. Johnson, V.E., Khurana, I.K., & Kenneth Reynolds, J. (2002). Audit-firm tenure and the quality of financial reports. Contemporary Accounting Research, 19(4), 637–660. Kasznik, R. (1999). On the association between voluntary disclosure and earnings management. Journal of Accounting Research, 37(1), 57–81. Kim, Y.J., Baik, B., & Cho, S. (2016). Detecting financial misstatements with fraud intention using multi-class cost-sensitive learning. Expert Systems with Applications, 62, 32–43. Kleinberg, J., Lakkaraju, H., Leskovec, J., Ludwig, J., & Mullainathan, S. (2017). Human decisions and machine predictions. The Quarterly Journal of Economics, 133(1), 237–293. Kornish, L.J., & Levine, C.B. (2004). Discipline with common agency: The case of audit and nonaudit services. The Accounting Review, 79(1), 173–200. Larcker, D.F., Richardson, S.A., & Tuna, Irem. (2007). Corporate governance, accounting outcomes, and organizational performance. The Accounting Review, 82(4), 963–1008. Laux, V., & Newman, P.D. (2010). Auditor liability and client acceptance decisions. The Accounting Review, 85(1), 261–285. Lin, J.W., Hwang, M.I., & Becker, J.D. (2003). A Fuzzy neural network for assessing the risk of fraudulent financial reporting. Managerial Auditing Journal, 18(8), 657–665. Lobo, G.J., & Zhao, Y. (2013). Relation between audit effort and financial report misstatements: Evidence from quarterly and annual restatements. The Accounting Review, 88(4), 1385–1412. Perols, J. (2011). Financial statement fraud detection: An analysis of statistical and machine learning algorithms. Auditing: A Journal of Practice & Theory, 30(2), 19–50. Perols, J.L., Bowen, R.M., Zimmermann, C., & Samba, B. (2016). Finding needles in a haystack: Using data analytics to improve fraud prediction. The Accounting Review, 92(2), 221–245. Ragothaman, S., & Lavin, A. (2008). Restatements due to improper revenue recognition: a neural networks perspective. Journal of Emerging Technologies in Accounting, 5(1), 129–142. Romanus, R.N., Maher, J.J., & Fleming, D.M. (2008). Auditor industry specialization, auditor changes, and accounting restatements. Accounting Horizons, 22(4), 389–413. Samuels, D., Taylor, D.J., & Verrecchia, R.E. (2018). Financial misreporting: Hiding in the shadows or in plain sight?. Rijsbergen, V., & Joost, C. (2004). The geometry of information retrieval. Cambridge University Press. Whiting, D.G., Hansen, J.V., McDonald, J.B., Albrecht, C., & Steve Albrecht, W. (2012). Machine learning methods for detecting patterns of management fraud. Computational Intelligence, 28(4), 505–527. Zhang, Y., & Haghani, A. (2015). A gradient boosting method to improve travel time prediction. Transportation Research Part C: Emerging Technologies, 58, 308–324.