Data mining for credit card fraud: A comparative study

Decision Support Systems - Tập 50 Số 3 - Trang 602-613 - 2011
Siddhartha Bhattacharyya1, Sanjeev Jha2, Kurian Tharakunnel3, J. Christopher Westland4
1Department of Information and Decision Sciences (MC 294), College of Business Administration, University of Illinois, Chicago, 601 South Morgan Street, Chicago, Illinois 60607-7124, USA
2Department of Decision Sciences, Whittemore School of Business and Economics, University of New Hampshire, McConnell Hall, Durham, New Hampshire 03824-3593, USA.
3Tabor School of Business, Millikin University, 1184 West Main Street, Decatur, IL 62522, USA#TAB#
4Department of Information & Decision Sciences (MC 294), College of Business Administration, University of Illinois, Chicago, 601 S. Morgan Street, Chicago, IL 60607-7124, USA

Tóm tắt

Từ khóa


Tài liệu tham khảo

Aleskerov, 1998, CARDWATCH: a neural network based database mining system for credit card fraud detection, in computational intelligence for financial engineering, 220

Artis, 2002, Detection of automobile insurance fraud with discrete choice models and misclassified claims, The Journal of Risk and Insurance, 69, 325, 10.1111/1539-6975.00022

Bolton, 2001, Unsupervised profiling methods for fraud detection

Bolton, 2002, Statistical fraud detection: a review, Statistical Science, 17, 235, 10.1214/ss/1042727940

Brause, 1999, Neural data mining for credit card fraud detection, 103

Breiman, 2001, Random forest, Machine Learning, 45, 5, 10.1023/A:1010933404324

Bollinger, 1997, Modeling discrete choice with response error: food stamp participation, Journal of the American Statistical Association, 92, 827, 10.2307/2965547

CapitalOne Identity theft guide for victims, retrieved January 10, 2009, from http://www.capitalone.com/fraud/IDTheftPackageV012172004We.pdf?linkid=WWW_Z_Z_Z_FRD_D1_01_T_FIDTP.

Caruana, 2008

Caruana, 2006

Caudill, 2005, Fraud detection using a multinomial logit model with missing information, The Journal of Risk and Insurance, 72, 539, 10.1111/j.1539-6975.2005.00137.x

Chan, 1999, 67

Chawla, 2004, Editorial: special issue on learning from imbalanced data sets, ACM SIGKDD Explorations Newsletter, 6, 10.1145/1007730.1007733

Chen, 2006, A new binary support vector system for increasing detection rate of credit card fraud, International Journal of Pattern Recognition, 20, 227, 10.1142/S0218001406004624

Chen, 2004

Cristianini, 2000

CyberSource. Online fraud report: online payment, fraud trends, merchant practices, and bench marks, retrieved January 8, 2009, from http://www.cybersource.com.

Dietterich, 2002, Ensemble learning

Dorronsoro, 1997, Neural fraud detection in credit card operations, IEEE Transactions on Neural Networks, 8, 827, 10.1109/72.595879

Everett, 2009

FairIssac. Falcon Fraud Manager, retrieved January 8, 2009, http://www.fairisaac.com/ficx/Products/dmapps/Falcon-Fraud-Manager.html.

Ghosh, 1994, Credit card fraud detection with a neural-network

Hausman, 1998, Misclassification of a dependent variable in a discrete-response setting, Journal of Econometrics, 87, 239, 10.1016/S0304-4076(98)00015-3

Hosmer, 2000

Hulse, 2007

Jin, 2005, Binary choice models for rare events data: a crop insurance fraud application, Applied Economics, 37, 841, 10.1080/0003684042000337433

Khoshgoftaar, 2007, An empirical study of learning from imbalanced data using random forest, vol. 2

Kou, 2004, Survey of fraud detection techniques, in: IEEE International Conference on Networking, Sensing and Control, 749

Kuncheva, 2005

Larivière, 2005, Predicting customer retention and profitability by using random forests and regression forests techniques, Expert Systems with Applications, 29, 472, 10.1016/j.eswa.2005.04.043

Lee, 2005, An extensive comparison of recent classification tools applied to microarray data, Computational Statistics and Data Analysis, 48, 869, 10.1016/j.csda.2004.03.017

H. Leggatt, CyberSource: Online fraud to reach $4 billion. BizReport, December 16, 2008.

Ling, 2003, AUC: a statistically consistent and more discriminating measure than accuracy, 519

Meyer, 2003, The support vector machine under test, Neurocomputing, 55, 169, 10.1016/S0925-2312(03)00431-4

McAlearney, 2008

Magder, 1997, Logistic regression when the outcome is measured with uncertainty, American Journal of Epidemiology, 146, 195, 10.1093/oxfordjournals.aje.a009251

Paasch, 2007

Pal, 2005, Random forest classifier for remote sensing classification, International Journal of Remote Sensing, 26, 217, 10.1080/01431160412331269698

Platt, 1998, Fast training of support vector machines using sequential minimal optimization, 185

Polikar, 2006, Ensemble based systems in decision making, IEEE Circuits and Systems Magazine, 6, 21, 10.1109/MCAS.2006.1688199

Poterba, 1995, Unemployment benefits and labour market transitions: a multinomial logit model with errors in classification, The Review of Economics and Statistics, 77, 207, 10.2307/2109860

Provost, 2002, Comment on Bolton and Hand, Statistical Science, 17, 249

Provost, 1997, Analysis and visualization of classifier performance: comparison under imprecise class and cost distributions, 43

Sch¨olkopf, 2002

Srivastava, 2008, Credit card fraud detection using hidden Markov model, IEEE Transactions on Dependable and Secure Computing, 5, 37, 10.1109/TDSC.2007.70228

Statnikov, 2008, A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification, BMC Bioinformatics, 22, 9

Vapnik, 1998

Wheeler, 2000, Multiple algorithms for fraud detection, Knowledge-Based Systems, 99, 93, 10.1016/S0950-7051(00)00050-2

Whitrow, 2009, Transaction aggregation as a strategy for credit card fraud detection, Data Mining and Knowledge Discovery, 18, 30, 10.1007/s10618-008-0116-z

Williams, 2007

Xie, 2009, Customer churn prediction using improved balanced random forests, Expert Systems with Applications, 36, 5445, 10.1016/j.eswa.2008.06.121

Zhang, 2008, Random-forests-based network intrusion detection systems, IEEE Transactions on Systems, Man, and Cybernetics Part C: Applications and Reviews, 38, 649, 10.1109/TSMCC.2008.923876