RUSBoost: A Hybrid Approach to Alleviating Class Imbalance

Institute of Electrical and Electronics Engineers (IEEE) - Tập 40 Số 1 - Trang 185-197 - 2010
Chris Seiffert1, Taghi M. Khoshgoftaar1, Jason Van Hulse1, Amri Napolitano1
1[Department of Computer Science & Engineering, Florida Atlantic University, Boca Raton, FL, USA]

Tóm tắt

Từ khóa


Tài liệu tham khảo

landgrebe, 2006, precisionrecall operating characteristic (p-roc) curves in imprecise environments, Proc 18th Int Conf Pattern Recog, 4, 123

10.1023/A:1007601015854

mitchell, 1997, Machine Learning

10.1016/B978-1-55860-335-6.50017-9

2004, SAS/STAT User's Guide

berenson, 1983, Intermediate Statistical Methods and Applications A Computer Package Approach

10.1057/palgrave.jors.2601932

10.1145/1143844.1143874

weiss, 2003, learning when training data are costly: the effect of class distribution on tree induction, J Artif Intell Res, 19, 315, 10.1613/jair.1199

10.1109/ICMLA.2007.76

barandela, 2004, the imbalanced training sample problem: under or over sampling?, Proc Joint IAPR Workshops SSPR, 3138, 806

han, 2005, borderline-smote: a new over-sampling method in imbalanced data sets learning, Proc ICIC, 3644, 878

seiffert, 2008, building useful models from imbalanced data with sampling and boosting, Proc 21st Int FLAIRS Conf, 306

fan, 1999, adacost: misclassification cost-sensitive boosting, Proc 16th Int Conf Mach Learn, 97

ting, 2000, a comparative study of cost-sensitive boosting algorithms, Proc 17th Int Conf Mach Learn, 983

10.1016/j.patcog.2007.04.009

10.1109/ICDM.2001.989527

10.1145/1007730.1007736

aczel, 1975, On Measures of Information and their Characterizations

freund, 1996, experiments with a new boosting algorithm, Proc 13th Int Conf Mach Learn, 148

quinlan, 1993, C4 5 Programs for Machine Learning

drummond, 2003, c4.5, class imbalance, and cost sensitivity: why under-sampling beats over-sampling, Proc Int'l Conf Machine Learning Workshop Learning from Imbalanced Data Sets II, 1

chawla, 2003, smoteboost: improving prediction of the minority class in boosting, Principles and Practice of Knowledge Discovery in Databases, 107

10.1016/B978-1-55860-377-6.50023-2

freund, 1999, a short introduction to boosting, J Jpn Soc Artif Intell, 14, 771

10.1145/1273496.1273614

chawla, 2002, smote: synthetic minority oversampling technique, J Artif Intell Res, 16, 321, 10.1613/jair.953

10.1145/1007730.1007735

japkowicz, 2000, learning from imbalanced data sets: a comparison of various strategies, Proc AAAI Workshop Learning Imbalanced Data Sets, 10

10.1145/1007730.1007734

mease, 2007, boosted classification trees and class probability/quantile estimation, J Mach Learn Res, 8, 409

10.1007/s10618-008-0087-0

weiss, 2007, cost-sensitive learning vs. sampling: which is best for handling unbalanced classes with unequal error costs?, Proc Int Conf Data Mining, 35

0, Metrics data program

elkan, 2001, the foundations of cost-sensitive learning, Proc 17th Int Conf Mach Learn, 239

witten, 2005, Data Mining Practical Machine Learning Tools and Techniques

blake, 1998, UCI repository of machine learning databases