RUSBoost: A Hybrid Approach to Alleviating Class Imbalance
Abstract
Keywords
References
landgrebe, 2006, precision-recall operating characteristic (p-roc) curves in imprecise environments, Proc 18th Int Conf Pattern Recog, 4, 123
mitchell, 1997, Machine Learning
2004, SAS/STAT User's Guide
berenson, 1983, Intermediate Statistical Methods and Applications: A Computer Package Approach
weiss, 2003, learning when training data are costly: the effect of class distribution on tree induction, J Artif Intell Res, 19, 315, 10.1613/jair.1199
barandela, 2004, the imbalanced training sample problem: under or over sampling?, Proc Joint IAPR Workshops SSPR, 3138, 806
han, 2005, borderline-smote: a new over-sampling method in imbalanced data sets learning, Proc ICIC, 3644, 878
seiffert, 2008, building useful models from imbalanced data with sampling and boosting, Proc 21st Int FLAIRS Conf, 306
fan, 1999, adacost: misclassification cost-sensitive boosting, Proc 16th Int Conf Mach Learn, 97
ting, 2000, a comparative study of cost-sensitive boosting algorithms, Proc 17th Int Conf Mach Learn, 983
aczel, 1975, On Measures of Information and their Characterizations
freund, 1996, experiments with a new boosting algorithm, Proc 13th Int Conf Mach Learn, 148
quinlan, 1993, C4.5: Programs for Machine Learning
drummond, 2003, c4.5, class imbalance, and cost sensitivity: why under-sampling beats over-sampling, Proc Int'l Conf Machine Learning Workshop Learning from Imbalanced Data Sets II, 1
chawla, 2003, smoteboost: improving prediction of the minority class in boosting, Principles and Practice of Knowledge Discovery in Databases, 107
freund, 1999, a short introduction to boosting, J Jpn Soc Artif Intell, 14, 771
chawla, 2002, smote: synthetic minority oversampling technique, J Artif Intell Res, 16, 321, 10.1613/jair.953
japkowicz, 2000, learning from imbalanced data sets: a comparison of various strategies, Proc AAAI Workshop Learning Imbalanced Data Sets, 10
mease, 2007, boosted classification trees and class probability/quantile estimation, J Mach Learn Res, 8, 409
weiss, 2007, cost-sensitive learning vs. sampling: which is best for handling unbalanced classes with unequal error costs?, Proc Int Conf Data Mining, 35
Metrics data program
elkan, 2001, the foundations of cost-sensitive learning, Proc 17th Int Conf Mach Learn, 239
witten, 2005, Data Mining: Practical Machine Learning Tools and Techniques
blake, 1998, UCI repository of machine learning databases