Nghiên cứu về hành vi của một số phương pháp cân bằng dữ liệu huấn luyện máy học
Tóm tắt
Từ khóa
Tài liệu tham khảo
Batista G. E. A. P. A., 2003, WOB, 35
Blake C. and Merz C. UCI Repository of Machine Learning Databases 1998. http://www.ics.uci.edu/~mlearn/MLRepository.html. Blake C. and Merz C. UCI Repository of Machine Learning Databases 1998. http://www.ics.uci.edu/~mlearn/MLRepository.html.
Chawla N. V., 2003, Workshop on Learning from Imbalanced Data Sets II
Chawla N. V., 2002, SMOTE: Synthetic Minority Over-sampling Technique. JAIR, 16, 321
Ciaccia P., 1997, VLDB, 426
Drummond C., 2003, Workshop on Learning from Imbalanced Data Sets II
Ferri C., 2002, J. Learning Decision Trees Using the Area Under the ROC Curve. In ICML (, 139
Hand D. J., 1997, John Wiley and Sons
Japkowicz N., 2003, Workshop on Learning from Imbalanced Data Sets II
Japkowicz N., 2002, The Class Imbalance Problem: A Systematic Study. IDA Journal, 6, 5
Kubat M., 1997, Addressing the Course of Imbalanced Training Sets: One-sided Selection. In ICML, 179
Ling C. X., 1998, Data Mining for Direct Mining: Problems and Solutions. In KDD, 73
Mitchell T. M. Machine Learning. McGraw-Hill 1997. Mitchell T. M. Machine Learning. McGraw-Hill 1997.
Provost F. J., 1997, KDD, 43
Quinlan J. R. C4.5 Programs for Machine Learning. Morgan Kaufmann CA 1988. Quinlan J. R. C4.5 Programs for Machine Learning. Morgan Kaufmann CA 1988.
Tomek, 1976, Two Modifications of CNN. IEEE Transactions on Systems Man and Communications SMC-6 (, 769
Weiss G. M., 2003, The Effect of Class Distribution on Tree Induction. JAIR, 19, 315
Wilson D. L., 1972, Communications, 2, 3