Statistical approaches to combining binary classifiers for multi-class classification

Neurocomputing - Tập 74 - Trang 680-688 - 2011
Yuichi Shiraishi1, Kenji Fukumizu2
1Human Genome Center, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
2The Institute of Statistical Mathematics, 10-3 Midori-Cho, Tachikawa, Tokyo 190-8562, Japan

Tài liệu tham khảo

Allwein, 2000, Reducing multiclass to binary: a unifying approach for margin classifiers, Journal of Machine Learning Research, 1, 113 Bennett, 2005, The combination of text classifiers using reliability indicators, Information Retrieval, 8, 67, 10.1023/B:INRT.0000048491.59134.94 C.L. Blake, C.J. Merz, UCI repository of machine learning databases, Technical Report, Department of Information Computational Science, University of California, Irvine, 1998. Bradley, 1952, Rank analysis of incomplete block designs: I. The methods of paired comparisons, Biometrika, 39, 324, 10.2307/2334029 Bredensteiner, 1999, Multicategory classification by support vector machines, Computational Optimization and Applications, 12, 53, 10.1023/A:1008663629662 Breiman, 1996, Stacked regressions, Machine Learning, 24, 49, 10.1007/BF00117832 Burges, 1997, Improving the accuracy and speed of support vector machines, vol. 9, 375 Cortes, 1995, Multicategory classification by support vector machines, Machine Learning, 12, 273, 10.1023/A:1022627411411 Crammer, 2001, On the algorithmic implementation of multiclass kernel-based vector machines, Journal of Machine Learning Research, 2, 265 Crammer, 2002, On the learnability and design of output codes for multiclass problems, Machine Learning, 47, 201, 10.1023/A:1013637720281 David, 1988 Dekel, 2002, Multiclass learning by probabilistic embeddings, 945 Demšar, 2006, Statistical comparisons of classifiers over multiple data sets, Journal of Machine Learning Research, 7, 1 Dietterich, 1995, Solving multiclass learning problems via error-correcting output codes, Journal of Artificial Intelligence Research, 2, 263, 10.1613/jair.105 K. Duan, S.S. Keerthi, Which is the best multiclass SVM method? An empirical study, in: Multiple Classifier Systems, 2005, pp. 278–285. Džeroski, 2004, Is combining classifiers with stacking better than selecting the best one?, Machine Learning, 54, 255, 10.1023/B:MACH.0000015881.36452.6e J. Friedman, Another approach to polychotomous classification, Technical Report, Department of Statistics, Stanford University, 1996. Gama, 2000, Cascade generalization, Machine Learning, 41, 315, 10.1023/A:1007652114878 Hastie, 1998, Classification by pairwise coupling, Annals of Statistics, 26, 451, 10.1214/aos/1028144844 Hsu, 2002, A comparison of methods for multiclass support vector machines, IEEE Transactions on Neural Networks, 13, 415, 10.1109/72.991427 Huang, 2006, Generalized Bradley–Terry models and multi-class probability estimates, Journal of Machine Learning Research, 7, 85 Keerthi, 2006, Building support vector machines with reduced classifier complexity, Journal of Machine Learning Research, 7, 1493 Kittler, 1998, On combining classifiers, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 226, 10.1109/34.667881 LeBlanc, 1996, Combining estimates in regression and classification, Journal of the American Statistical Association, 91, 1641, 10.2307/2291591 Lee, 2004, Multicategory support vector machines: theory and application to the classification of microarray data and satellite radiance data, Journal of the American Statistical Association, 99, 67, 10.1198/016214504000000098 Meier, 2008, The group lasso for logistic regression, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 70, 53, 10.1111/j.1467-9868.2007.00627.x Melvin, 2007, Multi-class protein classification using adaptive codes, Journal of Machine Learning Research, 8, 1557 Platt, 1999, Fast training of support vector machines using sequential minimal optimization, 185 Platt, 1999, Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods, 61 Platt, 2000, Large margin dags for multiclass classification, 547 Rätsch, 2003, Adapting codes and embeddings for polychotomies, 513 Rifkin, 2004, In defense of one-vs-all classification, Journal of Machine Learning Research, 5, 101 T. Takenouchi, S. Ishii, Multiclass classification as a decoding problem, in: Proceedings of the 9th Workshop on Information-Based Induction Sciences, 2006, pp. 244–249 (in Japanese). Tewari, 2007, On the consistency of multiclass classification methods, Journal of Machine Learning Research, 8, 1007 Tibshirani, 1996, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society, Series B, 58, 267 Todorovski, 2003, Combining classifiers with meta decision trees, Machine Learning, 50, 223, 10.1023/A:1021709817809 J. Weston, C. Watkins, Multi-class support vector machines, Technical Report, Department of Computer Science, University of London, 1998. Wolpert, 1992, Stacked generalization, Neural Networks, 5, 241, 10.1016/S0893-6080(05)80023-1 D.H. Wolpert, W.G. Macready, Combining stacking with bagging to improve a learning algorithm, Technical Report, Santa Fe Institute, 1996. Wu, 2004, Probability estimates for multi-class classification by pairwise coupling, Journal of Machine Learning Research, 5, 975 Yuan, 2006, Model selection and estimation in regression with grouped variables, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 68, 49, 10.1111/j.1467-9868.2005.00532.x Zhang, 2004, Statistical analysis of some multi-category large margin classification methods, Journal of Machine Learning Research, 5, 1225 Zhu, 2005, Kernel logistic regression and import vector machine, Journal of Computational and Graphical Statistics, 14, 185, 10.1198/106186005X25619