Ensemble of ML-KNN for classification algorithm recommendation

Knowledge-Based Systems - Tập 221 - Trang 106933 - 2021
Xiaoyan Zhu1, Chenzhen Ying1, Jiayin Wang1, Jiaxuan Li1, Xin Lai1, Guangtao Wang2
1School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China
2JD AI Research, Mountain View, CA, United States of America

Tài liệu tham khảo

Quinlan, 2014 Breiman, 1984 Moore, 2005, Internet traffic classification using bayesian analysis techniques, 33, 50 Webb, 2005, Not so naive bayes: aggregating one-dependence estimators, Mach. Learn., 58, 5, 10.1007/s10994-005-4258-6 Holte, 1993, Very simple classification rules perform well on most commonly used datasets, Mach. Learn., 11, 63, 10.1023/A:1022631118932 W.W. Cohen, Fast effective rule induction, in: Proceedings of the twelfth international conference on machine learning, 1995, pp. 115–123. Huang, 2020, Relation classification via knowledge graph enhanced transformer encoder, Knowl.-Based Syst., 206, 1 D.H. Wolpert, The supervised learning no-free-lunch theorems, in: World Conference on Soft Computing, 2002, pp. 25–42. Ali, 2006, On learning algorithm selection for classification, Appl. Soft Comput., 6, 119, 10.1016/j.asoc.2004.12.002 P.B. Brazdil, C. Soares, A comparison of ranking methods for classification algorithm selection, in: European Conference on Machine Learning, 2000, pp. 63–74. Song, 2012, Automatic recommendation of classification algorithms based on data set characteristics, Pattern Recognit., 45, 2672, 10.1016/j.patcog.2011.12.025 L. Chekina, L. Rokach, B. Shapira, Meta-learning for selecting a multi-label classification algorithm, in: IEEE International Conference on Data Mining Workshops, 2012, pp. 220–227. Wang, 2015, An improved data characterization method and its application in classification algorithm recommendation, Appl. Intell., 43, 892, 10.1007/s10489-015-0689-3 Gore, 2013, Dynamic algorithm selection for data mining classification, Int. J. Sci. Eng. Res., 4, 2029 Ali, 2017, Accurate multi-criteria decision making methodology for recommending machine learning algorithm, Expert Syst. Appl., 71, 257, 10.1016/j.eswa.2016.11.034 Brazdil, 2003, Ranking learning algorithms: Using ibl and meta-learning on accuracy and time results, Mach. Learn., 50, 251, 10.1023/A:1021713901879 Lee, 2013, Automatic selection of classification learning algorithms for data mining practitioners, Intell. Data Anal., 17, 665, 10.3233/IDA-130599 Bensusan, 2001, Estimating the predictive accuracy of a classifier, 25 Reif, 2014, Automatic classifier selection for non-experts, Pattern Anal. Appl., 17, 83, 10.1007/s10044-012-0280-z Balte, 2015, Algorithm selection based on landmarking meta-feature, Commun. Appl. Electron., 2, 23, 10.5120/cae2015651784 Wang, 2014, A generic multilabel learning-based classification algorithm recommendation method, ACM Trans. Knowl. Discov. Data (TKDD), 9, 7 Zhang, 2007, Ml-knn: A lazy learning approach to multi-label learning, Pattern Recognit., 40, 2038, 10.1016/j.patcog.2006.12.019 Zhu, 2018, A new classification algorithm recommendation method based on link prediction, Knowl. Based Syst., 159, 171, 10.1016/j.knosys.2018.07.015 C. Yang, Y. Akimoto, D.W. Kim, M. Udell, Oboe: Collaborative filtering for automl model selection, in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019, pp. 1173–1183. Breiman, 1996, Stacked regressions, Mach. Learn., 24, 49, 10.1007/BF00117832 C.E. Brodley, Addressing the selective superiority problem: Automatic algorithm/model class selection, in: Proceedings of the tenth international conference on machine learning, 1993, pp. 17–24. Balte, 2014, Meta-learning with landmarking: A survey, Int. J. Comput. Appl., 105, 47 Ali, 2018, A case-based meta-learning and reasoning framework for classifiers selection, 31:1 Bensusan, 1998, God doesn’t always shave with occam’s razor—learning when and how to prune, 119 Peng, 2002, Improved dataset characterisation for meta-learning, 141 Ho, 2000, Complexity of classification problems and comparative advantages of combined classifiers, 97 Ho, 2002, Complexity measures of supervised classification problems, IEEE Trans. Pattern Anal. Mach. Intell., 24, 289, 10.1109/34.990132 N. Pise, P. Kulkarni, Algorithm selection for classification problems, in: Sai Computing Conference, 2016, pp. 203–211. Yang, 2017, Choosing classification algorithms and its optimum parameters based on data set characteristics, J. Comput., 28, 26 Khan, 2020, A literature survey and empirical study of meta-learning for classifier selection, IEEE Access, 8, 10262, 10.1109/ACCESS.2020.2964726 Dietterich, 1997, Machine-learning research, AI Mag., 18 Dietterich, 2002, The handbook of brain theory and neural networks, Ensemble Learn, 2, 110 Hansen, 1990, Neural network ensembles, IEEE Trans. Pattern Anal. Mach. Intell., 12, 993, 10.1109/34.58871 Toothaker, 1991 Demšar, 2006, Statistical comparisons of classifiers over multiple data sets, J. Mach. Learn. Res., 7, 1 Freund, 1997, A decision-theoretic generalization of on-line learning and an application to boosting, J. Comput. Syst. Sci., 55, 119, 10.1006/jcss.1997.1504 M. Hall, Correlation-Based Feature Selection for Machine Learning (Ph.D. Thesis), Waikato Univer-sity 19. Korf, 1993, Linear-space best-first search, Artificial Intelligence, 62, 41, 10.1016/0004-3702(93)90045-D Casalicchio, 2017, Openml: An r package to connect to the machine learning platform openml, Comput. Statist., 32, 1 Alcalá-Fdez, 2010, Keel data-mining software tool: data set repository, integration of algorithms and experimental analysis framework, J. Mult.-Valued Logic Soft Comput., 17, 255 Liu, 2010, Link prediction based on local random walk, Europhys. Lett., 89, 58007, 10.1209/0295-5075/89/58007 Jain, 1999, Data clustering: a review, ACM Comput. Surv. (CSUR), 31, 264, 10.1145/331499.331504 Xu, 2020, Deep clustering by maximizing mutual information in variational auto-encoder, Knowl.-Based Syst., 205, 10.1016/j.knosys.2020.106260 Chandrashekar, 2014, A survey on feature selection methods, Comput. Electr. Eng., 40, 16, 10.1016/j.compeleceng.2013.11.024 Gao, 2020, Relevance assignation feature selection method based on mutual information for machine learning, Knowl.-Based Syst., 209, 10.1016/j.knosys.2020.106439 Pan, 2009, A survey on transfer learning, IEEE Trans. Knowl. Data Eng., 22, 1345, 10.1109/TKDE.2009.191 M. Long, J. Wang, G. Ding, J. Sun, P.S. Yu, Transfer feature learning with joint distribution adaptation, in: Proceedings of the IEEE international conference on computer vision, 2013, pp. 2200–2207. Lu, 2015, Transfer learning using computational intelligence: A survey, Knowl.-Based Syst., 80, 14, 10.1016/j.knosys.2015.01.010 L. Zhong, Z. Fang, F. Liu, J. Lu, B. Yuan, G. Zhang, How does the combined risk affect the performance of unsupervised domain adaptation approaches? arXiv preprint arXiv:2101.01104.