A survey of multiple classifier systems as hybrid systems

Information Fusion - Tập 16 - Trang 3-17 - 2014
Michał Woźniak1, Manuel Graña2, Emilio Corchado3
1Department of Systems and Computer Networks, Wroclaw University of Technology, Wroclaw, Poland
2Computational Intelligence Group, University of the Basque Country, San Sebastian, Spain
3Departamento de Informática y Automática, University of Salamanca, Salamanca, Spain

Tài liệu tham khảo

Neumann, 1958 Newell, 1983, Intellectual issues in the history of artificial intelligence, 187 D. Wolpert, The supervised learning no-free-lunch theorems, in: Proceedings of the 6th Online World Conference on Soft Computing in Industrial Applications, 2001, pp. 25–42. Chow, 1965, Statistical independence and threshold functions, IEEE Transactions on Electronic Computers, EC-14, 66, 10.1109/PGEC.1965.264059 Shapley, 1984, Optimizing group judgmental accuracy in the presence of interdependencies, Public Choice, 43, 329, 10.1007/BF00118940 Dasarathy, 1979, A composite classifier system design: concepts and methodology, Proceedings of the IEEE, 67, 708, 10.1109/PROC.1979.11321 L. Rastrigin, R.H. Erenstein, Method of Collective Recognition, Energoizdat, Moscow, 1981. Hansen, 1990, Neural network ensembles, IEEE Transactions on Pattern Analysis and Machine Intelligence, 12, 993, 10.1109/34.58871 Xu, 1992, Methods of combining multiple classifiers and their applications to handwriting recognition, IEEE Transactions on Systems, Man and Cybernetics, 22, 418, 10.1109/21.155943 Tumer, 1996, Analysis of decision boundaries in linearly combined neural classifiers, Pattern Recognition, 29, 341, 10.1016/0031-3203(95)00085-2 Ho, 1994, Decision combination in multiple classifier systems, IEEE Transactions on Pattern Analysis and Machine Intelligence, 16, 66, 10.1109/34.273716 Breiman, 1996, Bagging predictors, Machine Learning, 24, 123, 10.1007/BF00058655 Schapire, 1990, The strength of weak learnability, Machine Learning, 5, 197, 10.1007/BF00116037 Freund, 1995, Boosting a weak learning algorithm by majority, Information Computing, 121, 256, 10.1006/inco.1995.1136 Kearns, 1994 Angluin, 1988, Queries and concept learning, Machine Learning, 2, 319, 10.1007/BF00116828 Jain, 2000, Statistical pattern recognition: a review, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 4, 10.1109/34.824819 Oza, 2008, Classifier ensembles: select real-world applications, Information Fusion, 9, 4, 10.1016/j.inffus.2007.07.002 Polikar, 2006, Ensemble based systems in decision making, IEEE Circuits and Systems Magazine, 6, 21, 10.1109/MCAS.2006.1688199 Polikar, 2008, Ensemble learning, Scholarpedia, 3, 2776 Rokach, 2009, Taxonomy for characterizing ensemble methods in classification tasks: a review and annotated bibliography, Computational Statistics and Data Analysis, 53, 4046, 10.1016/j.csda.2009.07.017 Kuncheva, 2004 Rokach, 2010 Seni, 2010 Baruque, 2011 Duda, 2001 Alpaydin, 2010 Bishop, 2006 Dietterich, 2000, Ensemble methods in machine learning, vol. 1857, 1 G. Marcialis, F. Roli, Fusion of face recognition algorithms for video-based surveillance systems, in: G.L. Foresti, C. Regazzoni, P. Varshney (Eds.), 2003, pp. 235–250. Hashem, 1997, Optimal linear combinations of neural networks, Neural Networks, 10, 599, 10.1016/S0893-6080(96)00098-6 Clemen, 1989, Combining forecasts: a review and annotated bibliography, International Journal of Forecasting, 5, 559, 10.1016/0169-2070(89)90012-5 Quinlan, 1993 Wilk, 2011, Complexity and multithreaded implementation analysis of one class-classifiers fuzzy combiner, vol. 6679, 237 T. Kacprzak, K. Walkowiak, M. Wozniak, Optimization of overlay distributed computing systems for multiple classifier system – heuristic approach, Logic Journal of IGPL, doi:10.1093/jigpal/jzr020. Walkowiak, 2010, Anycasting in connection-oriented computer networks: models, algorithms and results, International Journal of Applied Mathematics and Computer Sciences, 20, 207 Agrawal, 2000, Privacy-preserving data mining, SIGMOD Records, 29, 439, 10.1145/335191.335438 G. Giacinto, F. Roli, G. Fumera, Design of effective multiple classifier systems by clustering of classifiers, in: Proceedings of the 15th International Conference on Pattern Recognition, 2000, vol. 2, 2000, pp. 160–163. Ho, 2000, Complexity of classification problems and comparative advantages of combined classifiers, 97 Roli, 2002 Lam, 2000, Classifier combinations: implementations and theoretical issues, 77 Rahman, 1999, Serial combination of multiple experts: a unified evaluation, Pattern Analysis and Applications, 2, 292, 10.1007/s100440050038 Fumera, 2004, A two-stage classifier with reject option for text categorisation, vol. 3138, 771 Bartlett, 2008, Classification with a reject option using a hinge loss, Journal of Machine Learning Research, 9, 1823 Termenon, 2012, A two stage sequential ensemble applied to the classification of alzheimer’s disease based on MRI features, Neural Processing Letters, 35, 1, 10.1007/s11063-011-9200-2 Clark, 1989, The CN2 induction algorithm, Machine Learning, 3, 261, 10.1007/BF00116835 Rivest, 1987, Learning decision lists, Machine Learning, 2, 229, 10.1007/BF00058680 Freund, 1997, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences, 55, 119, 10.1006/jcss.1997.1504 Wu, 2008, Top 10 algorithms in data mining, Knowledge and Information Systems, 14, 1, 10.1007/s10115-007-0114-2 Schapire, 1990, The strength of weak learnability, Machine Learning, 5, 197, 10.1007/BF00116037 J. Kivinen, M.K. Warmuth, Boosting as entropy projection, in: Proceedings of the Twelfth Annual Conference on Computational Learning Theory, 1999. <http://dl.acm.org/citation.cfm?id=307424>. Partridge, 1997, Software diversity: practical statistics for its measurement and exploitation, Information and Software Technology, 39, 707, 10.1016/S0950-5849(97)00023-2 G. Brown, L. Kuncheva, “good” And “bad” diversity in majority vote ensembles, in: Proceedings MCS 2010, pp. 124–133. Smetek, 2011, Selection of heterogeneous fuzzy model ensembles using self-adaptive genetic algorithms, New Generation Computing, 29, 309, 10.1007/s00354-010-0305-3 Sharkey, 1997, Combining diverse neural nets, Knowledge Engineering Review, 12, 231, 10.1017/S0269888997003123 Krogh, 1995, Neural network ensembles, cross validation, and active learning, Advances in Neural Information Processing Systems, 7, 231 Zenobi, 2001, Using diversity in preparing ensembles of classifiers based on different feature subsets to minimize generalization error, Machine Learning: ECML, 2001, 576 Brown, 2005, Diversity creation methods: a survey and categorisation, Information Fusion, 6, 5, 10.1016/j.inffus.2004.04.004 N. Ueda, R. Nakano, Generalization error of ensemble estimators, in: Proceedings of IEEE International Conference on Neural Networks, Washington, USA, 1996, pp. 90–95. Brown, 2005, Managing diversity in regression ensembles, Journal of Machine Learning Research, 6, 1621 Kuncheva, 2003, Limits on the majority vote accuracy in classifier fusion, Pattern Analysis and Applications, 6, 22, 10.1007/s10044-002-0173-7 Bi, 2012, The impact of diversity on the accuracy of evidential classifier ensembles, International Journal of Approximate Reasoning, 53, 584, 10.1016/j.ijar.2011.12.011 Margineantu, 1997, Pruning adaptive boosting, 211 D. Skalak, The sources of increased accuracy for two proposed boosting algorithms, in: Proceedings of the American Association for Arti Intelligence, AAAI-96, Integrating Multiple Learned Models Workshop, 1996, pp. 120–125. Giacinto, 2001, Design of effective neural network ensembles for image classification purposes, Image Vision Computing, 19, 699, 10.1016/S0262-8856(01)00045-2 R. Kohavi, D. Wolpert, Bias plus variance decomposition for zero-one loss functions, in: ICML-96, 1996. Fleiss, 1979, The reliability of dichotomous judgments: unequal numbers of judgments per subject, Applied Psychological Measurement, 4, 537, 10.1177/014662167900300410 Cunningham, 2000, Diversity versus quality in classification ensembles based on feature selection, 109 Shipp, 2002, Relationships between combination methods and measures of diversity in combining classifiers, Information Fusion, 3, 135, 10.1016/S1566-2535(02)00051-9 Tang, 2006, An analysis of diversity measures, Machine Learning, 65, 247, 10.1007/s10994-006-9449-2 Martinez-Mu/ noz, 2009, An analysis of ensemble pruning techniques based on ordered aggregation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 31, 245, 10.1109/TPAMI.2008.78 Ruta, 2005, Classifier selection for majority voting, Information Fusion, 6, 63, 10.1016/j.inffus.2004.04.008 Banfield, 2005, Ensemble diversity measures and their application to thinning, Information Fusion, 6, 49, 10.1016/j.inffus.2004.04.005 Zhou, 2002, Ensembling neural networks: many could be better than all, Artificial Intelligence, 137, 239, 10.1016/S0004-3702(02)00190-X Gabrys, 2006, Genetic algorithms in classifier fusion, Applied Soft Computing, 6, 337, 10.1016/j.asoc.2005.11.001 Partalas, 2009, Pruning an ensemble of classifiers via reinforcement learning, Neurocomputing, 72, 1900, 10.1016/j.neucom.2008.06.007 Dai, 2012, A competitive ensemble pruning approach based on cross-validation technique, Knowledge-Based Systems Peng, 2005, Cost-sensitive ensemble of support vector machines for effective detection of microcalcification in breast cancer diagnosis, vol. 3614, 483 Jackowski, 2012, Cost-sensitive splitting and selection method for medical decision support system, vol. 7435, 850 W. Du, Z. Zhan, Building decision tree classifier on private data, in: Proceedings of the IEEE International Conference on Privacy, Security and Data Mining – Volume 14, CRPIT ’14, Australian Computer Society, Inc., Darlinghurst, Australia, 2002, pp. 1–8. Krawczyk, 2011, Privacy preserving models of k-NN algorithm, vol. 95, 207 Lindell, 2008, Secure multiparty computation for privacy-preserving data mining, IACR Cryptology ePrint Archive, 2008, 197 Walkowiak, 2011, Decentralized distributed computing system for privacy-preserving combined classifiers – modeling and optimization, Vol. 6782, 512 Pavlo, 2009, A comparison of approaches to large-scale data analysis, 165 R.E. Schapire, The boosting approach to machine learning: an overview, in: MSRI Workshop on Nonlinear Estimation and Classification, Berkeley, CA, USA, 2001. T. Ho, Random decision forests, in: Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1)–Volume 1, ICDAR ’95, IEEE Computer Society, Washington, DC, USA, 1995, pp. 278–. Ho, 1998, The random subspace method for constructing decision forests, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 832, 10.1109/34.709601 Breiman, 2001, Random forests, Machine Learning, 45, 5, 10.1023/A:1010933404324 Skurichina, 2002, Bagging, boosting and the random subspace method for linear classifiers, Pattern Analysis and Applications, 5, 121, 10.1007/s100440200011 G. Tremblay, R. Sabourin, P. Maupin, Optimizing nearest neighbour in random subspaces using a multi-objective genetic algorithm, in: Proceedings of the Pattern Recognition, 17th International Conference on (ICPR’04) Volume 1–Volume 01, ICPR ’04, IEEE Computer Society, Washington, DC, USA, 2004, pp. 208–. Bay, 1999, Nearest neighbor classification from multiple feature subsets, Intelligent Data Analysis, 3, 191, 10.1016/S1088-467X(99)00018-9 Nanni, 2006, Letters: Experimental comparison of one-class classifiers for online signature verification, Neurocomputing, 69, 869, 10.1016/j.neucom.2005.06.007 Tao, 2006, Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval, IEEE Transactions on Pattern Analysis Machine Intelligence, 28, 1088, 10.1109/TPAMI.2006.134 Ting, 2011, Feature-subspace aggregating: ensembles for stable and unstable learners, Machine Learning, 82, 375, 10.1007/s10994-010-5224-5 Bryll, 2003, Attribute bagging: improving accuracy of classifier ensembles by using random feature subsets, Pattern Recognition, 36, 1291, 10.1016/S0031-3203(02)00121-8 Baram, 1998, Partial classification: the benefit of deferred decision, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 769, 10.1109/34.709564 Cordella, 2000, A cascaded multiple expert system for verification, vol. 1857, 330 K. Goebel, W. Yan, Choosing classifiers for decision fusion, in: Proceedings of the Seventh International Conference on Information Fusion, 2004, pp. 563–568. Baruque, 2011, Hybrid classification ensemble using topology-preserving clustering, New Generation Computing, 29, 329, 10.1007/s00354-011-0306-x L. Kuncheva, Clustering-and-selection model for classifier combination, in: Proceedings of the Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies, 2000, vol. 1, 2000, pp. 185–188. Jackowski, 2009, Algorithm of designing compound recognition system on the basis of combining classifiers with simultaneous splitting feature space into competence areas, Pattern Analysis and Applications, 12, 415, 10.1007/s10044-008-0137-7 M. Wozniak, B. Krawczyk, Combined classifier based on feature space partitioning, International Journal of Applied Mathematics and Computer Sciences 22 (4) (2012) 855–866. Lee, 2001, An efficient fuzzy classifier with feature selection based on fuzzy entropy, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 31, 426, 10.1109/3477.931536 Hong, 2008, Fingerprint classification using one-vs-all support vector machines dynamically ordered with naïve bayes classifiers, Pattern Recognition, 41, 662, 10.1016/j.patcog.2007.07.004 Ko, 2008, From dynamic classifier selection to dynamic ensemble selection, Pattern Recognition, 41, 1735, 10.1016/j.patcog.2007.10.015 Didaci, 2005, A study on the performances of dynamic classifier selection based on local accuracy estimation, Pattern Recognition, 38, 2188, 10.1016/j.patcog.2005.02.010 Giacinto, 2001, Dynamic classifier selection based on multiple classifier behavior, Pattern Recognition, 34, 1879, 10.1016/S0031-3203(00)00150-3 M. de Souto, R. Soares, A. Santana, A. Canuto, Empirical comparison of dynamic classifier selection methods based on diversity and accuracy for building ensembles, in: IJCNN 2008, IEEE International Joint Conference on Neural Networks, 2008, IEEE World Congress on Computational Intelligence, 2008, pp. 1480–1487. Woloszynski, 2011, A probabilistic model of classifier competence for dynamic ensemble selection, Pattern Recognition, 44, 2656, 10.1016/j.patcog.2011.03.020 Woloszynski, 2012, A measure of competence based on random classification for dynamic ensemble selection, Information Fusion, 13, 207, 10.1016/j.inffus.2011.03.007 Street, 2001, A streaming ensemble algorithm (sea) for large-scale classification, 377 Wang, 2003, Mining concept-drifting data streams using ensemble classifiers, 226 Zhang, 2006, An automatic construction and organization strategy for ensemble learning on data streams, SIGMOD Record, 35, 28, 10.1145/1168092.1168096 J. Kolter, M. Maloof, Dynamic weighted majority: a new ensemble method for tracking concept drift, in: ICDM 2003, Third IEEE International Conference on Data Mining, 2003, 2003, pp. 123–130. Tsymbal, 2008, Dynamic integration of classifiers for handling concept drift, Information Fusion, 9, 56, 10.1016/j.inffus.2006.11.002 Zhu, 2006, Effective classification of noisy data streams with attribute-oriented dynamic classifier selection, Knowledge Information Systems, 9, 339, 10.1007/s10115-005-0212-y D. Tax, R. Duin, Using two-class classifiers for multiclass classification, in: Proceedings of the 16th International Conference on Pattern Recognition, 2002, vol. 2, 2002, pp. 124 –127. Dietterich, 1995, Solving multiclass learning problems via error-correcting output codes, Journal of Artificial Intelligence Research, 2, 263, 10.1613/jair.105 Duan, 2003, Multi-category classification by soft-max combination of binary classifiers, 125 Passerini, 2004, New results on error correcting output codes of kernel machines, IEEE Transactions on Neural Networks, 15, 45, 10.1109/TNN.2003.820841 Wu, 2004, Probability estimates for multi-class classification by pairwise coupling, Journal of Machine Learning Research, 5, 975 J. Friedman, Another Approach to Polychotomous Classification, Tech. rep., Department of Statistics, Stanford University, 1996. Hüllermeier, 2010, Combining predictions in pairwise classification: an optimal adaptive voting strategy and its relation to weighted voting, Pattern Recognition, 43, 128, 10.1016/j.patcog.2009.06.013 Galar, 2011, An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs-one and one-vs-all schemes, Pattern Recognition, 44, 1761, 10.1016/j.patcog.2011.01.017 D. Tax, R.P.W. Duin, Characterizing one-class datasets, in: Proceedings of the Sixteenth Annual Symposium of the Pattern Recognition Association of South Africa, 2005, pp. 21–26. Tax, 2001, Combining one-class classifiers, 299 Wilk, 2012, Soft computing methods applied to combination of one-class classifiers, Neurocomputing, 75, 185, 10.1016/j.neucom.2011.02.023 Giacinto, 2008, Intrusion detection in computer networks by a modular ensemble of one-class classifiers, Information Fusion, 9, 69, 10.1016/j.inffus.2006.10.002 Hu, 2000 Woods, 1997, Combination of multiple classifiers using local accuracy estimates, IEEE Transactions on Pattern Analysis and Machine Intelligence, 19, 405, 10.1109/34.588027 Wozniak, 2010, Combining classifiers using trained fuser – analytical and experimental results, Neural Network World, 13, 925 Raudys, 2006, Trainable fusion rules. I. Large sample size case, Neural Networks, 19, 1506, 10.1016/j.neunet.2006.01.018 M. van Erp, L. Vuurpijl, L. Schomaker, An overview and comparison of voting methods for pattern recognition, in: Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition, 2002, 2002, pp. 195–200. Wozniak, 2009, Some remarks on chosen methods of classifier fusion based on weighted voting, vol. 5572, 541 Raudys, 2006, Trainable fusion rules. II. Small sample-size effects, Neural Networks, 19, 1517, 10.1016/j.neunet.2006.01.019 Inoue, 2002, Optimizing a multiple classifier system, vol. 2417, 1 L. Alexandre, A. Campilho, M. Kamel, Combining independent and unbiased classifiers using weighted average., in: Proceedings ICPR 2000, 2000, pp. 2495–2498. Biggio, 2007, Bayesian analysis of linear combiners, 292 Kittler, 2003, Sum versus vote fusion in multiple classifier systems, IEEE Transactions on Pattern Analysis and Machine Intelligence, 25, 110, 10.1109/TPAMI.2003.1159950 Rao, 2004, A generic sensor fusion problem: classification and function estimation, vol. 3077, 16 D. Opitz, J. Shavlik, Generating accurate and diverse members of a neural-network ensemble, in: NIPS, 1995, pp. 535–541. Rokach, 2005, Feature set decomposition for decision trees, Intelligent Data Analysis, 9, 131, 10.3233/IDA-2005-9202 Fumera, 2005, A theoretical and experimental analysis of linear combiners for multiple classifier systems, IEEE Transactions on Pattern Analysis and Machine Intelligence, 27, 942, 10.1109/TPAMI.2005.109 Wozniak, 2008, Experiments on linear combiners, vol. 47, 445 R. Duin, The combining classifier: to train or not to train? in: Proceedings of the 16th International Conference on Pattern Recognition, 2002, vol. 2, 2002, pp. 765–770. Jacobs, 1991, Adaptive mixtures of local experts, Neural Computation, 3, 79, 10.1162/neco.1991.3.1.79 Jacobs, 1995, Methods for combining experts’ probability assessments, Neural Computation, 7, 867, 10.1162/neco.1995.7.5.867 Tresp, 1995, Combining estimators using non-constant weighting functions, vol. 7, 419 Cheeseman, 1988, AutoClass: a Bayesian classification system Shlien, 1990, Multiple binary decision tree classifiers, Pattern Recognition, 23, 757, 10.1016/0031-3203(90)90098-6 Wozniak, 2007, Experiments with trained and untrained fusers, vol. 44, 144 Wozniak, 2009, Evolutionary approach to produce classifier ensemble based on weighted voting, 648 L. Lin, X. Wang, B. Liu, Combining multiple classifiers based on statistical method for handwritten chinese character recognition, in: Proceedings of the 2002 International Conference on Machine Learning and Cybernetics, 2002, vol. 1, 2002, pp. 252–255. Zheng, 2007, Constructing ensembles from data envelopment analysis, INFORMS Journal on Computing, 19, 486, 10.1287/ijoc.1060.0180 Wolpert, 1992, Stacked generalization, Neural Networks, 5, 241, 10.1016/S0893-6080(05)80023-1 Huang, 1995, A method of combining multiple experts for the recognition of unconstrained handwritten numerals, IEEE Transactions on Pattern Analysis and Machine Intelligence, 17, 90, 10.1109/34.368145 Gaber, 2005, Mining data streams: a review, SIGMOD Record, 34, 18, 10.1145/1083784.1083789 Patcha, 2007, An overview of anomaly detection techniques: existing solutions and latest technological trends, Computer Network, 51, 3448, 10.1016/j.comnet.2007.02.001 Black, 2002, Classification of customer call data in the presence of concept drift and noise, 74 Wang, 2003, Mining concept-drifting data streams using ensemble classifiers, 226 M.M. Gaber, P.S. Yu, Classification of changes in evolving data streams using online clustering result deviation, in: Proc. Of International Workshop on Knowledge Discovery in Data Streams, 2006. Markou, 2003, Novelty detection: a review – Part 1: Statistical approaches, Signal Process, 83, 2481, 10.1016/j.sigpro.2003.07.018 Salganicoff, 1993, Density-adaptive learning and forgetting Klinkenberg, 2000, Detecting concept drift with support vector machines, 487 M. Baena-García, J. del Campo-Ávila, R. Fidalgo, A. Bifet, R. Gavaldá, R. Morales-Bueno, Early drift detection method, in: Fourth International Workshop on Knowledge Discovery from Data Streams, 2006. Zliobaite, 2010, Change with delayed labeling: when is it detectable?, 843 Giacinto, 2000, Combination of neural and statistical algorithms for supervised classification of remote-sensing images, Pattern Recognition Letters, 21, 385, 10.1016/S0167-8655(00)00006-4 Rodriguez-Galiano, 2012, An assessment of the effectiveness of a random forest classifier for land-cover classification, ISPRS Journal of Photogrammetry and Remote Sensing, 67, 93, 10.1016/j.isprsjprs.2011.11.002 Gislason, 2006, Random forests for land cover classification, Pattern Recognition Letters, 27, 294, 10.1016/j.patrec.2005.08.011 Chan, 2008, Evaluation of random forest and Adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery, Remote Sensing of Environment, 112, 2999, 10.1016/j.rse.2008.02.011 Peters, 2009, Uncertainty propagation in vegetation distribution models based on ensemble classifiers, Ecological Modelling, 220, 791, 10.1016/j.ecolmodel.2008.12.022 Han, 2012, Remote sensing image classification based on neural network ensemble algorithm, Neurocomputing, 78, 133, 10.1016/j.neucom.2011.04.044 Waske, 2009, Classifier ensembles for land cover mapping using multitemporal SAR imagery, ISPRS Journal of Photogrammetry and Remote Sensing, 64, 450, 10.1016/j.isprsjprs.2009.01.003 Waske, 2012, imageRF – a user-oriented implementation for remote sensing image analysis with random forests, Environmental Modelling & Software, 35, 192, 10.1016/j.envsoft.2012.01.014 Maulik, 2011, A self-trained ensemble with semisupervised SVM: an application to pixel classification of remote sensing imagery, Pattern Recognition, 44, 615, 10.1016/j.patcog.2010.09.021 Henriques, 2010, Classification of multispectral images in coral environments using a hybrid of classifier ensembles, Neurocomputing, 73, 1256, 10.1016/j.neucom.2010.01.003 Maghsoudi, 2012, Polarimetric classification of boreal forest using nonparametric feature selection and multiple classifiers, International Journal of Applied Earth Observation and Geoinformation, 19, 139, 10.1016/j.jag.2012.04.015 Bruzzone, 2002, Combining parametric and non-parametric algorithms for a partially unsupervised classification of multitemporal remote-sensing images, Information Fusion, 3, 289, 10.1016/S1566-2535(02)00091-X Bruzzone, 2004, Detection of land-cover transitions by combining multidate classifiers, Pattern Recognition Letters, 25, 1491, 10.1016/j.patrec.2004.06.002 Du, 2013, Information fusion techniques for change detection from multi-temporal remote sensing images, Information Fusion, 14, 19, 10.1016/j.inffus.2012.05.003 Arun-Raj-Kumar, 2011, Distributed denial of service attack detection using an ensemble of neural classifier, Computer Communications, 34, 1328, 10.1016/j.comcom.2011.01.012 Kumar, 2012, Detection of distributed denial of service attacks using an ensemble of adaptive and hybrid neuro-fuzzy systems, Computer Communications Shabtai, 2009, Detection of malicious code by applying machine learning classifiers on static features: a state-of-the-art survey, Information Security Technical Report, 14, 16, 10.1016/j.istr.2009.03.003 Locasto, 2006, Flips: hybrid adaptive intrusion prevention, 82 Wang, 2006, Anomalous payload-based worm detection and signature generation, 227 Peddabachigari, 2007, Modeling intrusion detection system using hybrid intelligent systems, Journal of Network and Computer Applications, 30, 114, 10.1016/j.jnca.2005.06.003 Curiac, 2012, Ensemble based sensing anomaly detection in wireless sensor networks, Expert Systems with Applications, 39, 9087, 10.1016/j.eswa.2012.02.036 Bolton, 2002, Statistical fraud detection: a review, Statistical Science, 17, 235, 10.1214/ss/1042727940 Louzada, 2012, Bagging k-dependence probabilistic networks: an alternative powerful fraud detection tool, Expert Systems with Applications, 39, 11583, 10.1016/j.eswa.2012.04.024 Bhattacharyya, 2011, Data mining for credit card fraud: a comparative study, Decision Support Systems, 50, 602, 10.1016/j.dss.2010.08.008 Yu, 2010, Support vector machine based multiagent ensemble learning for credit risk evaluation, Expert Systems with Applications, 37, 1351, 10.1016/j.eswa.2009.06.083 Kim, 2012, Stock fraud detection using peer group analysis, Expert Systems with Applications, 39, 8986, 10.1016/j.eswa.2012.02.025 Twala, 2010, Multiple classifier application to credit risk assessment, Expert Systems with Applications, 37, 3326, 10.1016/j.eswa.2009.10.018 Finlay, 2011, Multiple classifier architectures and their application to credit risk assessment, European Journal of Operational Research, 210, 368, 10.1016/j.ejor.2010.09.029 Wang, 2012, A hybrid ensemble approach for enterprise credit risk assessment based on support vector machine, Expert Systems with Applications, 39, 5325, 10.1016/j.eswa.2011.11.003 Kim, 2012, Classifiers selection in ensembles using genetic algorithms for bankruptcy prediction, Expert Systems with Applications, 39, 9308, 10.1016/j.eswa.2012.02.072 Ravisankar, 2010, Failure prediction of dotcom companies using neural network–genetic programming hybrids, Information Sciences, 180, 1257, 10.1016/j.ins.2009.12.022 Ravisankar, 2011, Detection of financial statement fraud and feature selection using data mining techniques, Decision Support Systems, 50, 491, 10.1016/j.dss.2010.11.006 Tsai, 2011, Combining cluster analysis with classifier ensembles to predict financial distress, Information Fusion Peng, 2011, An empirical study of classification algorithm evaluation for financial risk prediction, Applied Soft Computing, 11, 2906, 10.1016/j.asoc.2010.11.028 Ravi, 2008, Soft computing system for bank performance prediction, Applied Soft Computing, 8, 305, 10.1016/j.asoc.2007.02.001 Zhao, 2009, Effects of feature construction on classification performance: an empirical study in bank failure prediction, Expert Systems with Applications, 36, 2633, 10.1016/j.eswa.2008.01.053 Aral, 2012, A prescription fraud detection model, Computer Methods and Programs in Biomedicine, 106, 37, 10.1016/j.cmpb.2011.09.003 Christou, 2011, Detecting fraud in online games of chance and lotteries, Expert Systems with Applications, 38, 13158, 10.1016/j.eswa.2011.04.124 Farvaresh, 2011, A data mining framework for detecting subscription fraud in telecommunication, Engineering Applications of Artificial Intelligence, 24, 182, 10.1016/j.engappai.2010.05.009 Subelj, 2011, An expert system for detecting automobile insurance fraud using social network analysis, Expert Systems with Applications, 38, 1039, 10.1016/j.eswa.2010.07.143 Garg, 2005, Effects of computerized clinical decision support systems on practitioner performance and patient outcomes: a systematic review, Journal of the American Medical Association, 293, 1223, 10.1001/jama.293.10.1223 Eom, 2008, AptaCDSS-E: a classifier ensemble-based clinical decision support system for cardiovascular disease level prediction, Expert Systems with Applications, 34, 2465, 10.1016/j.eswa.2007.04.015 Das, 2009, Effective diagnosis of heart disease through neural networks ensembles, Expert Systems with Applications, 36, 7675, 10.1016/j.eswa.2008.09.013 Das, 2009, Diagnosis of valvular heart disease through neural networks ensembles, Computer Methods and Programs in Biomedicine, 93, 185, 10.1016/j.cmpb.2008.09.005 Baxt, 1992, Improving the accuracy of an artificial neural network using multiple differently trained networks, Neural Computation, 4, 772, 10.1162/neco.1992.4.5.772 Zhang, 1992, Hybrid system for protein secondary structure prediction, Journal of Molecular Biology, 225, 1049, 10.1016/0022-2836(92)90104-R Nanni, 2006, Ensemble of classifiers for protein fold recognition, Neurocomputing, 69, 850, 10.1016/j.neucom.2005.08.006 Yang, 2011, Margin-based ensemble classifier for protein fold recognition, Expert Systems with Applications, 38, 12348, 10.1016/j.eswa.2011.04.014 Savio, 2011, Neurocognitive disorder detection based on feature vectors extracted from VBM analysis of structural MRI, Computers in Biology and Medicine, 41, 600, 10.1016/j.compbiomed.2011.05.010 Chyzhyk, 2012, Hybrid dendritic computing with kernel-LICA applied to alzheimer’s disease detection in MRI, Neurocomputing, 75, 72, 10.1016/j.neucom.2011.02.024 Kuncheva, 2010, Classifier ensembles for fMRI data analysis: an experiment, Magnetic Resonance Imaging, 28, 583, 10.1016/j.mri.2009.12.021 Plumpton, 2012, Naive random subspace ensemble with linear classifiers for real-time classification of fMRI data, Pattern Recognition, 45, 2101, 10.1016/j.patcog.2011.04.023 Cabral, 2012, Decoding visual brain states from fMRI using an ensemble of classifiers, Pattern Recognition, 45, 2064, 10.1016/j.patcog.2011.04.015 Adomavicius, 2005, Incorporating contextual information in recommender systems using a multidimensional approach, ACM Transactions Information Systems, 23, 103, 10.1145/1055709.1055714 Konstan, 2012, How online merchants predict your preferences and prod you to purchase, IEEE Spectrum, 49, 48 Burke, 2002, Hybrid recommender systems: survey and experiments, User Modeling and User-Adapted Interaction, 12, 331, 10.1023/A:1021240730564 Balabanović, 1997, collaborative recommendation, Communications of the ACM, 40, 66, 10.1145/245108.245124 Pazzani, 1999, A framework for collaborative, content-based and demographic filtering, Artificial Intelligence Review, 13, 393, 10.1023/A:1006544522159 Jahrer, 2010, Combining predictions for accurate recommender systems, 693 Porcel, 2012, A hybrid recommender system for the selective dissemination of research resources in a technology transfer office, Information Sciences, 184, 1, 10.1016/j.ins.2011.08.026 Claypool, 1999, Combining content-based and collaborative filters in an online newspaper Billsus, 2000, User modeling for adaptive news access, User Modeling and User-Adapted Interaction, 10, 147, 10.1023/A:1026501525781 T. Tran, R. Cohen, Hybrid recommender systems for electronic commerce, in: Knowledge-Based Electronic Markets, Papers from the AAAI Workshop, AAAI Technical Report WS-00-04, AAAI Press, Menlo Park, CA, 2000, pp. 78–83. Kunaver, 2007, Optimisation of combined collaborative recommender systems, AEU – International Journal of Electronics and Communications, 61, 433, 10.1016/j.aeue.2007.04.003 Goksedef, 2010, Combination of web page recommender systems, Expert Systems with Applications, 37, 2911, 10.1016/j.eswa.2009.09.046