Genetic lateral tuning for subgroup discovery with fuzzy rules using the algorithm NMEEF-SD

C. J. Carmona1, P. González1, M. J. Gacto1, M. J. del Jesus1
1Department of Computer Science, University of Jaen, Campus Las Lagunillas, Jaen, Spain

Tóm tắt

The main objective of subgroup discovery is to discover interesting and interpretable patterns with respect to a specific property. The use of evolutionary fuzzy systems provides good algorithms to approach this problem. In this sense, NMEEF-SD algorithm —one of the most representative evolutionary fuzzy systems for subgroup discovery— obtains precise and interpretable subgroups. However in the majority of the evolutionary fuzzy systems, the membership functions of the linguistic labels are usually fixed to static values and the partitions are not adapted to the context of each variable. In this paper, a post-processing tuning step to improve the results of the subgroup discovery algorithm NMEEF-SD is proposed, allowing the partitions to be adapted to the context the variables. The application of this tuning step is a novelty in subgroup discovery and consist of a genetic algorithm which allows the lateral displacement of the membership functions of a label considering a unique parameter, using the 2-tuples linguistic representation. The results obtained using different data sets of the KEEL repository show the improvement in the performance of the NMEEF-SD algorithm with lateral displacement. The study is supported by statistical tests to improve the analysis performed.

Tài liệu tham khảo

R. Alcalá, J. Alcalá-Fdez, M. J. Gacto, and F. Herrera, Rule base reduction and genetic tuning of fuzzy systems based on the linguistic 3-tuples representation, Soft Computing 11 (2007), no. 5, 401–419. R. Alcalá, J. Alcalá-Fdez, and F. Herrera, A proposal for the genetic lateral tuning of linguistic fuzzy systems and its interaction with rule selection, IEEE Transactions on Fuzzy Systems 15 (2007), no. 4, 616–635. R. Alcalá, J. Alcalá-Fdez, F. Herrera, and J. Otero, Genetic learning of accurate and compact fuzzy rule based systems based on the 2-tuples linguistic representation, International Journal of Approximate Reasoning 44 (2007), no. 1, 45–64. R. Alcalá, P. Ducange, F. Herrera, B. Lazzerini, and F. Marcelloni, A Multi-Objective Evolutionary Approach to Concurrently Learn Rule and Data Bases of Linguistic Fuzzy Rule-Based Systems, IEEE Transactions on Fuzzy Systems 17 (2009), no. 5, 1106–1122. J. Alcalá-Fdez, R. Alcalá, M. J. Gacto, and F. Herrera, Learning the Membership Function Contexts for Mining Fuzzy Association Rules by Using Genetic Algorithms, Fuzzy Sets and Systems 160 (2009), no. 7, 905–921. J. Alcalá-Fdez, A. Fernández, J. Luengo, J. Derrac, S. García, L. Sánchez, and F. Herrera, KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework, Journal of Multiple-Valued Logic and Soft Computing 17 (2011), no. 2–3, 255–287. J. Alcalá-Fdez, L. Sánchez, S. García, M.J. del Jesus, S. Ventura, J.M. Garrell, J. Otero, C. Romero, J. Bacardit, V.M. Rivas, J.C. Fernández, and F. Herrera, KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems, Soft Computing 13 (2009), no. 3, 307–318. M. Atzmueller and F. Puppe, SD-Map - A Fast Algorithm for Exhaustive Subgroup Discovery, Proceedings of the 17th European Conference on Machine Learning and 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, LNCS, vol. 4213, Springer, 2006, pp. 6–17. F. J. Berlanga, M. J. del Jesus, P. González, F. Herrera, and M. Mesonero, Multiobjective Evolutionary Induction of Subgroup Discovery Fuzzy Rules: A Case Study in Marketing, Proceedings of the 6th Industrial Conference on Data Mining, LNCS, vol. 4065, Springer, 2006, pp. 337–349. C. J. Carmona, P. González, M. J. del Jesus, and F. Herrera, NMEEF-SD: Non-dominated Multiobjective Evolutionary algorithm for Extracting Fuzzy rules in Subgroup Discovery, IEEE Transactions on Fuzzy Systems 18 (2010), no. 5, 958–970. C. J. Carmona, P. González, M. J. del Jesus, M. Navío, and L. Jiménez, Evolutionary Fuzzy Rule Extraction for Subgroup Discovery in a Psychiatric Emergency Department, Soft Computing, in press, doi: 10.1007/s00500-010-0670-3. J. Casillas and B. Carse, Special issue on Genetic Fuzzy Systems: Recent Developments and Future Directions, Soft Computing 13 (2009), no. 5, 417–418. O. Cordón, R. Alcalá, J. Alcalá-Fdez, and I. Rojas, Special Issue on Genetic Fuzzy Systems: What’s Next?, Editorial, IEEE Transactions on Fuzzy Systems 15 (2007), no. 4, 533–535. O. Cordón, M. J. del Jesus, F. Herrera, and M. Lozano, MOGUL: A Methodology to obtain genetic fuzzy rule-based systems under the iterative rule learning approach, Internation Journal of Intelligent Systems 14 (1999), 1123–1153. O. Cordón, F. A. C. Gomide, F. Herrera, F. Hoffmann, and L. Magdalena, Ten years of genetic fuzzy systems. Current framework and new trends, Fuzzy Sets and Systems 14 (2004), 5–31. O. Cordón, F. Herrera, F. Hoffmann, and L. Magdalena, Genetic Fuzzy Systems: Evolutionary Tuning and Learning of Fuzzy Knowledge Bases, World Scientific, 2001. K. Deb, A. Pratap, S. Agrawal, and T. Meyarivan, A fast and elitist multiobjective genetic algorithm: NSGA-II, IEEE Transactions Evolutionary Computation 6 (2002), no. 2, 182–197. M. J. del Jesus, P. González, and F. Herrera, Multiobjective Genetic Algorithm for Extracting Subgroup Discovery Fuzzy Rules, Proceedings of the IEEE Symposium on Computational Intelligence in Multicriteria Decision Making, IEEE Press, 2007, pp. 50–57. M. J. del Jesus, P. González, F. Herrera, and M. Mesonero, Evolutionary Fuzzy Rule Induction Process for Subgroup Discovery: A case study in marketing, IEEE Transactions on Fuzzy Systems 15 (2007), no. 4, 578–592. J. Demsar, Statistical comparisons of classifiers over multiple data sets, Journal Machine Learning Research 7 (2006), 1–30. A. E. Eiben and J. E. Smith, Introduction to evolutionary computation, Springer, 2003. L. J. Eshelman, Foundations of genetic algorithms, ch. The CHC Adaptive Search Algorithm: How to have Safe Search When Engaging in Nontraditional Genetic Recombination, pp. 265–283, 1991. L. J. Eshelman and J. D. Schaffer, Preventing premature convergence in genetic algorithms by preventing incest, Proceedings of the 4th International Conference on Genetic Algorithms, 1991, pp. 115–122. A. Fernández, M. J. del Jesus, and F. Herrera, On the 2-Tuples Based Genetic Tuning Performance for Fuzzy Rule Based Classification Systems in Imbalanced Data-SetsInformation Sciences 180:8 (2010) 1268–1291, Information Sciences 180 (2010), no. 8, 1268–1291. D. Gamberger and N. Lavrac, Expert-Guided Subgroup Discovery: Methodology and Application, Journal Artificial Intelligence Research 17 (2002), 501–527. S. García, A. Fernández, J. Luengo, and F. Herrera, Study of Statistical Techniques and Performance Measures for Genetics-Based Machine Learning: Accuracy and Interpretability, Soft Computing 13 (2009), no. 10, 959–977. S. García, A. Fernandez, J. Luengo, and F. Herrera, Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental Analysis of Power, Information Sciences 180 (2010), 2044–2064. F. Herrera, Genetic fuzzy systems: taxomony, current research trends and prospects, Evolutionary Intelligence 1 (2008), 27–46. F. Herrera, C. J. Carmona, P. González, and M. J. del Jesus, An overview on Subgroup Discovery: Foundations and Applications, Knowledge and Information Systems, in press, doi: 10.1007/s10115-010-0356-2. F. Herrera, M. Lozano, and A.M. Sánchez, A taxonomy for the crossover operator for real-coded genetic algorithms: an experimental study, International Journal of Intelligent Systems 18 (2003), 309–338. F. Herrera and L. Martínez, A 2-tuple fuzzy linguistic representation model for computing with words, IEEE Transactions on Fuzzy Systems 8 (2000), no. 6, 746–752. H. Ishibuchi, T. Nakashima, and M. Nii, Classification and modeling with linguistic information granules: Advanced approaches to linguistic data mining, Springer, 2004. B. Kavsek and N. Lavrac, APRIORI-SD: Adapting association rule learning to subgroup discovery, Applied Artificial Intelligence 20 (2006), 543–583. W. Kloesgen, Explora: A Multipattern and Multistrategy Discovery Assistant, Advances in Knowledge Discovery and Data Mining, American Association for Artificial Intelligence, 1996, pp. 249–271. L. Kuncheva, Fuzzy classifier design, Springer, 2000. N. Lavrac, B. Cestnik, D. Gamberger, and P. A. Flach, Decision Support Through Subgroup Discovery: Three Case Studies and the Lessons Learned, Machine Learning 57 (2004), no. 1–2, 115–143. N. Lavrac, P. A. Flach, and B. Zupan, Rule Evaluation Measures: A Unifying View, Proceedings of the 9th International Workshop on Inductive Logic Programming, LNCS, vol. 1634, Springer, 1999, pp. 174–185. N. Lavrac, B. Kavsek, P. A. Flach, and L. Todorovski, Subgroup Discovery with CN2-SD, Journal of Machine Learning Research 5 (2004), 153–188. Y. Nojima, R. Alcalá, H. Ishibuchi, and F. Herrera, Special Issue on Evolutionary Fuzzy Systems, Soft-Computing, in press, doi: 10.1007/s00500-010-0663-2. I. Robles, R. Alcalá, J. M. Benítez, and F. Herrera, Evolutionary Parallel and Gradually Distributed Lateral Tuning of Fuzzy Rule-Based Systems, Evolutionary Intelligence 2 (2009), 5–19. C. Romero, P. González, S. Ventura, M. J. del Jesus, and F. Herrera, Evolutionary algorithm for subgroup discovery in e-learning: A practical application using Moodle data, Expert Systems with Applications 36 (2009), 1632–1644. D. Sheskin, Handbook of parametric and nonparametric statistical procedures, 2 ed., Chapman and Hall/CRC, 2006. F. Wilcoxom, Individual comparisons by ranking methods, Biometrics 1 (1945), 80–83. S. Wrobel, An Algorithm for Multi-relational Discovery of Subgroups, Proceedings of the 1st European Symposium on Principles of Data Mining and Knowledge Discovery, LNAI, vol. 1263, Springer, 1997, pp. 78–87. L. A. Zadeh, The concept of a linguistic variable and its applications to approximate reasoning. Parts I, II, III, Information Science 8–9 (1975), 199–249,301–357,43–80. E. Zitzler, M. Laumanns, and L. Thiele, SPEA2: Improving the strength pareto evolutionary algorithm for multiobjective optimization, International Congress on Evolutionary Methods for Design Optimization and Control with Applications to Industrial Problems, 2002, pp. 95–100.