Rule extraction using Recursive-Rule extraction algorithm with J48graft combined with sampling selection techniques for the diagnosis of type 2 diabetes mellitus in the Pima Indian dataset

Informatics in Medicine Unlocked - Tập 2 - Trang 92-104 - 2016
Yoichi Hayashi1, Shonosuke Yukita1
1Department of Computer Science, Meiji University, Tama-ku, Kawasaki, Kanagawa 214-8571, Japan

Tóm tắt

Từ khóa


Tài liệu tham khảo

Beloufa, 2013, Design of fuzzy classifier for diabetes disease using modified artificial bee colony algorithm, Comput Methods Prog Biomed, 112, 92, 10.1016/j.cmpb.2013.07.009

Marateb, 2014, A hybrid intelligent system for diagnosing microalbuminuria in type 2 diabetes patients without having to measure urinary albumin, Comput Biol Med, 45, 34, 10.1016/j.compbiomed.2013.11.006

Centers for Disease Control and Prevention, 2014

Zhu, 2015, An improved early detection method of type-2 diabetes mellitus using multiple classifier system, Inf Sci, 292, 1, 10.1016/j.ins.2014.08.056

Hunter, 2005, Gene-environment interactions in human diseases, Nat Rev Genet, 6, 287, 10.1038/nrg1578

Homme, 2008, Dynamic pharmacogenetic models in anticoagulation therapy, Clin Lab Med, 28, 539, 10.1016/j.cll.2008.10.002

Ding, 2015, Extreme learning machine: algorithm, theory and applications, Artif Intell Rev, 44, 103, 10.1007/s10462-013-9405-z

Yilmaz, 2014, A new data preparation method based on clustering algorithms for diagnosis systems of heart and diabetes diseases, J Med Syst, 38, 48, 10.1007/s10916-014-0048-7

Gürbüz, 2014, A new adaptive support vector machine for diagnosis of diseases, Expert Syst, 31, 389, 10.1111/exsy.12051

Shanker, 1996, Using neural networks to predict the onset of diabetes mellitus, J Chem Inf Comput Sci, 36, 35, 10.1021/ci950063e

Park, 2001, A sequential neural network model for diabetes prediction, Artif Intell Med, 23, 277, 10.1016/S0933-3657(01)00086-0

Gadaras, 2009, An interpretable fuzzy rule-based classification methodology for medical diagnosis, Artif Intell Med, 47, 25, 10.1016/j.artmed.2009.05.003

Ghazavi, 2008, Medical data mining by fuzzy modeling with selected features, Artif Intell Med, 43, 195, 10.1016/j.artmed.2008.04.004

Liu, 2013, Extraction of fuzzy rules from fuzzy decision trees: an axiomatic fuzzy sets (AFS) approach, Data Knowl Eng, 84, 1, 10.1016/j.datak.2012.12.001

Chavas, 2013, Fuzzy rules extraction from support vector machines for multi-class classification, Neural Comput Appl, 22, 1571, 10.1007/s00521-012-1048-5

Lekkas, 2010, Evolving fuzzy medical diagnosis of Pima Indians diabetes and of dermatological disease, Artif Intell Med, 50, 117, 10.1016/j.artmed.2010.05.007

Übeyli, 2009, Modified mixture of experts for diabetes diagnosis, J Med Syst, 33, 299, 10.1007/s10916-008-9191-3

Polat, 2007, An improved approach to medical data sets classification: artificial immune recognition system with fuzzy resource allocation mechanism, Expert Syst, 24, 252, 10.1111/j.1468-0394.2007.00432.x

Chikh, 2012, Diagnosis of diabetes diseases using an Artificial Immune Recognition System2 (AIRS2) with fuzzy K-nearest neighbor, J Med Syst, 36, 2721, 10.1007/s10916-011-9748-4

Polat, 2007, An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes diseases, Digit Signal Process, 17, 702, 10.1016/j.dsp.2006.09.005

Christopher, 2015, A swarm optimization approach for clinical knowledge mining, Comput Methods Prog Biomed, 121, 137, 10.1016/j.cmpb.2015.05.007

Seera, 2014, A hybrid intelligent system for medical data classification, Expert Syst Appl, 41, 2239, 10.1016/j.eswa.2013.09.022

Aslam, 2013, Feature generation using genetic programming with comparative partner selection for diabetes classification, Expert Syst Appl, 40, 5402, 10.1016/j.eswa.2013.04.003

Patil, 2010, Hybrid prediction model for type-2 diabetic patients, Expert Syst Appl, 37, 8102, 10.1016/j.eswa.2010.05.078

Purwar, 2015, Hybrid prediction model with missing value imputation for medical data, Expert Syst Appl, 42, 5621, 10.1016/j.eswa.2015.02.050

Su, 2006, Knowledge acquisition through information granulation for imbalanced data, Expert Syst Appl, 31, 531, 10.1016/j.eswa.2005.09.082

Õ..zbaki, 2011, Exploring comprehensible classification rules from trained neural networks integrated with a time-varying binary particle swarm optimizer, Eng Appl Artif Intell, 24, 491, 10.1016/j.engappai.2010.11.008

Rodríguez, 2011, Efficient distributed genetic algorithm for rule extraction, Appl Soft Comput, 11, 733, 10.1016/j.asoc.2009.12.035

Gonçalves, 2006, Invited hierarchical neuro-fuzzy model for pattern classification and rule extraction in diabetes, IEEE Trans Syst Man Cyber-Part C: Appl Rev, 36, 236, 10.1109/TSMCC.2004.843220

Chang, 2004, Evolutionary design of a fuzzy classifier from data, IEEE Trans Syst Man Cyber-Part B: Cyber, 34, 1894, 10.1109/TSMCB.2004.831160

Abonyi, 2003, Data-driven generation of compact, accurate, and linguistically sound fuzzy classifiers based on a decision-tree initialization, Int J Approx Reason, 32, 1, 10.1016/S0888-613X(02)00076-2

Salari, 2014, A novel hybrid classification model of genetic algorithms, modified k-nearest neighbor and developed backpropagation neural network, PLoS One, 9, e112987, 10.1371/journal.pone.0112987

Ozcift, 2011, Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms, Comput Methods Prog Biomed, 104, 443, 10.1016/j.cmpb.2011.03.018

Luukka, 2007, Similarity classifier using similarity measure derived fromYu׳s norms in classification of medical data sets, Comput Biol Med, 37, 1133, 10.1016/j.compbiomed.2006.10.005

Polat, 2008, A cascade learning system for classification of diabetes disease: Generalized Discriminant Analysis and Least Square Support Vector Machine, Expert Syst Appl, 34, 482, 10.1016/j.eswa.2006.09.012

Kahramanli, 2008, Design of a hybrid system for the diabetes and heart diseases, Expert Syst Appl, 35, 82, 10.1016/j.eswa.2007.06.004

Temurtas, 2009, A comparative study on diabetes disease diagnosis using neural networks, Expert Syst Appl, 36, 8610, 10.1016/j.eswa.2008.10.032

Übeyli, 2010, Automatic diagnosis of diabetes using adaptive neuro-fuzzy inference systems, Expert Syst, 27, 259, 10.1111/j.1468-0394.2010.00527.x

Örkcü, 2011, Comparing performances of backpropagation and genetic algorithms in the data classification, Expert Syst Appl, 38, 3703, 10.1016/j.eswa.2010.09.028

Luukka, 2011, Feature selection using fuzzy entropy measures with similarity classifier, Expert Syst Appl, 38, 4600, 10.1016/j.eswa.2010.09.133

Isa, 2011, Clustered-Hybrid Multilayer Perceptron network for pattern recognition application, Appl Soft Comput, 11, 1457, 10.1016/j.asoc.2010.04.017

Belle, 2014, White box radial basis function classifiers with component selection for clinical prediction models, Artif Intell Med, 60, 53, 10.1016/j.artmed.2013.10.001

Hoffmann, 2007, Inferring descriptive and approximate fuzzy rules for credit scoring using evolutionary algorithms, Eur J Oper Res, 177, 540, 10.1016/j.ejor.2005.09.044

Wang, 2015, An improved electromagnetism-like mechanism algorithm and its application to the prediction of diabetes mellitus, J Biomed Inf, 54, 220, 10.1016/j.jbi.2015.02.001

Seera, 2015, A hybrid FAM–CART model and its application to medical data classification, Neural Comput Appl, 26, 1799, 10.1007/s00521-015-1852-9

Mohapatra, 2015, An improved cuckoo search based extreme learning machine for medical data classification, Swarm Evol Comput, 24, 25, 10.1016/j.swevo.2015.05.003

Feng, 2015, Variable coded hierarchical fuzzy classification model using DNA coding and evolutionary programming, Appl Math Model, 39, 7401, 10.1016/j.apm.2015.03.004

Napierala, 2012, BRACID: a comprehensive approach to learning rules from imbalanced data, J Intell Inf Syst, 39, 335, 10.1007/s10844-011-0193-0

Setiono, 2008, Recursive neural network rule extraction for data with mixed attributes, IEEE Trans Neural Netw, 19, 299, 10.1109/TNN.2007.908641

Quinlan JR. C4.5: programs for machine learning. Morgan Kaufmann Series in Machine Learning. San Mateo, California: Morgan Kaufman, Inc.; 1993.

Hayashi, 2016, Recursive-Rule Extraction algorithm with J48graft and applications to generating credit scores, J Artif Intell Soft Comput Res, 6, 35, 10.1515/jaiscr-2016-0004

〈http://fiji.sc/javadoc/weka/classifiers/trees/J48graft.html〉; [Last accessed 30.09.15].

Webb GI. Decision tree grafting from the all-tests-but-one partition, In: Proc. 16th international joint conference on artificial intelligence (IJCAI), 2; 1999. p. 702–07.

Setiono, 1997, A penalty-function approach for pruning feedforward neural networks, Neural Comp, 9, 185, 10.1162/neco.1997.9.1.185

Setiono R. Sampling selection and neural network rule extraction for credit scoring. In: Proceedings of the 43rd decision sciences institutes annual meeting; 2012. p. 1280–90.

Setiono, 2015, Using sample selection to improve accuracy and simplicity of rules extracted from neural networks for credit scoring applications, Int J Comput Intell Appl, 14, 1550021-1, 10.1142/S1469026815500212

University of California, Irvine Learning Repository, 〈http://archive/ics.uci/edu/m/〉; [last accessed 01.10.15].

Knowler, 1979, Islet cell antibodies and diabetes mellitus in Pima Indians, Diabetologia, 17, 161, 10.1007/BF01219743

Smith JW, et al. Using the ADAP learning algorithm to forecast the onset of diabetes mellitus. In: Proc. Twelfth Annual Symposium Comput. Applications Medical Care; 1988. p. 261–65.

Witten, 1999

Quinlan, 1986, Induction of decision trees, Mach Learn, 1, 81, 10.1007/BF00116251

Webb GI. Decision tree grafting, learning. In: Proc. IJCAI’97 15th international conference on artificial intelligence (IJCAI) 2; 1997. p. 846–85.

Purwar, 2015, Hybrid prediction model with missing value information for medical data, Expert Syst Appl, 42, 5621, 10.1016/j.eswa.2015.02.050

Holt, 2006

Breault, 2001, Data mining diabetic databases: are rough sets a useful addition?, 33, 51

Gagliardi, 2011, Instance-based classifiers applied to medical databases: Diagnosis and knowledge extraction, Artif Intell Med, 52, 123, 10.1016/j.artmed.2011.04.002

Breault, 2002, Data mining a diabetic data warehouse, Artif Intell Med, 26, 37, 10.1016/S0933-3657(02)00051-9

Amine Chikh, 2012, Diagnosis of diabetes diseases using an Artificial Immune Recognition System2 (AIRS2) with fuzzy K-nearest neighbor, J Med Syst, 36, 2721, 10.1007/s10916-011-9748-4

Salzberg, 1997, On comparing classifiers: pitfalls to avoid and a recommended approach, Data Min Knowl Disco, 1, 317, 10.1023/A:1009752403260

Marqués, 2013, On the suitability of resampling techniques for the class imbalance problem in credit scoring, J Oper Res Soc, 64, 1060, 10.1057/jors.2012.120

American Diabetes Association, 2015, Standards of medical care in diabetes-2015, Diabetes Care, 38, S1

UK Prospective Diabetes Study (UKPDS) Group, 1998, Intensive blood-glucose control with sulphonylureas or insulin compared with conventional treatment and risk of complications in patients with type 2 diabetes (UKPDS 33), Lancet, 352, 837, 10.1016/S0140-6736(98)07019-6

UK Prospective Diabetes Study (UKPDS) Group, 1998, Effect of intensive blood-glucose control with metformin on complications in overweight patients with type 2 diabetes (UKPDS 34), Lancet, 352, 854, 10.1016/S0140-6736(98)07037-8

The DCCT Research Group, 1992, The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus, N Eng J Med, 329, 977

Geneuth, 2008, The UKPDS and its global impact, Diabet Med, 25, 57, 10.1111/j.1464-5491.2008.02504.x

Kilpatrick, 2010, ABCD position statement on haemoglobin A1c for the diagnosis of diabetes, Pr Diabetes, 27, 1

Nathan, 2007, Impaired fasting glucose and impaired glucose tolerance, Diabetes Care, 30, 753, 10.2337/dc07-9920

Cavagnolli, 2011, HbA1c measurement for the diagnosis of diabetes: is it enough?, Diabet Med, 28, 31, 10.1111/j.1464-5491.2010.03159.x

Hayashi, 2013, Patterns of insulin concentration during the OGGT predict the risk of type 2 diabetes in Japanese Americans, Diabetes Care, 36, 1229, 10.2337/dc12-0246

Araneta, 2015, Optimum BMI cut points to screen Asian Americans for type 2 diabetes, Diabetes Care, 38, 814, 10.2337/dc14-2071

Hsia, 2015, Impact of lowering BMI cut points as recommended in the revised American Diabetes Association׳s Standards of Medical Care in Diabetes-2015 on diabetes screening in Asian Americans, Diabetes Care, 38, 2166, 10.2337/dc15-0299

Boffeta, 2011, Body mass index and diabetes in Asia: a cross-sectional pooled analysis of 900,000 individuals in the Asia cohort consortium, Plos One, 6, e19930, 10.1371/journal.pone.0019930

Fortuny, 2015, Active learning-based pedagogical rule extraction, IEEE Trans Neural Netw Learn Syst, 26, 2664, 10.1109/TNNLS.2015.2389037