IntelliHealth: A medical decision support application using a novel weighted multi-layer classifier ensemble framework

Journal of Biomedical Informatics - Tập 59 - Trang 185-200 - 2016
Saba Bashir1, Usman Qamar1, Farhan Hassan Khan1
1Computer Engineering Department, College of Electrical and Mechanical Engineering, National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan

Tóm tắt

Từ khóa


Tài liệu tham khảo

2015

Chen, 2015, OrderRex: clinical order decision support and outcome predictions by data-mining electronic medical records, J. Am. Med. Inform. Assoc.

Dua, 2011

Ahmad, 2015, Random ordinality ensembles: ensemble methods for multi-valued categorical data, Inf. Sci., 296, 75, 10.1016/j.ins.2014.10.064

Sluban, 2015, Relating ensemble diversity and performance: a study in class noise detection, Neurocomputing, 160, 120, 10.1016/j.neucom.2014.10.086

Moretti, 2015, Urban traffic flow forecasting through statistical and neural network bagging ensemble hybrid modeling, Neurocomputing, 10.1016/j.neucom.2014.08.100

Kim, 2015, Geometric mean based boosting algorithm with over-sampling to resolve data imbalance problem for bankruptcy prediction, Expert Syst. Appl., 42, 1074, 10.1016/j.eswa.2014.08.025

Kang, 2015, Multi-class classification via heterogeneous ensemble of one-class classifiers, Eng. Appl. Artif. Intell., 43, 35, 10.1016/j.engappai.2015.04.003

Kitakaze, 2015, Data mining as a powerful tool for creating novel drugs in cardiovascular medicine: the importance of a “back-and-forth loop” between clinical data and basic research, Cardiovasc. Drugs Ther., 1

Zhu, 2013, Biomedical text mining and its applications in cancer research, J. Biomed. Inform., 46, 200, 10.1016/j.jbi.2012.10.007

Shah, 2015, Clinical diabetes research using data mining: a Canadian perspective, Can. J. Diabetes, 39, 235, 10.1016/j.jcjd.2015.02.005

Lavrač, 1999, Selected techniques for data mining in medicine, Artif. Intell. Med., 16, 3, 10.1016/S0933-3657(98)00062-1

Kumar, 2012, Hepatitis prediction model based on data mining algorithm and optimal feature selection to improve predictive accuracy, Int. J. Comput. Appl., 51, 13

Thenmozhi, 2014, Heart disease prediction using classification with different decision tree techniques, Int. J. Eng. Res. Gen. Sci., 2

Chitra, 2013, Heart disease prediction system using supervised learning classifier, Int. J. Softw. Eng. Soft Comput., 3

Shouman, 2013, Integrating clustering with different data mining techniques in the diagnosis of heart disease, J. Comput. Sci. Eng., 20

Shouman, 2012, Integrating Naive Bayes and K-means clustering with different initial centroid selection methods in the diagnosis of heart disease patients, Comput. Sci. Inform. Technol., 125

S. Ghumbre, C. Patil, A. Ghatol, Heart disease diagnosis using support vector machine, in: International Conference on Computer Science and Information Technology (ICCSIT’), Pattaya, 2011.

Chaurasia, 2014, A novel approach for breast cancer detection using data mining techniques, Int. J. Innov. Res. Comput. Commun. Eng., 2

Ashfaq Ahmed, 2013, Comparative prediction performance with support vector machine and random forest classification techniques, Int. J. Comput. Appl., 69

Salama, 2012, Breast cancer diagnosis on three different datasets using multi-classifiers, Int. J. Comput. Inform. Technol., 01

Lavanya, 2012, Ensemble decision tree classifier for breast cancer data, Int. J. Inform. Technol. Converg. Serv. (IJITCS), 2

Lavanya, 2012, Analysis of feature selection with classification: breast cancer datasets, Ind. J. Comput. Sci. Eng. (IJCSE), 2

Gandhi, 2014, Diabetes prediction using feature selection and classification, Int. J. Adv. Eng. Res. Develop.

Stahl, 2014

Aslam, 2013, Feature generation using genetic programming with comparative partner selection for diabetes classification, Expert Syst. Appl., 10.1016/j.eswa.2013.04.003

M. Nirmala Devi, S. Appavu, U.V. Swathi, An amalgam KNN to predict diabetes mellitus, in: International Conference on Emerging Trends in Computing, Communication and Nanotechnology (ICE-CCN), 2013.

Christobel, 2012, The negative impact of missing value imputation in classification of diabetes dataset and solution for improvement, IOSR J. Comput. Eng. (IOSRJCE), 7, 5

Zolfaghari, 2012, Diagnosis of diabetes in female population of Pima Indian heritage with ensemble of BP neural network and SVM, IJCEM Int. J. Comput. Eng. Manage., 15

Lee, 2011, A fuzzy expert system for diabetes decision support application, IEEE Trans. Syst. Man Cybern. B Cybern., 41

Vijayarani, 2015, Liver disease prediction using SVM and Naïve Bayes algorithms, Int. J. Sci. Eng. Technol. Res. (IJSETR), 4

Jin, 2014, Decision factors on effective liver patient data prediction, Int. J. BioSci. BioTechnol., 6

Sugawara, 2012, Acute liver failure in Japan: definition, classification, and prediction of the outcome, J. Gastroenterol., 10.1007/s00535-012-0624-x

Kumar, 2013, Prediction of different types of liver diseases using rule based classification model, Technol. Healthcare, 21, 417

Ramana, 2012, Liver classification using modified rotation forest, Int. J. Eng. Res. Develop., 1, 17

Pushpalatha, 2014, Data model comparison for hepatitis diagnosis, Int. J. Emerg. Res. Manage. Technol., 3, 138

El Houby, 2013, Analysis of associative classification for prediction of HCV response to treatment, Int. J. Comput. Appl., 63

Karthikeyan, 2013, Analysis of classification algorithms applied to hepatitis patients, Int. J. Comput. Appl., 62

Yılmaz Kaya, Murat Uyar, A hybrid decision support system based on rough set and extreme learning machine for diagnosis of hepatitis disease, 2013.

Kumar, 2012, Hepatitis prediction model based on data mining algorithm and optimal feature selection to improve predictive accuracy, Int. J. Comput. Appl., 51

Díez-Pastor, 2015, Random balance: ensembles of variable priors classifiers for imbalanced data, Knowl.-Based Syst., 10.1016/j.knosys.2015.04.022

King, 2015, Ensemble learning methods for pay-per-click campaign management, Expert Syst. Appl., 42, 4818, 10.1016/j.eswa.2015.01.047

Parvin, 2015, Proposing a classifier ensemble framework based on classifier selection and decision tree, Eng. Appl. Artif. Intell., 37, 34, 10.1016/j.engappai.2014.08.005

Mendes-Moreira, 2015, Improving the accuracy of long-term travel time prediction using heterogeneous ensembles, Neurocomputing, 150, 428, 10.1016/j.neucom.2014.08.072

Park, 2014, Efficient implementation of class-based decomposition schemes for Naïve Bayes, Mach. Learn., 96, 295, 10.1007/s10994-013-5430-z

Hino, 2014, Non-parametric entropy estimators based on simple linear regression, Comput. Stat. Data Anal., 89, 72, 10.1016/j.csda.2015.03.011

Bose, 2015, Generalized quadratic discriminant analysis, Pattern Recogn., 48, 2676, 10.1016/j.patcog.2015.02.016

Lin, 2013, Double-bootstrapping source data selection for instance-based transfer learning, Pattern Recogn. Lett., 34, 1279, 10.1016/j.patrec.2013.04.012

Datta, 2015, Near-Bayesian support vector machines for imbalanced data classification with equal or unequal misclassification costs, Neural Networks, 70, 39, 10.1016/j.neunet.2015.06.005

Ibarguren, 2015, Coverage-based resampling: building robust consolidated decision trees, Knowl.-Based Syst., 79, 51, 10.1016/j.knosys.2014.12.023

Senan, 2015, Robustness analysis of uncertain dynamical neural networks with multiple time delays, Neural Networks, 10.1016/j.neunet.2015.07.001

Yang, 2015, Comparison of prediction models for coronary heart diseases in depression patients, Int. J. Multimedia Ubiq. Eng., 10, 257, 10.14257/ijmue.2015.10.3.24

Peter, 2015, Probabilistic classification for prediction of heart disease, Aust. J. Basic Appl. Sci., 9, 639

Kiruthika, 2014, An ill-identified classification to predict cardiac disease using data clustering, Int. J. Data Min. Tech. Appl., 03, 321

Bashir, 2015, BagMOOV: a novel ensemble for heart disease prediction bootstrap aggregation with multi-objective optimized voting, Australas. Phys. Eng. Sci. Med., 1

Sørensen, 2015, Long non-coding RNA expression profiles predict metastasis in lymph node-negative breast cancer independently of traditional prognostic markers, Breast Cancer Res., 10.1186/s13058-015-0557-4

Zand, 2015, A comparative survey on data mining techniques for breast cancer diagnosis and prediction, Ind. J. Fundam. Appl. Life Sci., 5, 4330

Chaurasia, 2014, Data mining techniques: to predict and resolve breast cancer survivability, Int. J. Comput. Sci. Mob. Comput., 3, 10

Kandhasamy, 2015, Performance analysis of classifier models to predict diabetes mellitus, Proc. Comput. Sci., 47, 10.1016/j.procs.2015.03.182

S. Bashir, U. Qamar, F.H. Khan, An efficient rule based classification of diabetes using ID3, C4.5 and CART ensemble, in: Frontier Information Technology, IEEE, Islamabad, Pakistan, 2015.

Tapak, 2013, Real-data comparison of data mining methods in prediction of diabetes in Iran, Healthcare Inform. Res., 19, 177, 10.4258/hir.2013.19.3.177

Karthikeyani, 2012, Comparative of data mining classification algorithm (CDMCA) in diabetes disease prediction, Int. J. Comput. Appl., 60

Julia, 2014, Liver patient classification using intelligent techniques, Int. J. Comput. Sci. Inform. Technol., 5, 5110

Jin, 2014, Decision factors on effective liver patient data prediction, Int. J. BioSci. BioTechnol., 6, 167

Sug, 2012, Improving the prediction accuracy of liver disorder disease with oversampling, Appl. Math. Electric. Comput. Eng., 331

Ramana, 2011, A critical study of selected classification algorithms for liver disease diagnosis, Int. J. Datab. Manage. Syst. (IJDMS), 3

Karthik, 2011, Classification and rule extraction using rough set for diagnosis of liver disease and its types, Adv. Appl. Sci. Res., 2, 334

Houby, 2014, A framework for prediction of response to HCV therapy using different data mining techniques, Adv. Bioinform., 2014, 10.1155/2014/181056

Neshat, 2012, Hepatitis disease diagnosis using hybrid case based reasoning and particle swarm optimization, Artif. Intell.

Khan, 2008, Evaluation of diagnostic accuracy of apri for prediction of fibros is in hepatitis c patients, J. Ayub. Med. Coll. Abbottabad.

K. Zolfaghar, N. Verbiest, J. Agarwal, N. Meadem, S.C. Chin, S.B. Roy, et al., Predicting risk-of-readmission for congestive heart failure patients: a multi-layer approach, 2013. Available from: <arXiv:1306.2094>.

M.A.A. Burungale, D.A. Zende, Survey of Large-scale Hierarchical Classification.

Ensemble Methods. <http://scikit-learn.org/stable/modules/ensemble.html> (accessed 05.10.15).

Ali, 2014, IDM-PhyChm-Ens: intelligent decision making ensemble methodology for classification of human breast cancer using physicochemical properties of amino acids, Amino Acids, 46, 977, 10.1007/s00726-013-1659-x

Majid, 2014, Prediction of human breast and colon cancers from imbalanced data using nearest neighbor and support vector machines, Comput. Methods Programs Biomed., 113, 792, 10.1016/j.cmpb.2014.01.001

Ali, 2015, Can-Evo-Ens: classifier stacking based evolutionary ensemble system for prediction of human breast cancer using amino acid sequences, J. Biomed. Inform., 256, 10.1016/j.jbi.2015.01.004

Majid, 2014, HBC-Evo: predicting human breast cancer by exploiting amino acid sequence based feature spaces and evolutionary ensemble, Amino Acids, 47, 217, 10.1007/s00726-014-1871-3

Pattekari, 2012, Prediction system for heart disease using Naïve Bayes, Int. J. Adv. Comp. Math. Sci., 3, 290

Aruna, 2011, Knowledge based analysis of various statistical tools in detecting breast cancer, CCSEA, CS IT, 02, 37

Kousarrizi, 2012, An experimental comparative study on thyroid disease diagnosis based on feature subset selection and classification, Int. J. Electric. Comput. Sci. IJECS-IJENS, 12

Rajkumar, 2010, Diagnosis of heart disease using data mining algorithm, Glob. J. Comput. Sci. Technol., 10

Polat, 2007, Automatic detection of heart disease using an artificial immune recognition system (AIRS) with fuzzy resource allocation mechanism and k-nn (nearest neighbour) based weighting preprocessing, Expert Syst. Appl., 32, 625, 10.1016/j.eswa.2006.01.027

Kurt, 2008, Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease, Expert Syst. Appl., 34, 366, 10.1016/j.eswa.2006.09.004

Liao, 2007, Logistic regression for disease classification using microarray data: model selection in a large pand small n case, Bioinformatics, 23, 1945, 10.1093/bioinformatics/btm287

Ma, 2006, Modified glomerular filtration rate estimating equation for Chinese patients with chronic kidney disease, J. Am. Soc. Nephrol., 10.1681/ASN.2006040368

Dubberke, 2008, Short- and long-term attributable costs of Clostridium difficile-associated disease in nonsurgical inpatients, Clin. Infect. Dis., 46, 497, 10.1086/526530

Knaus, 1985, APACHE II: a severity of disease classification system, Crit. Care Med., 3, 10.1097/00003246-198510000-00009

Wilson, 1998, Prediction of coronary heart disease using risk factor categories, Am. Heart Assoc., 1837

B. Ster, A. Dobnikar, Neural networks in medical diagnosis: comparison with other methods, in: Proceedings of the International Conference on Engineering Applications of Neural Networks, 1996, pp. 427–430.

Georgiou-Karistianis, 2013, Automated differentiation of pre-diagnosis Huntington’s disease from healthy control individuals based on quadratic discriminant analysis of the basal ganglia: the IMAGE-HD study, Neurobiol. Dis., 51, 82, 10.1016/j.nbd.2012.10.001

Zhang, 2015, Identification of protein coding regions in the human genome by quadratic discriminant analysis, Proc. Natl. Acad. Sci. U.S.A., 94, 10.1073/pnas.94.2.565

Maroco, 2011, Data mining methods in the prediction of dementia: a real-data comparison of the accuracy, sensitivity and specificity of linear discriminant analysis, logistic regression, neural networks, support vector machines, classification trees and random forests, BMC Res. Notes, 4, 299, 10.1186/1756-0500-4-299

Drent, 1993, Differences in BAL fluid variables in interstitial lung diseases evaluated by discriminant analysis, Eur. Respir. J., 10.1183/09031936.93.06060803

Srivastava, 2007, Bayesian quadratic discriminant analysis, J. Mach. Learn. Res., 8, 1277

Das, 2009, Effective diagnosis of heart disease through neural networks ensembles, Expert Syst. Appl., 7675, 10.1016/j.eswa.2008.09.013

Chen, 2011, A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis, Expert Syst. Appl., 38, 9014, 10.1016/j.eswa.2011.01.120

V.N. Chunekar, H.P. Ambulgekar, Approach of neural network to diagnose breast cancer on three different data set, in: International Conference on Advances in Recent Technologies in Communication and Computing, 2009, pp. 893–895.

Kahramanli, 2008, Design of a hybrid system for the diabetes and heart diseases, Expert Syst. Appl., 35, 82, 10.1016/j.eswa.2007.06.004

M.C. Tu, D. Shin, D. Shin, Effective diagnosis of heart disease through bagging approach, in: 2nd International Conference on Biomedical Engineering and Informatics, 2009, pp. 1–4.

Ricci, 2011