An approach towards missing data management using improved GRNN-SGTM ensemble method

Ivan Izonin1, Roman Tkachenko2, Volodymyr Verhun3, Khrystyna Zub4
1Department of Artificial Intelligence, Institute of Computer Sciences and Information Technologies, Lviv Polytechnic National University, Lviv, Ukraine
2Department of Publishing Information Technologies, Institute of Computer Sciences and Information Technologies, Lviv Polytechnic National University, Lviv, Ukraine
3Department of Automated Control Systems, Institute of Computer Sciences and Information Technologies, Lviv Polytechnic National University, Lviv, Ukraine
4Center of Information Support, Lviv Polytechnic National University, Lviv, Ukraine

Tài liệu tham khảo

Chukhrai, 2013, Diagnosing the efficiency of cost management of innovative processes at machine-building enterprises, Actual Problems of Economics., 146, 75 Chukhrai, 2013, Essence and classification of assessment methods for marketing strategies’ efficiency of cost-oriented enterprises, Actual Problems of Economics., 145, 118 Tkachenko, 2020, An Approach towards Increasing Prediction Accuracy for the Recovery of Missing IoT Data Based on the GRNN-SGTM Ensemble, Sensors., 20, 2625, 10.3390/s20092625 Wubetie, 2017, Missing data management and statistical measurement of socio-economic status: application of big data, J Big Data., 4, 1, 10.1186/s40537-017-0099-y Schlomer, 2010, Best practices for missing data management in counseling psychology, J Couns Psychol., 57, 1, 10.1037/a0018082 Boreiko, 2016, Developing a controller for registering passenger flow of public transport for the “smart” city system, Eastern-European Journal of Enterprise Technologies., 6, 40, 10.15587/1729-4061.2016.84143 Kang, 2013, The prevention and handling of the missing data, Korean J Anesthesiol., 64, 402, 10.4097/kjae.2013.64.5.402 N.B. Shakhovska, Y.J. Bolubash, O.M. Veres, Big data federated repository model, in: The Experience of Designing and Application of CAD Systems in Microelectronics, 2015: pp. 382–384. https://doi.org/10.1109/CADSM.2015.7230882. Babichev, 2018, An Evaluation of the Information Technology of Gene Expression Profiles Processing Stability for Different Levels of Noise Components, Data., 3, 48, 10.3390/data3040048 C.M. Salgado, C. Azevedo, H. Proença, S.M. Vieira, Missing Data, in: MIT Critical Data (Ed.), Secondary Analysis of Electronic Health Records, Springer International Publishing, Cham, 2016: pp. 143–162. https://doi.org/10.1007/978-3-319-43742-2_13. Handbook of Data Analysis, SAGE Publications Ltd. (2020). https://uk.sagepub.com/en-gb/eur/handbook-of-data-analysis/book209824 (accessed April 25, 2020). Hovorushchenko, 2018, Methodology of Evaluating the Sufficiency of Information for Software Quality Assessment According to ISO 25010, J. Inf. Organ. Sci. (Online), 42, 63, 10.31341/jios.42.1.4 McMahon, 2020, Approaches to Dealing With Missing Data in Railway Asset Management, IEEE Access, 8, 48177, 10.1109/ACCESS.2020.2978902 K. Zhong, Z. Song, P. Jain, P.L. Bartlett, I.S. Dhillon, Recovery Guarantees for One-hidden-layer Neural Networks, (2017). https://arxiv.org/abs/1706.03175v1 (accessed July 2, 2020). Tfwala, 2013, Prediction of Missing Flow Records Using Multilayer Perceptron and Coactive Neurofuzzy Inference System, ScientificWorldJournal., 2013, 10.1155/2013/584516 Worland, 2018, Improving predictions of hydrological low-flow indices in ungaged basins using machine learning, Environ. Modell. Software, 101, 169, 10.1016/j.envsoft.2017.12.021 B. Hsieh, T.C. Pratt, Field Data Recovery in Tidal System Using Artificial Neural Networks (ANNs), in: 2001. Nogueira, 2007, Comparison of Classifiers Efficiency on Missing Values Recovering: Application in a Marketing Database with Massive Missing Data, in, IEEE Symposium on Computational Intelligence and Data Mining, 2007, 66, 10.1109/CIDM.2007.368854 Silva-Ramírez, 2011, Missing value imputation on missing completely at random data using multilayer perceptrons, Neural Networks., 24, 121, 10.1016/j.neunet.2010.09.008 M. Smieja, Ł. Struski, J. Tabor, B. Zieliński, P. Spurek, Processing of missing data by neural networks, (2018). https://arxiv.org/abs/1805.07405v3 (accessed July 2, 2020). Tatar, 2015, Prediction of reservoir brine properties using radial basis function (RBF) neural network, Petroleum., 1, 349, 10.1016/j.petlm.2015.10.011 Wu, 2018, Missing data recovery using reconstruction in ocean wireless sensor networks, Comput. Commun., 132, 1, 10.1016/j.comcom.2018.09.007 Bodyanskiy, 2019, Fuzzy Clustering High-Dimensional Data Using Information Weighting, 385 Hu, 2019, Fuzzy Clustering of Incomplete Data by Means of Similarity Measures, 957 Subbotin, 2020, Radial-Basis Function Neural Network Synthesis on the Basis of Decision Tree, Opt. Mem. Neural Networks., 29, 7, 10.3103/S1060992X20010051 Carreira-Perpińn, 2011, Manifold Learning and Missing Data Recovery through Unsupervised Regression, 1014 Zhou, 2017, Recover Missing Sensor Data with Iterative Imputing Network, in, AAAI Workshops Turabieh, 2018, Dynamic L-RNN recovery of missing data in IoMT applications, Future Generation Computer Systems., 89, 575, 10.1016/j.future.2018.07.006 C. Shen Y. Zhang X. Guo X. Chen H. Cao J. Tang J. Li J. Liu Seamless GPS/Inertial Navigation System Based on Self-Learning Square-Root Cubature Kalman Filter IEEE Transactions on Industrial Electronics. 2020 1 1 10.1109/TIE.2020.2967671 Shen, 2019, Dual-optimization for a MEMS-INS/GPS system during GPS outages based on the cubature Kalman filter and neural networks, Mech. Syst. Sig. Process., 133, 10.1016/j.ymssp.2019.07.003 Oh, 2020, Convolutional neural network–based data recovery method for structural health monitoring, Structural Health Monitoring., 10.1177/1475921719897571 Du, 2019, A deep learning method for data recovery in sensor networks using effective spatio-temporal correlation data T. Ivek, D. Vlah, BlackBox: Generalizable Reconstruction of Extremal Values from Incomplete Spatio-Temporal Data, Undefined. (2020). /paper/BlackBox%3A-Generalizable-Reconstruction-of-Extremal-Ivek-Vlah/4466c4a53efd8c8d3794a138dc9ea86646007665 (accessed July 18, 2020). Lin, 2020, Data-driven missing data imputation in cluster monitoring system based on deep neural network, Appl Intell., 50, 860, 10.1007/s10489-019-01560-y Bodyanskiy, 2020, A Hybrid Cascade Neural Network with Ensembles of Extended Neo-Fuzzy Neurons and Its Deep Learning, 164 Bodyanskiy, 2019, A Hybrid Cascade Neuro-Fuzzy Network with Pools of Extended Neo-Fuzzy Neurons and its Deep Learning, Int. J. Appl. Math. Comput. Sci., 29, 477, 10.2478/amcs-2019-0035 Izonin, 2020, GRNN Approach Towards Missing Data Recovery Between IoT Systems, 445, 10.1007/978-3-030-29035-1_43 Specht, 1991, A general regression neural network, 2020 Al-Daoud, 2009, A Comparison Between Three Neural Network Models for Classification Problems, J. of Artificial Intelligence., 2, 56, 10.3923/jai.2009.56.64 Nishanth, 2013, A Computational Intelligence Based Online Data Imputation Method: An Application For Banking, J. Inf. Process. Syst., 9, 633, 10.3745/JIPS.2013.9.4.633 Zhang, 2020, Short-term wind speed prediction model based on GA-ANN improved by VMD, Renewable Energy, 156, 1373, 10.1016/j.renene.2019.12.047 I.A. Gheyas Novel Computationally Intelligent Machine Learning Algorithms for Data Mining and Knowledge Discovery 2009 Gheyas, 2010, A neural network-based framework for the reconstruction of incomplete data sets, Neurocomputing., 73, 3039, 10.1016/j.neucom.2010.06.021 Qiao, 2020, Application of generalized regression neural network optimized by fruit fly optimization algorithm for fracture toughness in a pearlitic steel, Eng. Fract. Mech., 235, 10.1016/j.engfracmech.2020.107105 Varanasi, 2020, A hybrid model of generalized regression neural network and radial basis function neural network for wind power forecasting in Indian wind farms, Journal of Statistics and Management Systems., 23, 49, 10.1080/09720510.2020.1721598 Izonin, 2019, An Extended-Input GRNN and its Application, Procedia Comput. Sci., 160, 578, 10.1016/j.procs.2019.11.044 Ravi, 2014, A new online data imputation method based on general regression auto associative neural network, Neurocomputing., 138, 106, 10.1016/j.neucom.2014.02.037 Al-Mahasneh, 2018, Evolving General Regression Neural Networks using Limited Incremental Evolution for Data-Driven Modeling of Non-linear Dynamic Systems, in, IEEE Symposium Series on Computational Intelligence (SSCI), 2018, 335, 10.1109/SSCI.2018.8628909 Bani-Hani, 2019, A Recursive General Regression Neural Network (R-GRNN) Oracle for classification problems, Expert Syst. Appl., 135, 273, 10.1016/j.eswa.2019.06.018 Ndaba, 2018, An Improved Generalized Regression Neural Network for Type II Diabetes Classification, 659 Vitynskiy, 2019, Aнcaмбль мepeж GRNN для poзв’язaння зaдaч peгpeciї з пiдвищeнoю тoчнicтю, Hayкoвий вicник HЛTУ Укpaїни., 29, 120 R. Tkachenko, I. Izonin, I. Dronyuk, M. Logoyda, P. Tkachenko, Recover Missing Sensor Data with GRNN-based Cascade Scheme, International Journal of Sensors, Wireless Communications and Control. (2020) 1–10. https://doi.org/(in press). Izonin, 2019, Recovery of Incomplete IoT Sensed Data using High-Performance Extended-Input Neural-Like Structure, Procedia Comput. Sci., 160, 521, 10.1016/j.procs.2019.11.054 UCI Machine Learning Repository: Air Quality Data Set, (n.d.). http://archive.ics.uci.edu/ml/datasets/air+quality (accessed March 17, 2019). De Vito, 2008, On field calibration of an electronic nose for benzene estimation in an urban pollution monitoring scenario, Sens. Actuators, B, 129, 750, 10.1016/j.snb.2007.09.060 Tkachenko, 2018, Development of the Non-Iterative Supervised Learning Predictor Based on the Ito Decomposition and SGTM Neural-Like Structure for Managing Medical Insurance Costs, Data., 3, 46, 10.3390/data3040046 Mishchuk, 2020, Missing Data Imputation Through SGTM Neural-Like Structure for Environmental Monitoring Tasks, 142 Tang, 2017, Random Forest Missing Data Algorithms, Stat Anal Data Min., 10, 363, 10.1002/sam.11348 Y. Freund, R.E. Schapire, A Short Introduction to Boosting, in: In Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, Morgan Kaufmann, 1999: pp. 1401–1406. Moayedi, 2020, Optimization of ANFIS with GA and PSO estimating α ratio in driven piles, Engineering with Computers, 36, 227, 10.1007/s00366-018-00694-w Zhou, 2020, Employing artificial bee colony and particle swarm techniques for optimizing a neural network in prediction of heating and cooling loads of residential buildings, J. Cleaner Prod., 254, 10.1016/j.jclepro.2020.120082 Moayedi, 2018, Optimizing an ANN model with ICA for estimating bearing capacity of driven pile in cohesionless soil, Engineering with Computers, 34, 347, 10.1007/s00366-017-0545-7