Feedback on a shared big dataset for intelligent TBM Part I: Feature extraction and machine learning methods

Underground Space (China) - Tập 11 - Trang 1-25 - 2023
Jian-Bin Li1, Zu-Yu Chen2, Xu Li3, Liu-Jie Jing4, Yun-Pei Zhang2, Hao-Han Xiao2, Shuang-Jing Wang4, Wen-Kun Yang5, Lei-Jie Wu3, Peng-Yu Li4, Hai-Bo Li3, Min Yao3, Li-Tao Fan6
1China Railway Group Co. Ltd., Beijing 100089, China
2Department of Geotechnical Engineering, China Institute of Water Resources and Hydropower Research, Beijing 100038, China
3Key Laboratory of Urban Underground Engineering, Ministry of Education, Beijing Jiaotong University, Beijing 100044, China
4China Railway Engineering Equipment Group Co., Ltd., Zhengzhou, Henan 450016, China
5School of Civil Engineering, Southeast University, Nanjing, 211189, China
6Xi'an University of Technology, Xi'an 710048, China

Tài liệu tham khảo

Altman, 1992, An introduction to kernel and nearest-neighbor nonparametric regression, The American Statistician, 46, 175 Ayawah, 2022, A review and case study of Artificial intelligence and Machine learning methods used for ground condition prediction ahead of tunnel boring Machines, Tunnelling and Underground Space Technology, 125, 10.1016/j.tust.2022.104497 Bao, 2019, Computer vision and deep learning–based data anomaly detection method for structural health monitoring, Structural Health Monitoring, 18, 401, 10.1177/1475921718757405 Barton, 2000 Benesty, J., Chen, J., Huang, Y., & Cohen, I. (2009). Pearson correlation coefficient. In J. Benesty, & W. Kellermann, Spriner Topics in Signal Processing, (Vol. 2): Noise reduction in speech processing (pp. 1–4). Springer, Berlin, Heidelberg. Benhar, 2020, Data preprocessing for heart disease classification: A systematic literature review, Computer Methods and Programs in Biomedicine, 195, 10.1016/j.cmpb.2020.105635 Bo, 2022, Real-time hard-rock tunnel prediction model for rock mass classification using CatBoost integrated with Sequential Model-Based Optimization, Tunnelling and Underground Space Technology, 124, 10.1016/j.tust.2022.104448 Booth, 2006, Hydrologic variability of the Cosumnes River floodplain. San Francisco Estuary and Watershed, Science, 4, Article 2 Breiman, 2001, Random forests. Machine learning, 45, 5, 10.1023/A:1010933404324 Bruland, A. (1998). Hard Rock Tunnel Boring. [Doctoral dissertation, Norwegian University of Science and Technology]. Trondheim, Norway. Carter, 2009, A comparison of two boxplot methods for detecting univariate outliers which adjust for sample size and asymmetry, Statistical Methodology, 6, 604, 10.1016/j.stamet.2009.07.001 Chandrashekar, 2014, A survey on feature selection methods, Computers & Electrical Engineering, 40, 16, 10.1016/j.compeleceng.2013.11.024 Chawla, 2002, SMOTE: Synthetic minority over-sampling technique, Journal of artificial intelligence research, 16, 321, 10.1613/jair.953 Chen, 2020 Chen, 2021, Diagnosing tunnel collapse sections based on TBM tunneling big data and deep learning: A case study on the Yinsong Project, China. Tunnelling and Underground Space Technology, 108 Cortes, 1995, Support-vector networks. Machine learning, 20, 273 Cunningham, 2008, Supervised learning, 21 Dai, 1992 De'ath, 2000, Classification and regression trees: A powerful yet simple technique for ecological data analysis, Ecology, 81, 3178, 10.1890/0012-9658(2000)081[3178:CARTAP]2.0.CO;2 Delisio, 2013, Analysis and prediction of TBM performance in blocky rock conditions at the Lötschberg Base Tunnel, Tunnelling and Underground Space Technology, 33, 131, 10.1016/j.tust.2012.06.015 Donoho, 1995, De-noising by soft-thresholding, IEEE transactions on information theory, 41, 613, 10.1109/18.382009 Fawcett, 2006, An introduction to ROC analysis, Pattern recognition letters, 27, 861, 10.1016/j.patrec.2005.10.010 Feng, 2021, Tunnel boring machines (TBM) performance prediction: A case study using big data and deep learning, Tunnelling and Underground Space Technology, 110, 10.1016/j.tust.2020.103636 Fix, 1989, Discriminatory analysis. Nonparametric discrimination: Consistency properties, International Statistical Review/Revue Internationale de Statistique, 57, 238 Freund, Y., & Schapire, R. E. (1996). Experiments with a new boosting algorithm. In Proceedings of the Thirteenth International Conference on Machine Learning, 96, 148–156. Friedman, 2001, Greedy function approximation: A gradient boosting machine, Annals of statistics, 29, 1189, 10.1214/aos/1013203451 Gong, 2006, Numerical modelling of the effects of joint spacing on rock fragmentation by TBM cutters, Tunnelling and Underground Space Technology, 21, 46, 10.1016/j.tust.2005.06.004 Gong, 2006, Numerical simulation of rock fragmentation process induced by two TBM cutters and cutter spacing optimization, Tunnelling and Underground Space Technology, 21, 263, 10.1016/j.tust.2005.12.124 Gong, 2007, In situ TBM penetration tests and rock mass boreability analysis in hard rock tunnels, Tunnelling and underground space technology, 22, 303, 10.1016/j.tust.2006.07.003 Gong, 2021, Development and application of cutterhead working status monitoring system for shield TBM tunnelling, Rock Mechanics and Rock Engineering, 54, 1731, 10.1007/s00603-021-02380-8 Goutte, 2005, A probabilistic interpretation of precision, recall and F-score, with implication for evaluation, 345 Guo, 2022, Intelligent assistant driving method for tunnel boring machine based on big data, Acta Geotechnica, 17, 1019, 10.1007/s11440-021-01327-1 Guo, 2022, Advance prediction of collapse for TBM tunneling using deep learning method, Engineering Geology, 299, 10.1016/j.enggeo.2022.106556 2008, Vol. 207 Han, H., Wang, W. Y., & Mao, B. H. (2005, August). Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning. In International conference on intelligent computing (pp. 878–887). Springer, Berlin, Heidelberg. Hassanpour, 2010, TBM performance analysis in pyroclastic rocks: A case history of Karaj water conveyance tunnel, Rock mechanics and rock engineering, 43, 427, 10.1007/s00603-009-0060-2 Hastie, 2009, Multi-class adaboost. Statistics and its, Interface, 2, 349 He, 2009, Learning from imbalanced data, IEEE Transactions on Knowledge and Data Engineering, 21, 1263, 10.1109/TKDE.2008.239 Hinton, 2006, A fast learning algorithm for deep belief nets, Neural computation, 18, 1527, 10.1162/neco.2006.18.7.1527 Hinton, 2006, Reducing the dimensionality of data with neural networks, science, 313, 504, 10.1126/science.1127647 Hochreiter, 1997, Long short-term memory, Neural computation, 9, 1735, 10.1162/neco.1997.9.8.1735 Hou, 2020, Prediction of TBM tunnelling parameters based on IPSO-BP hybrid model, Chinese Journal of Rock Mechanics and Engineering, 39, 1648 Hou, 2022, Early warning of tunnel collapse based on Adam-optimised long short-term memory network and TBM operation parameters, Engineering Applications of Artificial Intelligence, 112, 10.1016/j.engappai.2022.104842 Hou, 2022, Real-time prediction of rock mass classification based on TBM operation big data and stacking technique of ensemble learning, Journal of Rock Mechanics and Geotechnical Engineering, 14, 123, 10.1016/j.jrmge.2021.05.004 Huang, 2021, BIM, machine learning and computer vision techniques in underground construction: Current status and future perspectives, Tunnelling and Underground Space Technology, 108, 10.1016/j.tust.2020.103677 Huang, 2022, Development and in-situ application of a real-time cutting tool forces monitoring system in TBM tunnelling, Tunnelling and Underground Space Technology, 124, 10.1016/j.tust.2022.104453 Jiao, M. (2017). Test and analysis of the first large-diameter open type TBM performance. [Master thesis, Shijiazhuang Tiedao University]. Shijiazhuang, China. (in Chinese). Jing, 2019, A case study of TBM performance prediction using field tunnelling tests in limestone strata, Tunnelling and Underground Space Technology, 83, 364, 10.1016/j.tust.2018.10.001 Jing, 2021, A TBM advance rate prediction method considering the effects of operating factors, Tunnelling and Underground Space Technology, 107, 10.1016/j.tust.2020.103620 Khalid, 2014, A survey of feature selection and feature extraction techniques in machine learning, 372 Klein, S., Schmoll, M., & Avery, T. (1995). TBM performance at four hard rock tunnels in California. In Proceedings of the Rapid Excavation and Tunneling Conference (pp. 61–76). Society for Mining, Metallogy & Exploration, Inc. Kleinlein, 2019, Persistence of data-driven knowledge to predict breast cancer survival, International Journal of Medical Informatics, 129, 303, 10.1016/j.ijmedinf.2019.06.018 LeCun, 2015, Deep learning, Nature, 521, 436, 10.1038/nature14539 Li, 2016, Application of outlier mining in insider identification based on boxplot method, Procedia Computer Science, 91, 245, 10.1016/j.procs.2016.07.069 Li, 2018, Mechanism and numerical analysis of cutting rock and soil by TBM cutting tools, Tunnelling and Underground Space Technology, 81, 428, 10.1016/j.tust.2018.08.015 Li, 2020 Li, 2020, TBM tunneling parameters prediction method based on clustering classification of rock mass, Chinese Journal of Rock Mechanics and Engineering, 39, 3326 Li, 2021, TBM tunneling parameters prediction based on Locally Linear Embedding and Support Vector Regression, Journal of Zhejiang University (Engineering Science), 55, 1426 Li, 2021, Advanced prediction of tunnel boring machine performance based on big data, Geoscience Frontiers, 12, 331, 10.1016/j.gsf.2020.02.011 Liang, J. (2020). Construction of TBM operation parameter decision optimization model based on genetic algorithm. [Master thesis, Dalian University of Technology]. Dalian, China. (in Chinese). Likas, 2003, The global k-means clustering algorithm, Pattern recognition, 36, 451, 10.1016/S0031-3203(02)00060-2 Liu, 2019, Improved support vector regression models for predicting rock mass parameters using tunnel boring machine driving data, Tunnelling and Underground Space Technology, 91, 10.1016/j.tust.2019.04.014 Liu, 2020, Prediction of rock mass parameters in the TBM tunnel based on BP neural network integrated simulated annealing algorithm, Tunnelling and Underground Space Technology, 95, 10.1016/j.tust.2019.103103 Liu, 2021, Intelligent decision method for main control parameters of tunnel boring machine based on multi-objective optimization of excavation efficiency and cost, Tunnelling and Underground Space Technology, 116, 10.1016/j.tust.2021.104054 1998, Vol. 453 Liu, H. (2021). Analysis of characteristics of TBM tunneling data for Yinsong project. [Master thesis, Beijing Jiaotong University]. Beijing, China. (in Chinese). Liu, 2019, Background classification method based on deep learning for intelligent automotive radar target detection, Future Generation Computer Systems, 94, 524, 10.1016/j.future.2018.11.036 Liu, 2021, Back analysis of the TBM collapse section based on convolutional neural networks, Chinese Journal of Solid Mechanics, 42, 287 Liu, 2021, Hard-rock tunnel lithology prediction with TBM construction big data using a global-attention-mechanism-based LSTM network, Automation in Construction, 125, 10.1016/j.autcon.2021.103647 Loh, 2011, Classification and regression trees, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1, 14 Loh, 2014, Fifty years of classification and regression trees, International Statistical Review, 82, 329, 10.1111/insr.12016 Luo, 2018, Advance rate of TBM based on field boring data, Journal of Zhejiang University (Engineering Science), 52, 1566 Mao, 2021, Identification of TBM surrounding rock based on Markov process and deep neural network, Journal of Zhejiang University (Engineering Science), 55 Miao, 2016, A survey on feature selection, Procedia Computer Science, 91, 919, 10.1016/j.procs.2016.07.111 Ng, A., Jordan, M., & Weiss, Y. (2001). On spectral clustering: Analysis and an algorithm. In Proceedings of 14th International Conference on Neural Information Processing Systems (NIPS 2001), 849–856. Ong, 2016, Dynamically pre-trained deep recurrent neural networks using environmental monitoring data for predicting PM2. 5, Neural Computing and Applications, 27, 1553, 10.1007/s00521-015-1955-3 Ozdemir, L. (1977). Development of theoretical equations for predicting tunnel boreability. [Doctoral dissertation, Colorado School of Mines]. Colorado, USA. Qi, 2019, Key technology of overlength pressure tunnel in water supply project in Central City of Jilin Province, Tunnel Construction, 39, 684 Ren, 2022, Genetic algorithm-assisted an improved AdaBoost double-layer for oil temperature prediction of TBM, Advanced Engineering Informatics, 52, 10.1016/j.aei.2022.101563 Ren, P. (2019). Research on the Data Cleansing Methods for Bridge Monitoring Data Based on Big-Data Platform. [Master thesis, Southeast University]. Nanjing, China. (in Chinese). Rodriguez, 2009, Sensitivity analysis of k-fold cross validation in prediction error estimation, IEEE transactions on pattern analysis and machine intelligence, 32, 569, 10.1109/TPAMI.2009.187 Rostami, J. (1997). Development of a force estimation model for rock fragmentation with disc cutters through theoretical modeling and physical measurement of crushed zone pressure. [Doctoral dissertation, Golden: Colorado School of Mines]. Colorado, USA. Rostami, 1993, New model for performance production of hard rock TBMs, Proceedings - Rapid Excavation and Tunneling Conference, 793 Sagi, 2018, Ensemble learning: A survey, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 8, e1249 Salimi, 2018, Examining feasibility of developing a rock mass classification for hard rock TBM application using non-linear regression, regression tree and generic programming, Geotechnical and Geological Engineering, 36, 1145 Sanio, 1985, Prediction of the performance of disc cutters in anisotropic rock, International Journal of Rock Mechanics and Mining Science & Geomechanics Abstracts, 22, 153, 10.1016/0148-9062(85)93229-2 Shahrour, 2021, Use of soft computing techniques for tunneling optimization of tunnel boring machines, Underground Space, 6, 233, 10.1016/j.undsp.2019.12.001 Sheil, B. B., Suryasentana, S. K., Mooney, M. A., & Zhu, H. (2020). Machine learning to inform tunnelling operations: Recent advances and future trends. IN Proceedings of the Institution of Civil Engineers-Smart Infrastructure and Construction, 173(4), 74–95. Shen, T., Zhou, T., Long, G., Jiang, J., Pan, S., & Zhang, C. (2018). Disan: Directional self-attention network for rnn/cnn-free language understanding. In Proceedings of the AAAI conference on artificial intelligence (Vol. 32, No. 1). Sun, 2018, Dynamic load prediction of tunnel boring machine (TBM) based on heterogeneous in-situ data, Automation in Construction, 92, 23, 10.1016/j.autcon.2018.03.030 Sun, 2009, Classification of imbalanced data: A review, International Journal of Pattern Recognition and Artificial Intelligence, 23, 687, 10.1142/S0218001409007326 Tang, 2019, Convolutional neural network-based data anomaly detection method using multiple information for structural health monitoring, Structural Control and Health Monitoring, 26, e2296, 10.1002/stc.2296 Tareen, 2019, Descriptive analysis and earthquake prediction using boxplot interpretation of soil radon time series data, Applied Radiation and Isotopes, 154, 10.1016/j.apradiso.2019.108861 Ministry of Water Resources of the People’s Republic of China (2009). Code for Engineering Geological Investigation of Water Resources and Hydropower, GB50487–2008. China Planning Press, Beijing, p. 262. (in Chinese). Thiyagalingam, 2022, Scientific machine learning benchmarks, Nature Reviews Physics, 4, 413, 10.1038/s42254-022-00441-7 Tuzlukov, 2018 von Luxburg, 2007, A tutorial on spectral clustering, Statistics and computing, 17, 395, 10.1007/s11222-007-9033-z Wang, C. (2018). Research of the relationship of rock type and machine parameters of TBM and its optimal decision method. [Master dissertation, ZheJiang University]. Zhejiang, China. (in Chinese). Wang, 2020, A mechanical method for predicting TBM penetration rates, Arabian Journal of Geosciences, 13, 1, 10.1007/s12517-020-05305-x Wang, 2021, Dynamic prediction of mechanized shield tunneling performance, Automation in Construction, 132, 10.1016/j.autcon.2021.103958 Wang, 2021, Big Data-Based Boring Indexes and Their Application during TBM Tunneling, Advances in Civil Engineering, 4, 1 Wu, 2021, Real-time rock mass condition prediction with TBM tunneling big data using a novel rock–machine mutual feedback perception method, Journal of Rock Mechanics and Geotechnical Engineering, 13, 1311, 10.1016/j.jrmge.2021.07.012 Xiao, 2021, Prediction of Shield Machine Attitude Based on Various Artificial Intelligence Technologies, Applied Sciences, 11, 10264, 10.3390/app112110264 Xiao, 2022, Prediction of shield machine posture using the GRU algorithm with adaptive boosting: A case study of Chengdu Subway project, Transportation Geotechnics, 37, 10.1016/j.trgeo.2022.100837 Xiao, 2022, Significance and methodology: Preprocessing the big data for machine learning on TBM performance, Underground Space, 7, 680, 10.1016/j.undsp.2021.12.003 Xu, 2021, Prediction of tunnel boring machine operating parameters using various machine learning algorithms, Tunnelling and Underground Space Technology, 109, 10.1016/j.tust.2020.103699 Xue, 2021, Rock fragmentation induced by a TBM disc-cutter considering the effects of joints: A numerical simulation by DEM, Computers and Geotechnics, 136, 10.1016/j.compgeo.2021.104230 Yagiz, S. (2002). Development of rock fracture and brittleness indices to quantify the effects of rock mass features and toughness in the CSM Model basic penetration for hard rock tunneling machines. [Doctoral dissertation, Colorado School of Mines]. Colorado, USA. Yao, 2006, Anomaly intrusion detection approach using hybrid MLP/CNN neural network, Vol. 2, 1095 Yin, 2022, Perception model of surrounding rock geological conditions based on TBM operational big data and combined unsupervised-supervised learning, Tunnelling and Underground Space Technology, 120, 10.1016/j.tust.2021.104285 Zhang, 2017, Artificial intelligence in the post deep learning era, Hangzhou S&T Magazine., 48, 41 Zhang, 2019, Prediction method of rock mass parameters based on tunnelling process of tunnel boring machine, Journal of Zhejiang University (Engineering Science), 53, 1977 Zhang, 2019, Prediction of geological conditions for a tunnel boring machine using big operational data, Automation in Construction, 100, 73, 10.1016/j.autcon.2018.12.022 Zhang, 2021, Modeling of Tunneling Total Loads Based on Symbolic Regression Algorithm, Applied Sciences, 11, 5671, 10.3390/app11125671 Zhang, 2023, Cross-project prediction for rock mass using shuffled TBM big dataset and knowledge-based machine learning methods, Science China Technological Sciences, 2023, 1 Zhang, 2021, TBM excavation parameter prediction model based on LS-SVM method, Journal of Hohai University (Natural Sciences), 49, 373 Zhou, 2020, Predicting boring parameters of TBM stable stage based on BLSTM networks combined with attention mechanism, Chinese Journal of Rock Mechanics and Engineering, 39, 3505 Zhou, 2022, Rock chip properties of TBM penetration in jointed rock masses based on an improved DICE2D simulation, Rock Mechanics and Rock Engineering, 55, 7547, 10.1007/s00603-022-03025-0 Zhu, 2020, Study on CART-based ensemble learning algorithms for predicting TBM tunneling parameters and classing surrounding rock masses, Chinese Journal of Rock Mechanics and Engineering, 39, 1860 Zhu, 2021, Performance Evaluation Indicator (PEI): A new paradigm to evaluate the competence of machine learning classifiers in predicting rockmass conditions, Advanced Engineering Informatics, 47, 10.1016/j.aei.2020.101232 Zuo, 2006, BDPCA plus LDA: A novel fast feature extraction technique for face recognition, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 36, 946, 10.1109/TSMCB.2005.863377