Multiclass anomaly detection in imbalanced structural health monitoring data using convolutional neural network

Mengchen Zhao1, Ayan Sadhu2, Miriam A. M. Capretz1
1Department of Electrical and Computer Engineering, Western University, London, Ontario, Canada
2Department of Civil and Environmental Engineering, Western University, London, Ontario, Canada

Tóm tắt

AbstractStructural health monitoring (SHM) system aims to monitor the in-service condition of civil infrastructures, incorporate proactive maintenance, and avoid potential safety risks. An SHM system involves the collection of large amounts of data and data transmission. However, due to the normal aging of sensors, exposure to outdoor weather conditions, accidental incidences, and various operational factors, sensors installed on civil infrastructures can get malfunctioned. A malfunctioned sensor induces significant multiclass anomalies in measured SHM data, requiring robust anomaly detection techniques as an essential data cleaning process. Moreover, civil infrastructure often has imbalanced anomaly data where most of the SHM data remain biased to a certain type of anomalies. This imbalanced time-series data causes significant challenges to the existing anomaly detection methods. Without proper data cleaning processes, the SHM technology does not provide useful insights even if advanced damage diagnostic techniques are applied. This paper proposes a hyperparameter-tuned convolutional neural network (CNN) for multiclass imbalanced anomaly detection (CNN-MIAD) modelling. The hyperparameters of the proposed model are tuned through a random search algorithm to optimize the performance. The effect of balancing the database is considered by augmenting the dataset. The proposed CNN-MIAD model is demonstrated with a multiclass time-series of anomaly data obtained from a real-life cable-stayed bridge under various cases of data imbalances. The study concludes that balancing the database with a time shift window to increase the database has generated the optimum results, with an overall accuracy of 97.74%.

Từ khóa


Tài liệu tham khảo

Almasri N, Sadhu A, Ray Chaudhuri S (2020) Towards compressed sensing of structural monitoring data using discrete cosine transform. ASCE J Comput Eng 34(1):04019041

Arul, M. and Kareem, A (2020) Data anomaly detection for structural health monitoring of bridges using Shapelet transform. ArXiv

Bao Y, Chen Z, Wei S, Xu Y, Tang Z, Li H (2019) The state of the art of data science and engineering in structural health monitoring. Engineering. 5(2):234–242

Bao Y, Tang Z, Li H, Zhang Y (2018) Computer vision and deep learning-based data anomaly detection method for structural health monitoring. Structur Health Monit 18(2):401–421

Barbosh M, Sadhu A, Sankar G (2021) Time-frequency decomposition assisted improved localization of proximity of damage using acoustic sensors. Smart Mater Struct 30(2):025021

Bergstra J, Bengio Y (2012) Random search for hyper-parameter optimization. J Mach Learn Res 13(10):281–305

Cawley P (2018) Structural health monitoring: closing the gap between research and industrial deployment. Struct Health Monit 17(5):1225–1244

Chen J, Pi D, Wu Z, Zhao X, Pan Y, Zhang Q (2021) Imbalanced satellite telemetry data anomaly detection model based on Bayesian LSTM. Acta Astronautica 180:232–242

Ciresan D, Meier U, Masci J, Gambardella L, Schmidhuber J (2011) Flexible, high-performance convolutional neural networks for image classification. International Joint Conference on Artificial Intelligence IJCAI–2011

Das S, Saha P (2018) A review of some advanced sensors used for health diagnosis of civil engineering structures. Measurement 129:68–90

Debnath, A., Waghmare, G., Wadhwa, H., Asthana, S. and Arora, A. (2021). Exploring generative data augmentation in multivariate time series forecasting : opportunities and challenges. MileTS’21: 7th KDD Workshop on Mining and Learning from Time Series

Delgado J, Oyedele L (2019) Deep learning with small datasets: using autoencoders to address limited datasets in construction management. Appl Soft Comput 112:107836

Dong C, Catbas N (2020) A review of computer vision-based structural health monitoring at local and global levels. Struct Health Monit 20(2):692–743

Duman E, Tolan Z (2021) Comparing popular CNN models for an imbalanced dataset of dermoscopic images. J Comput Science IDAP-2021:192–207

Galar, M., Fernandez, A., Barrenechea, E., Bustince H. and Herrera, F. A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews). 42(4)

Heaton, J. (2008). Introduction to neural networks with Java. Heaton Research, Inc. Second Edition. Pg 158

Hutter, F., Kotthoff, L. and Vanschoren, J. (2019). Automated machine learning: methods, systems, challenges. The springer series on challenges in machine learning. Chapter 1

Islam A, Belhaouari SB, Rehman AU, Bensmail H (2022) KNNOR: an oversampling technique for imbalanced datasets. Appl Soft Comput 115:108288

Jain M, Kaur G, Saxena V (2022) A K-means clustering and SVM based hybrid concept drift detection technique for network anomaly detection. Expert Syst Appl 193:116510

Jana D, Patil J, Herkal S, Nagarajaiah S, Duenas-Osorio L (2022) CNN and convolutional autoencoder (CAE) based real-time sensor fault detection, localization, and correction. Mech Syst Signal Process 169:108723

Jeong S, Ferguson M, Law K (2019) Sensor data reconstruction and anomaly detection using bidirectional recurrent neural network. Sensors Smart Struct Technol Civil Mechan Aerospace Syst 2019:10970

Kuhn M, Johnson K (2016) Applied predictive modelling. Springer Nature

Kullaa J (2013) Detection, identification, and quantification of sensor fault in a sensor network. Mech Syst Signal Process 40:208–221

Kaartinen E, Dunphy K, Sadhu A (2022) LiDAR-based structural health monitoring: applications in civil infrastructure systems. Sensors. 22(8):4610

Li H, Ou J (2016) The state-of-the-art in structural health monitoring of cable-stayed bridges. J Civ Struct Heal Monit 6(1):43–67

Liu G, Niu Y, Zhao W, Duan Y, Shu J (2022) Data anomaly detection for structural health monitoring using a combination network of GANomaly and CNN. Smart Struct Syst 29(1):5362

Maes K, Salens W, Feremans G, Segher K, François S (2022) Anomaly detection in long-term tunnel deformation monitoring. Eng Struct 250:113383

Mao J, Wang H, Spencer B (2020) Toward data anomaly detection for automated structural health monitoring: exploiting generative adversarial nets and autoencoders. Struct Health Monit 20(4):1609–1626

Montgomery D (2013) Design and analysis of experiments, 8th edn. John Wiley & Sons, Inc. Chapter 5

Neves A, Gonzalez I, Leander J, Karoumi R (2017) Structural health monitoring of bridges: a model-free ANN-based approach to damage detection. J Civ Struct Heal Monit 7(5)

Ni F, Zhang J, Noori M (2019) Deep learning for data anomaly detection and data compression of a long-span suspension bridge. Comput Aid Civil Infrastruct Eng 35(7):685–700

Panda B. (2019). A survey on application of population-based algorithm on Hyperparameter selection

Phung VH, Rhee EJ (2018) A deep learning approach for classification of cloud image patches on small datasets. J Inform Commun Converg Eng 16(3):173–178

Ranyal E, Sadhu A, Jain K (2022) Road condition monitoring using smart sensing and artificial intelligence: a review. Sensors. 22(8):3044

Rawat W, Wang Z (2017) Deep convolutional neural networks for image classification: a comprehensive review. Neural Comput 29(9):2352–2449

Sai B, Anita CS, Rajalakshmi D, Berlin MA (2021) A CNN-based facial expression recognizer. Materials Today 37(2):2578–2581

Sarmadi H, Karamodin A (2020) Novel anomaly detection method based on adaptive Mahalanobis-squared distance and one-class kNN rule for structural health monitoring under environmental effects. Mech Syst Signal Process 140:106495

Scherer, D., Müller, A. and Behnke, S. (2010). Evaluation of pooling operations in convolutional architectures for object recognition. 20th International Conference on Artificial Neural Networks

Shahriari B, Swersky K, Wang Z, Adams RP, De Freitas N (2016) Taking the human out of the loop: a review of Bayesian optimization. Proc IEEE 104(1)

Shorten C, Khoshgoftaar TM (2019) A survey on image data augmentation for deep learning. J Big Data 6(60)

Singh P, Sadhu A (2022) A hybrid time-frequency method for robust drive-by modal identification of bridges. Eng Struct 266:114624

Singh P, Keyvanlou M, Sadhu A (2021) An improved time-varying empirical mode decomposition for structural condition assessment using limited sensors. Eng Struct 232:111882

Snoek J, Larochelle H, Adams RP (2012) Practical Bayesian optimization of machine learning algorithms. ArXiv

Sony S, Dunphy K, Sadhu A, Capretz M (2021) A systematic review of convolutional neural network-based structural condition assessment techniques. Eng Struct 226:111347

Sony S, Laventure S, Sadhu A (2019) A literature review of next-generation smart sensing technology in structural health monitoring. Struct Control Health Monit 26(3):e2321

Steiner, M., Fritz, H., Thiebes, L. and Dragos, K. (2018). Fault diagnosis in wireless structural health monitoring systems based on support vector regression

Tang Z, Chen Z, Bao Y, Li H (2018) Convolutional neural network-based data anomaly detection method using multiple information for structural health monitoring. Struct Control Health Monit 26(1):e2296

Thiyagarajan, K., Kodagoda, S. and Nguyen, L. (2017). Predictive analytics for detecting sensor failure using autoregressive integrated moving average model. 12th IEEE Conference on Industrial Electronics and Applications

Tian Y, Zhang Y (2022) A comprehensive survey on regularization strategies in machine learning. Inform Fusion 80:146–166

Wang H, Li L, Song G, Dabney JB, Harman TL (2015) A new approach to deal with sensor errors in structural controls with MR damper. Smart Struct Syst 16(2)

Wang Y, Zhu Y, Lou G, Zhang P, Chen J, Li J (2021) A maintenance hemodialysis mortality prediction model based on anomaly detection using longitudinal hemodialysis data. J Biomed Inform 123:103930

Wedel F, Marx S (2022) Application of machine learning methods on real bridge monitoring data. Eng Struct 250:113365

Yang J, Yang F, Zhang L, Li R, Jiang S, Wang G, Zhang L, Zeng Z (2021) Bridge health anomaly detection using deep support vector data description. Struct Health Monit 20(6):170–178

Zeiler M (2012) Adadelta: an adaptive learning rate method. ArXiv

Zhang W, Li C, Peng G, Chen Y, Zhang Z (2018) A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load. Mech Syst Signal Process 100:439–453

Zhu Y, Ni Y, Jin H, Inaudi D, Laory I (2019) A temperature-driven MPCA method for structural anomaly detection. Eng Struct 190:447–458