Improving Deep Learning-based Plant Disease Classification with Attention Mechanism
Tóm tắt
In recent years, deep learning-based plant disease classification has been widely developed. However, it is challenging to collect sufficient annotated image data to effectively train deep learning models for plant disease recognition. The attention mechanism in deep learning assists the model to focus on the informative data segments and extract the discriminative features of inputs to enhance training performance. This paper investigates the Convolutional Block Attention Module (CBAM) to improve classification with CNNs, which is a lightweight attention module that can be plugged into any CNN architecture with negligible overhead. Specifically, CBAM is applied to the output feature map of CNNs to highlight important local regions and extract more discriminative features. Well-known CNN models (i.e. EfficientNetB0, MobileNetV2, ResNet50, InceptionV3, and VGG19) were applied to do transfer learning for plant disease classification and then fine-tuned by a publicly available plant disease dataset of foliar diseases in pear trees called DiaMOS Plant. Amongst others, this dataset contains 3006 images of leaves affected by different stress symptoms. Among the tested CNNs, EfficientNetB0 has shown the best performance. EfficientNetB0+CBAM has outperformed EfficientNetB0 and obtained 86.89% classification accuracy. Experimental results show the effectiveness of the attention mechanism to improve the recognition accuracy of pre-trained CNNs when there are few training data.
Từ khóa
Tài liệu tham khảo
Alirezazadeh P, Rahimi-Ajdadi F, Abbaspour-Gilandeh Y et al (2021) Improved digital image-based assessment of soil aggregate size by applying convolutional neural networks. Comput Electron Agric 191:499. https://doi.org/10.1016/j.compag.2021.106499
Alirezazadeh P, Dornaika F, Moujahid A (2022) Deep learning with discriminative margin loss for cross-domain consumer-to-shop clothes retrieval. Sensors 22(7):2660. https://doi.org/10.3390/s22072660
Ayinde BO, Inanc T, Zurada JM (2019) Redundant feature pruning for accelerated inference in deep neural networks. Neural Networks 118:148–158. https://doi.org/10.1016/j.neunet.2019.04.021
de Camargo T, Schirrmann M, Landwehr N et al (2021) Optimized deep learning model as a basis for fast uav mapping of weed species in winter wheat crops. Remote Sens 13(9):1704. https://doi.org/10.3390/rs13091704
Chen L, Zhang H, Xiao J et al (2017) SCA-CNN: Spatial and channel-wise attention in convolutional networks for image captioning, pp 6298–6306 https://doi.org/10.1109/CVPR.2017.667
Fenu G, Malloci FM (2021) Diamos plant: A dataset for diagnosis and monitoring plant disease. Agronomy 11(11):2107. https://doi.org/10.3390/agronomy11112107
Ferentinos KP (2018) Deep learning models for plant disease detection and diagnosis. Comput Electron Agric 145:311–318. https://doi.org/10.1016/j.compag.2018.01.009
Fuentes A, Yoon S, Kim SC et al (2017) A robust deep-learning-based detector for real-time tomato plant diseases and pests recognition. Sensors 17(9):2022. https://doi.org/10.3390/s17092022
Fuentes A, Yoon S, Park DS (2019) Deep learning-based phenotyping system with glocal description of plant anomalies and symptoms. Front Plant Sci 10:1321. https://doi.org/10.3389/fpls.2019.01321
He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 https://doi.org/10.1109/CVPR.2016.90
He K, Girshick R, Dollár P (2019) Rethinking imagenet pre-training. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 4918–4927 https://doi.org/10.1109/ICCV.2019.00502
Howard AG, Zhu M, Chen B et al (2017) Mobilenets: Efficient convolutional neural networks for mobile vision applications (arXiv preprint arXiv:170404861)
Kumar M, Gupta P, Madhav P et al (2020) Disease detection in coffee plants using convolutional neural network. In: 2020 5th International Conference on Communication and Electronics Systems (ICCES). IEEE, pp 755–760 https://doi.org/10.1109/ICCES48766.2020.9138000
Lee SH, Goeau H, Bonnet P et al (2020a) Attention-based recurrent neural network for plant disease classification. Front Plant Sci 11:1897. https://doi.org/10.3389/fpls.2020.601250
Lee SH, Goeau H, Bonnet P et al (2020b) New perspectives on plant disease characterization based on deep learning. Comput Electron Agric 170:105220. https://doi.org/10.1016/j.compag.2020.105220
Liu J, Wang X (2020) Tomato diseases and pests detection based on improved yolo v3 convolutional neural network. Front Plant Sci 11:898. https://doi.org/10.3389/fpls.2020.00898
Liu B, Zhang Y, He D et al (2017) Identification of apple leaf diseases based on deep convolutional neural networks. Symmetry 10(1):11. https://doi.org/10.3390/sym10010011
Liu B, Ding Z, Tian L et al (2020) Grape leaf disease identification using improved deep convolutional neural networks. Front Plant Sci 11:1082. https://doi.org/10.3389/fpls.2020.01082
Mohanty SP, Hughes DP, Salathé M (2016) Using deep learning for image-based plant disease detection. Front Plant Sci 7:1419. https://doi.org/10.3389/fpls.2016.01419
Pedregosa F, Varoquaux G, Gramfort A et al (2011) Scikit-learn: Machine learning in python. J Mach Learn Res 12:2825–2830
Raghu M, Zhang C, Kleinberg J et al (2019) Transfusion: Understanding transfer learning for medical imaging. In: Advances in neural information processing systems, p 32
Ramcharan A, Baranowski K, McCloskey P et al (2017) Deep learning for image-based cassava disease detection. Front Plant Sci 8:1852. https://doi.org/10.3389/fpls.2017.01852
Russakovsky O, Deng J, Su H et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252. https://doi.org/10.1007/s11263-015-0816-y
Sandler M, Howard A, Zhu M et al (2018) Mobilenetv2: Inverted residuals and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4510–4520 https://doi.org/10.1109/CVPR.2018.00474
Schirrmann M, Landwehr N, Giebel A et al (2021) Early detection of stripe rust in winter wheat using deep residual neural networks. Front Plant Sci 12:475. https://doi.org/10.3389/fpls.2021.469689
Selvaraj MG, Vergara A, Ruiz H et al (2019) Ai-powered banana diseases and pest detection. Plant Methods 15(1):1–11. https://doi.org/10.1186/s13007-019-0475-z
Selvaraju RR, Cogswell M, Das A et al (2017) Grad-cam: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision, pp 618–626 https://doi.org/10.1109/ICCV.2017.74
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition (arXiv preprint arXiv:14091556)
Szegedy C, Ioffe S, Vanhoucke V et al (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: Thirty-first AAAI conference on artificial intelligence https://doi.org/10.1609/aaai.v31i1.11231
Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In: International conference on machine learning, PMLR, pp 6105–6114
Tavakoli H, Alirezazadeh P, Hedayatipour A et al (2021) Leaf image-based classification of some common bean cultivars using discriminative convolutional neural networks. Comput Electron Agric 181:105935. https://doi.org/10.1016/j.compag.2020.105935
Toda Y, Okura F (2019) How convolutional neural networks diagnose plant disease. Plant Phenomics. https://doi.org/10.34133/2019/9237136
Too EC, Yujian L, Njuki S et al (2019) A comparative study of fine-tuning deep learning models for plant disease identification. Comput Electron Agric 161:272–279. https://doi.org/10.1016/j.compag.2018.03.032
Vaswani A, Shazeer N et al (2017) Attention is all you need. In: Advances in neural information processing systems (http://papers.nips.cc/paper/7181-attention-is-all-you-need)