Improving Deep Learning-based Plant Disease Classification with Attention Mechanism

Pendar Alirezazadeh1, Michael Schirrmann1, Frieder Stolzenburg2
1Department of Engineering for Crop Production, Leibniz Institute for Agricultural Engineering and Bioeconomy (ATB), Max-Eyth-Allee 100, 14469, Potsdam, Germany
2Automation and Computer Sciences Department, Harz University of Applied Sciences, Friedrichstr. 57–59, 38855, Wernigerode, Germany

Tóm tắt

AbstractIn recent years, deep learning-based plant disease classification has been widely developed. However, it is challenging to collect sufficient annotated image data to effectively train deep learning models for plant disease recognition. The attention mechanism in deep learning assists the model to focus on the informative data segments and extract the discriminative features of inputs to enhance training performance. This paper investigates the Convolutional Block Attention Module (CBAM) to improve classification with CNNs, which is a lightweight attention module that can be plugged into any CNN architecture with negligible overhead. Specifically, CBAM is applied to the output feature map of CNNs to highlight important local regions and extract more discriminative features. Well-known CNN models (i.e. EfficientNetB0, MobileNetV2, ResNet50, InceptionV3, and VGG19) were applied to do transfer learning for plant disease classification and then fine-tuned by a publicly available plant disease dataset of foliar diseases in pear trees called DiaMOS Plant. Amongst others, this dataset contains 3006 images of leaves affected by different stress symptoms. Among the tested CNNs, EfficientNetB0 has shown the best performance. EfficientNetB0+CBAM has outperformed EfficientNetB0 and obtained 86.89% classification accuracy. Experimental results show the effectiveness of the attention mechanism to improve the recognition accuracy of pre-trained CNNs when there are few training data.

Từ khóa


Tài liệu tham khảo

Alirezazadeh P, Rahimi-Ajdadi F, Abbaspour-Gilandeh Y et al (2021) Improved digital image-based assessment of soil aggregate size by applying convolutional neural networks. Comput Electron Agric 191:499. https://doi.org/10.1016/j.compag.2021.106499

Alirezazadeh P, Dornaika F, Moujahid A (2022) Deep learning with discriminative margin loss for cross-domain consumer-to-shop clothes retrieval. Sensors 22(7):2660. https://doi.org/10.3390/s22072660

Ayinde BO, Inanc T, Zurada JM (2019) Redundant feature pruning for accelerated inference in deep neural networks. Neural Networks 118:148–158. https://doi.org/10.1016/j.neunet.2019.04.021

de Camargo T, Schirrmann M, Landwehr N et al (2021) Optimized deep learning model as a basis for fast uav mapping of weed species in winter wheat crops. Remote Sens 13(9):1704. https://doi.org/10.3390/rs13091704

Chen L, Zhang H, Xiao J et al (2017) SCA-CNN: Spatial and channel-wise attention in convolutional networks for image captioning, pp 6298–6306 https://doi.org/10.1109/CVPR.2017.667

Fenu G, Malloci FM (2021) Diamos plant: A dataset for diagnosis and monitoring plant disease. Agronomy 11(11):2107. https://doi.org/10.3390/agronomy11112107

Ferentinos KP (2018) Deep learning models for plant disease detection and diagnosis. Comput Electron Agric 145:311–318. https://doi.org/10.1016/j.compag.2018.01.009

Fuentes A, Yoon S, Kim SC et al (2017) A robust deep-learning-based detector for real-time tomato plant diseases and pests recognition. Sensors 17(9):2022. https://doi.org/10.3390/s17092022

Fuentes A, Yoon S, Park DS (2019) Deep learning-based phenotyping system with glocal description of plant anomalies and symptoms. Front Plant Sci 10:1321. https://doi.org/10.3389/fpls.2019.01321

He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 https://doi.org/10.1109/CVPR.2016.90

He K, Girshick R, Dollár P (2019) Rethinking imagenet pre-training. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 4918–4927 https://doi.org/10.1109/ICCV.2019.00502

Howard AG, Zhu M, Chen B et al (2017) Mobilenets: Efficient convolutional neural networks for mobile vision applications (arXiv preprint arXiv:170404861)

Kumar M, Gupta P, Madhav P et al (2020) Disease detection in coffee plants using convolutional neural network. In: 2020 5th International Conference on Communication and Electronics Systems (ICCES). IEEE, pp 755–760 https://doi.org/10.1109/ICCES48766.2020.9138000

Lee SH, Goeau H, Bonnet P et al (2020a) Attention-based recurrent neural network for plant disease classification. Front Plant Sci 11:1897. https://doi.org/10.3389/fpls.2020.601250

Lee SH, Goeau H, Bonnet P et al (2020b) New perspectives on plant disease characterization based on deep learning. Comput Electron Agric 170:105220. https://doi.org/10.1016/j.compag.2020.105220

Liu J, Wang X (2020) Tomato diseases and pests detection based on improved yolo v3 convolutional neural network. Front Plant Sci 11:898. https://doi.org/10.3389/fpls.2020.00898

Liu B, Zhang Y, He D et al (2017) Identification of apple leaf diseases based on deep convolutional neural networks. Symmetry 10(1):11. https://doi.org/10.3390/sym10010011

Liu B, Ding Z, Tian L et al (2020) Grape leaf disease identification using improved deep convolutional neural networks. Front Plant Sci 11:1082. https://doi.org/10.3389/fpls.2020.01082

Mohanty SP, Hughes DP, Salathé M (2016) Using deep learning for image-based plant disease detection. Front Plant Sci 7:1419. https://doi.org/10.3389/fpls.2016.01419

Pedregosa F, Varoquaux G, Gramfort A et al (2011) Scikit-learn: Machine learning in python. J Mach Learn Res 12:2825–2830

Raghu M, Zhang C, Kleinberg J et al (2019) Transfusion: Understanding transfer learning for medical imaging. In: Advances in neural information processing systems, p 32

Ramcharan A, Baranowski K, McCloskey P et al (2017) Deep learning for image-based cassava disease detection. Front Plant Sci 8:1852. https://doi.org/10.3389/fpls.2017.01852

Russakovsky O, Deng J, Su H et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252. https://doi.org/10.1007/s11263-015-0816-y

Sandler M, Howard A, Zhu M et al (2018) Mobilenetv2: Inverted residuals and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4510–4520 https://doi.org/10.1109/CVPR.2018.00474

Schirrmann M, Landwehr N, Giebel A et al (2021) Early detection of stripe rust in winter wheat using deep residual neural networks. Front Plant Sci 12:475. https://doi.org/10.3389/fpls.2021.469689

Selvaraj MG, Vergara A, Ruiz H et al (2019) Ai-powered banana diseases and pest detection. Plant Methods 15(1):1–11. https://doi.org/10.1186/s13007-019-0475-z

Selvaraju RR, Cogswell M, Das A et al (2017) Grad-cam: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision, pp 618–626 https://doi.org/10.1109/ICCV.2017.74

Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition (arXiv preprint arXiv:14091556)

Szegedy C, Ioffe S, Vanhoucke V et al (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: Thirty-first AAAI conference on artificial intelligence https://doi.org/10.1609/aaai.v31i1.11231

Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In: International conference on machine learning, PMLR, pp 6105–6114

Tavakoli H, Alirezazadeh P, Hedayatipour A et al (2021) Leaf image-based classification of some common bean cultivars using discriminative convolutional neural networks. Comput Electron Agric 181:105935. https://doi.org/10.1016/j.compag.2020.105935

Toda Y, Okura F (2019) How convolutional neural networks diagnose plant disease. Plant Phenomics. https://doi.org/10.34133/2019/9237136

Too EC, Yujian L, Njuki S et al (2019) A comparative study of fine-tuning deep learning models for plant disease identification. Comput Electron Agric 161:272–279. https://doi.org/10.1016/j.compag.2018.03.032

Vaswani A, Shazeer N et al (2017) Attention is all you need. In: Advances in neural information processing systems (http://papers.nips.cc/paper/7181-attention-is-all-you-need)

Woo S, Park J, Lee JY et al (2018) Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19 https://doi.org/10.1007/978-3-030-01234-2_1