Texture-based latent space disentanglement for enhancement of a training dataset for ANN-based classification of fruit and vegetables

Information Processing in Agriculture - Tập 10 - Trang 85-105 - 2023
Khurram Hameed1, Douglas Chai1, Alexander Rassau1
1School of Engineering, Edith Cowan University, 270 Joondalup Drive, Joondalup, WA 6027, Perth, Australia

Tài liệu tham khảo

Zhang, 2014, Fruit classification using computer vision and feedforward neural network, J Food Eng, 143, 167, 10.1016/j.jfoodeng.2014.07.001 Zhang, 2019, Image based fruit category classification by 13-layer deep convolutional neural network and data augmentation, Multimedia Tools Appl, 78, 3613, 10.1007/s11042-017-5243-3 Nasirahmadi, 2017, Bag-of-Feature model for sweet and bitter almond classification, Biosyst Eng, 156, 51, 10.1016/j.biosystemseng.2017.01.008 Behera, 2019, Automatic classification of mango using statistical feature and SVM, Adv Comput Commun Control Lecture Notes Netw Syst, 41, 469, 10.1007/978-981-13-3122-0_47 Tripathi, 2019, A role of computer vision in fruits and vegetables among various horticulture products of agriculture fields: a survey, Inform Process Agric, 7, 183 Rehman, 2019, Current and future applications of statistical machine learning algorithms for agricultural machine vision systems, Comput Electron Agric, 156, 585, 10.1016/j.compag.2018.12.006 Bhargava, 2021, Fruits and vegetables quality evaluation using computer vision: a review, J King Saud Univ – Comput Inform Sci, 33, 243 Moallem, 2017, Computer vision-based apple grading for golden delicious apples based on surface features, Inform Process Agric, 4, 33 Nouri-Ahmadabadi, 2017, Design, development and evaluation of an online grading system for peeled pistachios equipped with machine vision technology and support vector machine, Inform Process Agric, 4, 333 Arakeri, 2016, Computer vision based fruit grading system for quality evaluation of tomato in agriculture industry, Proc Comput Sci, 79, 426, 10.1016/j.procs.2016.03.055 Jhawar, 2016, Orange sorting by applying pattern recognition on colour image, Procedia Comput Sci, 78, 691, 10.1016/j.procs.2016.02.118 Sofu, 2016, Design of an automatic apple sorting system using machine vision, Comput Electron Agric, 127, 395, 10.1016/j.compag.2016.06.030 Mahendran, 2012, Application of computer vision technique on sorting and grading of fruits and vegetables, J Food Process Technol, 10, 2157 Al Ohali, 2011, Computer vision based date fruit grading system: design and implementation, J King Saud Univ – Comput Inform Sci, 23, 29 Liming, 2010, Automated strawberry grading system based on image processing, Comput Electron Agric, 71, 32, 10.1016/j.compag.2009.09.013 Hameed, 2018, A comprehensive review of fruit and vegetable classification techniques, Image Vis Comput, 80, 24, 10.1016/j.imavis.2018.09.016 Hameed K, Chai D, Rassau A. A progressive weighted average weight optimisation ensemble technique for fruit and vegetable classification. In: Proc of the 16th international conference on control, automation, robotics and vision, ICARCV. Shenzhen, China; 2020. p. 303–8. Hameed, 2020, A sample weight and adaboost cnn-based coarse to fine classification of fruit and vegetables at a supermarket self-checkout, Appl Sci, 10, 8667, 10.3390/app10238667 Hussain Hassan, 2019, New effective techniques for automatic detection and classification of external olive fruits defects based on image processing techniques, Multidimension Syst Signal Process, 30, 571, 10.1007/s11045-018-0573-5 Habib, 2020, Machine vision based papaya disease recognition, J of King Saud Univ – Comput Inform Sci, 32, 300 Braverman, 2008, Polylogarithmic independence fools AC0 circuits, J Assoc Comput Mach, 57, 10 Sun C, Shrivastava A, Singh S, Gupta A. Revisiting unreasonable effectiveness of data in deep learning era. In: Proc of the 15th IEEE international conference on computer vision, ICCV. Venice, Italy; 2017. p. 843–52. Bengio Y, Delalleau O, Roux NL. The curse of highly variable functions for local kernel machines. In: Proc of the 19th international conference on neural information processing systems, NIPS. Vancouver, Canada; 2005. p. 107–14. Bottou, 2013, From machine learning to machine reasoning: an essay, J Mach Learn Res, 94, 3207 Takahashi, 2019, Data augmentation using random image cropping and patching for deep CNNs, IEEE Trans Circ Syst Video Technol, 30, 2917, 10.1109/TCSVT.2019.2935128 Yang, 2009, A survey on transfer learning, IEEE Trans Knowl Data Eng, 22, 1345 Kingma DP, Welling M. Auto-encoding variational bayes. In: Proc of the 2nd international conference on learning representations, ICLR. Banff, AB, Canada; 2014. p. 214–28. Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, et al.. Generative adversarial nets. In: Proc of the 35th neural information processing systems, NIPS. Montreal, Canada; 2014. p. 2672–80. Theis L, van den Oord A, Bethge M. A note on the evaluation of generative models. In: Proc of the 4th international conference on learning representations, ICLR. San Juan, Puerto Rico; 2016. p. 25–35. Jalwana MAAK, Akhtar N, Bennamoun M, Mian A. Attack to explain deep representation. In: Proc of the IEEE conference on computer vision and pattern recognition CVPR. Seattle, WA, USA; 2020. p. 9540–9. Shah, 2017, Network intrusion detection through discriminative feature selection by using sparse logistic regression, Future Internet, 9, 81, 10.3390/fi9040081 Alzahrani, 2020, A novel method for feature learning and network intrusion classification, Alexandria Eng J, 59, 1159, 10.1016/j.aej.2020.01.021 Zardari, 2020, A lightweight technique for detection and prevention of wormhole attack in manet, EAI Endorsed Trans Scalable Inform Syst, 8, 1 Bousmalis K, Silberman N, Dohan D, Erhan D, Krishnan D. Unsupervised pixel-level domain adaptation with generative adversarial networks. In: Proc of the IEEE conference on computer vision and pattern recognition, CVPR. Honolulu, Hawaii, USA; 2017. p. 95–104. Le Roux, 2008, Representational power of restricted boltzmann machines and deep belief networks, Neural Comput, 20, 1631, 10.1162/neco.2008.04-07-510 Hinton, 2006, A fast learning algorithm for deep belief nets, Neural Comput, 18, 1527, 10.1162/neco.2006.18.7.1527 Rezende DJ, Mohamed S, Wierstra D. Stochastic backpropagation and approximate inference in deep generative models. In: Proc of the 31st international conference on machine learning, ICML. Beijing, China; 2014. p. 3057–70. Van Den Oord A, Kalchbrenner N, Kavukcuoglu K. Pixel recurrent neural networks. In: Proc of the 33rd international conference on machine learning, ICML. New York, USA; 2016. p. 2611–20. Bengio Y, Thibodeau-Laufer É, Alain G, Yosinski J. Deep generative stochastic networks trainable by backprop. In: Proc of the 31st international conference on machine learning, ICML. Beijing, China; 2014. p. 1470–85. Van Den Oord A, Kalchbrenner N, Vinyals O, Espeholt L, Graves A, Kavukcuoglu K. Conditional image generation with PixelCNN decoders. In: Proc of the 30th conference on neural information processing systems, NIPS. Barcelona, Spain; 2016. p. 4797–805. Arjovsky M, Bottou L. Towards principled methods for training generative adversarial networks. In: Proc. of the 5th international conference on learning representations, ICLR. Toulon, France; 2017. p. 1355–72. Che T, Li Y, Jacob AP, Bengio Y, Li W. Mode regularized generative adversarial networks. In: Proc of the 5th international conference on learning representations, ICLR. Toulon, France; 2017. p. 82–6. Ghosh A, Kulharia V, Namboodiri V, Torr PH, Dokania PK. Multi-agent diverse generative adversarial networks. In: Proc of the IEEE conference on computer vision and pattern recognition, CVPR. Utah, USA; 2018. p. 8513–21. Salimans Tim, Goodfellow Ian, Zaremba Wojciech, Cheung Vicki, Radford Alec, Chen X. Improved techniques for training GANs. In: Proc of the 30th conference on neural information processing systems, NIPS. Barcelona, Spain; 2016. p. 2234–42. Yan X, Yang J, Sohn K, Lee H. Attribute2image: Conditional image generation from visual attributes. In: Proc of the 14th European conference on computer vision ECCV. Amsterdam, Netherlands; 2016. p. 776–91. Chen X, Duan Y, Houthooft R, Schulman J, Sutskever I, Abbeel P. Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In: Proc of the 30th conference on neural information processing systems, NIPS. Barcelona, Spain; 2016. p.2172–80. Zhu JY, Krähenbühl P, Shechtman E, Efros AA. Generative visual manipulation on the natural image manifold. In: Proc of the 14th European conference on computer vision, ECCV. Amsterdam, Netherlands; 2016. p. 597–613. Isola P, Zhu JY, Zhou T, Efros AA. Image-to-image translation with conditional adversarial networks. In: Proc of the IEEE conference on computer vision and pattern recognition, CVPR. Honolulu, Hawaii, USA; 2017. p. 1125–34. Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A et al. Photo-realistic single image super-resolution using a generative adversarial network. In: Proc of the IEEE conference on computer vision and pattern recognition CVPR. Honolulu, Hawaii, USA; 2017. p. 105–14. Zhang, 2019, StackGAN++: realistic image synthesis with stacked generative adversarial networks, IEEE Trans Pattern Anal Mach Intell, 41, 1947, 10.1109/TPAMI.2018.2856256 Mansimov E, Parisotto E, Ba LJ, Salakhutdinov R. Generating images from captions with attention. In: Proc of the 4th international conference on learning representations, ICLR. San Juan, Puerto Rico; 2016. p. 40–8. Nguyen A, Clune J, Bengio Y, Dosovitskiy A, Yosinski J. Plug & play generative networks: conditional iterative generation of images in latent space. In: Proc of the IEEE conference on computer vision and pattern recognition, CVPR. Honolulu, Hawaii, USA; 2017. p. 4467–77. Wang X, Gupta A. Generative image modelling using style and structure adversarial networks. In: Proc of the 14th European conference on computer vision, ECCV. Amsterdam, Netherlands; 2016. p. 318–35. Denton EL, Chintala S, Fergus R, et al. Deep generative image models using a laplacian pyramid of adversarial networks. In: Proc of the 29th international conference on neural information processing systems, NIPS. Montreal, Canada; 2015. p. 1486–94. T Karras, T Alia, SLiane J. Progressively growing of GANs for improved quality, stability and variation. In: Proc of the 6th international conference on learning representations, ICLR. Vancouver, Canada; 2018. p. 50–6. Desjardins G, Courville A, Bengio Y. Disentangling factors of variation via generative entangling. In: Proc of the 26th neural information processing systems, NIPS. Lake Tahoe, USA; 2012. p. 5040–8. Karras T, Laine S, Aila T. A style-based generator architecture for generative adversarial networks. In: Proc of the IEEE conference on computer vision and pattern recognition, CVPR. Long Beach, CA, USA; 2019. p. 4401–10. Hénaff OJ, Razavi A, Doersch C, Eslami S, Oord Avd. Data-efficient image recognition with contrastive predictive coding. In: Proc of the 8th international conference on learning representations, ICLR. Addis Ababa, Ethiopia; 2020. p. 1–10. Van den Oord, 2018, Representation learning with contrastive predictive coding, Comput Res Repos, 1807.03748, 1 Zhang R, Isola P, Efros AA. Colorful image colorization. In: Proc of the 14th European conference on computer vision, ECCV. Amsterdam, Netherlands; 2016. p. 649–66. Pu Y, Gan Z, Henao R, Yuan X, Li C, Stevens A, Carin L. Variational autoencoder for deep learning of images, labels and captions. In: Proc of the 30th conference on neural information processing systems, NIPS. Barcelona, Spain; 2016. p. 2360–68. Zeiler MD, Taylor GW, Fergus R. Adaptive deconvolutional networks for mid and high level feature learning. In: Proc of the 13th IEEE international conference on computer vision, ICCV. Barcelona, Spain; 2011. p. 2018–25. Kim H, Mnih A. Disentangling by factorising. In: Proc of the 25th international conference on machine learning, ICML. Stockholm, Sweden; 2018. p. 4153–71. Hoffman MD, Johnson MJ. ELBO surgery: yet another way to carve up the variational evidence lower bound. In: Proc of the 30th neural information processing systems, NIPS. Barcelona, Spain; 2016. p. 1177–83. Hjelm RD, Jacob AP, Che T, Trischler A, Cho K, Bengio Y. Boundary-seeking generative adversarial networks. In: Proc of the 6th international conference on learning representations, ICLR. Vancouver, Canada; 2018. p. 1–17. Zhao J, Mathieu M, LeCun Y. Metric-based generative adversarial network. In: Proc of the 5th international conference on learning representations, ICLR. Toulon, France; 2017. p. 672–80. Arjovsky M, Chintala S, Bottou L. Wasserstein generative adversarial networks. In: Proc of the 34th international conference on machine learning, ICML. Sydney, Australia; 2017. p. 214–23. Odena A, Olah C, Shlens J. Conditional image synthesis with auxiliary classifier GANs. In: Proc of the 34th international conference on machine learning, ICML. Sydney, Australia; 2017. p. 2642–51. Chen X, Duan Y, Houthooft R, Schulman J, Sutskever I, Abbeel P. InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets. In: Proc of the 30th conference on Neural Information Processing Systems, NIPS. Barcelona, Spain; 2016. p. 2180–8. Creswell, 2018, Generative adversarial networks: an overview, IEEE Signal Process Mag, 35, 53, 10.1109/MSP.2017.2765202 Karras T, Laine S, Aittala M, Hellsten J, Lehtinen J, Aila T. Analyzing and improving the image quality of StyleGAN. In: Proc of the IEEE conference on computer vision and pattern recognition CVPR. Seattle, WA, USA; 2020. p. 8107–16. Durugkar IP, Gemp I, Mahadevan S. Generative multi-adversarial networks. In: Proc of the 5th international conference on learning representations, ICLR. Toulon, France; 2017. p. 25–45. Doan T, Monteiro J, Albuquerque I, Mazoure B, Durand A, Pineau J, Hjelm RD. On-line adaptative curriculum learning for gans. In: Proc of the 33rd AAAI conference on artificial intelligence. Honolulu, Hawaii, USA, vol. 33; 2019. p. 3470–7. Mukherjee S, Asnani H, Lin E, Kannan S. Clustergan: Latent space clustering in generative adversarial networks. In: Proc of the 33rd AAAI conference on artificial intelligence. Honolulu, Hawaii, USA; 2019. p. 4610–17. Miyato T, Koyama M. Cgans with projection discriminator. In: Proc of the 6th international conference on learning representations, ICLR. Vancouver, Canada; 2018. p. 1258–65. Huang X, Belongie S. Arbitrary style transfer in real-time with adaptive instance normalisation. In: Proc of the 19th IEEE international conference on computer vision, ICCV. Venice, Italy; 2017. p. 1501–10. Clevert D, Unterthiner T, Hochreiter S. Fast and accurate deep network learning by exponential linear units (ELUs). In: Proc of the 4th international conference on learning representations, ICLR. San Juan, Puerto Rico; 2016. p. 2569–78. Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks. In: Proc of the 10th international conference on artificial intelligence and statistics, ICAISC. Sardinia, Italy; 2010. p. 249–56. Hinton, 1995, The“ wake-sleep” algorithm for unsupervised neural networks, Science, 268, 1158, 10.1126/science.7761831 Kingma DP, Ba JL. Adam: A method for stochastic optimization. In: Proc of the 3rd international conference on learning representations, ICLR. San Diego, CA, USA; 2015. p. 1–15. Dowson, 1982, The fréchet distance between multivariate normal distributions, J Multivariate Anal, 12, 450, 10.1016/0047-259X(82)90077-X Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S. Gans trained by a two time-scale update rule converge to a local nash equilibrium. In: Proc of the 31st conference on neural information processing systems, NIPS. Long Beach, USA; 2017. p. 6626–37. Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. In: Proc of the 3rd international conference on learning representations, ICLR. San Diego, CA, USA; 2015. p. 310–18. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proc of the IEEE conference on computer vision and pattern recognition, CVPR. Las Vegas, USA; 2016. p. 770–8. Russakovsky, 2015, Imagenet large scale visual recognition challenge, Int J Comput Vis, 115, 211, 10.1007/s11263-015-0816-y Mathieu MF, Zhao JJ, Zhao J, Ramesh A, Sprechmann P, LeCun Y. Disentangling factors of variation in deep representation using adversarial training. In: Proc of the 30th international conference on neural information processing systems, NIPS. Red Hook, NY, USA; 2016. p. 5047–55. Hameed, 2021, Class distribution-aware adaptive margins and cluster embedding for classification of fruit and vegetables at supermarket self-checkouts, Neurocomputing, 461, 292, 10.1016/j.neucom.2021.07.040