Morphology-Based Identification of <i>Bemisia tabaci</i> Cryptic Species Puparia via Embedded Group-Contrast Convolution Neural Network Analysis

Systematic Biology - Tập 71 Số 5 - Trang 1095-1109 - 2022
Norman MacLeod1, Roy Canty2,3, Andrew Polaszek3
1School of Earth Sciences and Engineering, Zhu Gongshan Building , 163 Xianlin Avenue, Nanjing, 210023 Jiangsu, China
2Department of Entomology, Staatliches Museum für Naturkunde, Rosenstein 1 , 70191 Stuttgart, Germany
3Department of Life Sciences, Natural History Museum, Cromwell Road, SW7 5BD, London, UK

Tóm tắt

Abstract The Bemisia tabaci species complex is a group of tropical–subtropical hemipterans, some species of which have achieved global distribution over the past 150 years. Several species are regarded currently as among the world’s most pernicious agricultural pests, causing a variety of damage types via direct feeding and plant-disease transmission. Long considered a single variable species, genetic, molecular and reproductive compatibility analyses have revealed that this “species” is actually a complex of between 24 and 48 morphologically cryptic species. However, determinations of which populations represent distinct species have been hampered by a failure to integrate genetic/molecular and morphological species–diagnoses. This, in turn, has limited the success of outbreak-control and eradication programs. Previous morphological investigations, based on traditional and geometric morphometric procedures, have had limited success in identifying genetic/molecular species from patterns of morphological variation in puparia. As an alternative, our investigation focused on exploring the use of a deep-learning convolution neural network (CNN) trained on puparial images and based on an embedded, group-contrast training protocol as a means of searching for consistent differences in puparial morphology. Fifteen molecular species were selected for analysis, all of which had been identified via DNA barcoding and confirmed using more extensive molecular characterizations and crossing experiments. Results demonstrate that all 15 species can be discriminated successfully based on differences in puparium morphology alone. This level of discrimination was achieved for laboratory populations reared on both hairy-leaved and glabrous-leaved host plants. Moreover, cross-tabulation tests confirmed the generality and stability of the CNN discriminant system trained on both ecophenotypic variants. The ability to identify B. tabaci species quickly and accurately from puparial images has the potential to address many long-standing problems in B. tabaci taxonomy and systematics as well as playing a vital role in ongoing pest-management efforts. [Aleyrodidae; entomology; Hemiptera; machine learning; morphometrics; pest control; systematics; taxonomy; whiteflies.]

Từ khóa


Tài liệu tham khảo

Arteaga, 2019, Interpretable machine learning for image classification with LIME, Toward Data Sci., 2019, 1

Basit, 2019, Status of insecticide resistance in Bemisia tabaci: resistance, cross-resistance, stability of resistance, genetics and fitness costs, Phytoparasitica, 47, 207, 10.1007/s12600-019-00722-5

Becht, 2019, Dimensionality reduction for visualizing single-cell data using UMAP, Nat. Biotechnol., 37, 38, 10.1038/nbt.4314

Bellows, 1994, Description of a species of Bemisia (Homoptera: Aleyrodidae), Ann. Entomol. Soc. Am., 87, 195, 10.1093/aesa/87.2.195

Bethke, 1991, Comparative biology, morphometrics, and development of two populations of Bemisia tabaci (Homoptera: Aleyrodidae) on cotton and poinsettia, Ann. Entomol. Soc. Am., 84, 407, 10.1093/aesa/84.4.407

Bielza, 2019, Spiromesifen and spirotetramat resistance in field populations of Bemisia tabaci Gennadius in Spain Pest Man Sci, 75, 45

Bookstein, 1991, Morphometric tools for landmark data: geometry and biology

Boykin, 2018, Review and guide to a future naming system of African Bemisia tabaci species, Syst. Entomol., 43, 427, 10.1111/syen.12294

Breiman, 2001, Random forests, Mach. Learn., 45, 5, 10.1023/A:1010933404324

Britten, 1986, Rates of DNA sequence evolution differ between taxonomic groups, Science, 231, 1393, 10.1126/science.3082006

Bromham, 2009, Why do species vary in their rate of molecular evolution?, Biol. Lett., 5, 401, 10.1098/rsbl.2009.0136

Chaubey, 2015, Geometric morphometrics of puparia of Bemisia tabaci (Gennadius) (Hemiptera: Aleyrodidae) from Western Plateau and Hill regions agroclimatic zone of India, Ind. J. Entomol., 77, 283, 10.5958/0974-8172.2015.00058.9

Chi, 2020, Differential transmission of Sri Lankan cassava mosaic virus by three cryptic species of the whitefly Bemisia tabaci complex, Virology, 540, 141, 10.1016/j.virol.2019.11.013

Chicco, 2020, The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation, BMC Genomics., 21, 1, 10.1186/s12864-019-6413-7

Dayrat, 2005, Towards integrative taxonomy, Biol. J. Linn. Soc., 85, 407, 10.1111/j.1095-8312.2005.00503.x

De Barro, 2000, Mating interactions between two biotypes of the whitefly, Bemisia tabaci (Hemiptera: Aleyrodidae) in Australia, Bull. Entomol. Res., 90, 103, 10.1017/S0007485300000201

Delatte, 2007, A new silverleaf-inducing biotype Ms of Bemisia tabaci (Hemiptera: Aleyrodidae) indigenous to the islands of the south-west Indian Ocean, Bull. Entomol. Res., 95, 29, 10.1079/BER2004337

Delaunay, 2020, Improving anchor-based explanations, Proceedings of the 29th ACM International Conference on Information & Knowledge Management (CIKM ’20)., 3269

DeSalle, 2005, The unholy trinity: taxonomy, species delimitation and DNA barcoding, Philos. Trans. R. Soc. B, 360, 1905, 10.1098/rstb.2005.1722

Dinsdale, 2010, Refined global analysis of Bemisia tabaci (Hemiptera: Sternorrhyncha: Aleyrodoidea: Aleyrodidae) mitochondrial cytochrome oxidase 1 to identify species level genetic boundaries, Ann. Entomol. Soc. Am., 103, 196, 10.1603/AN09061

Dorrity, 2020, Dimensionality reduction by UMAP to visualize physical and genetic interactions, Nat. Commun., 11, 1, 10.1038/s41467-020-15351-4

Dzhembekova, 2020, Comparative analysis of morphological and molecular approaches integrated into the study of the dinoflagellate biodiversity within the recently deposited Black Sea sediments-benefits and drawbacks, Biodivers. Data J., 8, e55172, 10.3897/BDJ.8.e55172

Fiallo-Olivé, 2020, Transmission of begomoviruses and other whitefly-borne viruses: dependence on the vector species, Phytopathology, 110, 10, 10.1094/PHYTO-07-19-0273-FI

Forbes, 2019, Neural naturalist: generating fine-grained image comparisons, 708

Gill,, 1990, The morphology of whiteflies. In: Gerling, D., editor. Whiteflies: their bionomics, pest status and management, 13

Hall, 2014, Use of wing morphometrics to identify populations of the Old World screwworm fly, Chrysomya bezziana (Diptera: Calliphoridae): a preliminary study of the utility of museum specimens, Acta Trop., 138, 49, 10.1016/j.actatropica.2014.03.023

Hamada, 2019, Differential metabolism of imidacloprid and dinotefuran by Bemisia tabaci CYP6CM1 variants, Pest Biochem. Physiol., 159, 27, 10.1016/j.pestbp.2019.05.011

Hardy, 1940, A mathematician’s apology

Harris, 1988, A combined corner and edge detector, Proceedings of the 4th Alvey Vision Conference,, 147

Harish, 2016, Morphometric variations in Cvassava (Manihot esculenta Crantz) whitefly, Bemisia tabaci (Gennadius) (Hemiptera: Aleyrodidae) from different agro-ecological zones of Kerala, India, J. Root Crops, 42, 90

Hebert, 2003, Biological identifications through DNA barcodes, Proc. Roy. Soc. B, 270, 313, 10.1098/rspb.2002.2218

Hodges, 2005, An identification guide to the whiteflies (Hemiptera: Aleyrodidae) of the Southeastern United States, Florida Entomol., 88, 518, 10.1653/0015-4040(2005)88[518:AIGTTW]2.0.CO;2

Hillis, 1987, Molecular versus morphological approaches to systematics, Annu. Rev. Ecol. Syst., 18, 23, 10.1146/annurev.es.18.110187.000323

Hoyal Cuthill, 2019, Deep learning on butterfly phenotypes tests evolution’s oldest mathematical model, Sci. Adv., 5, 1, 10.1126/sciadv.aaw4967

Hu, 2011, An extensive field survey combined with a phylogenetic analysis reveals rapid and widespread invasion of two alien whiteflies in China, PLoS One, 6, e16061, 10.1371/journal.pone.0016061

Hussain, 2019, Whole genome sequencing of Asia II 1 species of whitefly reveals that genes involved in virus transmission and insecticide resistance have genetic variances between Asia II 1 and MEAM1 species, BMC Genomics, 20, 1, 10.1186/s12864-019-5877-9

Jazzar, 2004, Efficacy of multiple biocontrol agents against the sweet potato whitefly Bemisia tabaci (Gennadius) (Homoptera: Aleyrodidae) on tomato, J. Appl. Entomol., 128, 188, 10.1111/j.1439-0418.2004.00833.x

Johnson, 2008, How to do everything: digital camera,, 5th ed.

Jhamtani, 2018, Learning to describe differences between pairs of similar images, 4024

Kanakala, 2019, Global genetic diversity and geographical distribution of Bemisia tabaci and its bacterial endosymbionts, PLoS One, 19, e0213946, 10.1371/journal.pone.0213946

Kendall, 1981, The statistics of shape, Interpreting multivariate data., 75

Kendall, 1984, Shape manifolds, procrustean metrics and complex projective spaces, Bull. Lond. Math. Soc., 16, 81, 10.1112/blms/16.2.81

Kliot, 2019, Combined infection with Tomato yellow leaf curl virus and Rickettsia influences fecundity, attraction to infected plants and expression of immunity-related genes in the whitefly Bemisia tabaci, J. Gen. Virol., 100, 721, 10.1099/jgv.0.001233

LeCun, 1998, Gradient-based learning applied to document recognition, Proc. IEEE, 86, 2278, 10.1109/5.726791

LeCun, 2015, Deep learning, Nature, 521, 436, 10.1038/nature14539

Lee, 2011, Barcoding aphids (Hemiptera: Aphididae) of the Korean Peninsula: updating the global data set, Mol. Ecol. Res., 11, 32, 10.1111/j.1755-0998.2010.02877.x

Lee, 2013, Taxonomic status of the Bemisia tabaci complex (Hemiptera: Aleyrodidae) and reassessment of the number of its constituent species, PLoS One, 13, e63817, 10.1371/journal.pone.0063817

Li, 2013, Comparative morphology and morphometry of six biotypes of Bemisia tabaci (Hemiptera: Aleyrodidae) from China, J. Integr. Agric., 12, 846, 10.1016/S2095-3119(13)60303-2

Liu, 2012, Species concepts as applied to the whitefly Bemisia tabaci systematics: how many species are there?, J. Integr. Agric., 11, 176, 10.1016/S2095-3119(12)60002-1

Liu, 2016, The suitability of biotypes Q and B of Bemisia tabaci (Gennadius) (Hemiptera: Aleyrodidae) at different nymphal instars as hosts for Encarsia formosa Gahan (Hymenoptera: Aphelinidae), PeerJ, 4, e1863, 10.7717/peerj.1863

Lundberg, 2017, A unified approach to interpreting model predictions, 31st Conference on Neural Information Processing Systems (NIPS 2017), 10

MacLeod, 2007, Automated taxon identification in systematics: theory, approaches, and applications

MacLeod, 2015, The direct analysis of digital images (eigenimage) with a comment on the use of discriminant analysis in morphometrics, Proceedings of the Third International Symposium on Biological Shape Analysis., 156, 10.1142/9789814704199_0011

MacLeod, 2018, The quantitative assessment of archaeological artifact groups: Beyond geometric morphometrics, Quat. Sci. Rev., 201, 319, 10.1016/j.quascirev.2018.08.024

MacLeod, 2018, Towards the automated identification of Chrysomya blow flies from wing images, Med. Vet. Entomol., 32, 323, 10.1111/mve.12302

MacLeod, 2020, Machine-learning strategies for testing patterns of morphological variation in small samples: sexual dimorphism in gray wolf (Canis lupus) crania, BMC Biol., 18, 1, 10.1186/s12915-020-00832-1

MacLeod, 2005, Automated object recognition in systematics, Systemalist, 25, 14

Manly, 2006, Randomization, bootstrap and Monte Carlo methods in biology,, 3rd ed.

Manly, 2017, Multivariate statistical methods: a primer,, 4th ed

Manzari, 2006, A cladistic analysis of whiteflies, subfamily Aleyrodinae (Hemiptera: Sternorrhyncha: Aleyrodidae), J. Nat. Hist., 40, 44, 10.1080/00222930601121890

Marubayashi, 2014, Diversity and localization of bacterial endosymbionts from whitefly species collected in Brazil, PLoS One, 14, e108363, 10.1371/journal.pone.0108363

Matthews,, 1975, Comparison of the predicted and observed secondary structure of T4 phage lysozyme, Biochim. Biophys. Acta (BBA) - Protein Struct., 405, 442, 10.1016/0005-2795(75)90109-9

Meier, 2006, DNA barcoding and taxonomy in Diptera: a tale of high intraspecific variability and low identification success, Syst. Biol., 55, 715, 10.1080/10635150600969864

Meier, 2008, The use of mean instead of smallest interspecific distances exaggerates the size of the “barcoding gap” and leads to misidentification, Syst. Biol., 57, 809, 10.1080/10635150802406343

Molnar, 2020, Interpretable machine learning: a guide for making black box models explainable

Mound, 1963, Host-correlated variation in Bemisia tabaci (Gennadius) (Homoptera: Aleyrodidae), Proc. Roy. Entomol. Soc. Lond. A, 38, 171

Mugerwa, 2020, Whole-genome single nucleotide polymorphism and mating compatibility studies reveal the presence of distinct species in sub-Saharan Africa Bemisia tabaci whiteflies, Insect Sci., 28, 1553, 10.1111/1744-7917.12881

Mugerwa, 2018, African ancestry of New World, Bemisia tabaci-whitefly species, Sci. Rep., 8, 2734, 10.1038/s41598-018-20956-3

Navas-Castillo, 2011, Emerging virus diseases transmitted by whiteflies, Ann. Rev. Phytopath., 49, 219, 10.1146/annurev-phyto-072910-095235

Oliveira, 2001, History, current status, and collaborative research projects for Bemisia tabaci, Crop Protect, 20, 709, 10.1016/S0261-2194(01)00108-9

Packer, 2018, Validating taxonomic identifications in entomological research, Insect Conserv. Div., 11, 1, 10.1111/icad.12284

Palandaéiæ, 2017, Contrasting morphology with molecular data: an approach to revision of species complexes based on the example of European Phoxinus (Cyprinidae), BMC Evol. Biol., 17, 184, 10.1186/s12862-017-1032-x

Pan, 2018, Cotton leaf curl disease: which whitefly is the vector?, Phytopathology, 108, 1172, 10.1094/PHYTO-01-18-0015-R

Peixoto, 2018, When taxonomy and biological control researchers unite: species delimitation of Eadya parasitoids (Braconidae) and consequences for classical biological control of invasive paropsine pests of Eucalyptus, PLoS One, 18, e0201276, 10.1371/journal.pone.0201276

Polaszek, 2013, Wallaceaphytis: an unusual new genus of parasitoid wasp (Hymenoptera: Aphelinidae) from Borneo, J. Nat. Hist., 48, 1111, 10.1080/00222933.2013.852264

Qin, 2015, Further insight into reproductive incompatibility between putative cryptic species of the Bemisia tabaci whitefly complex, Insect Sci., 23, 215, 10.1111/1744-7917.12296

Ranaweera, 2009, Feasibility of computer-aided identification of foraminiferal tests, Mar. Micropaleontol., 72, 66, 10.1016/j.marmicro.2009.03.005

Ribeiro, 2016, “Why should I trust you?” Explaining the predictions of any classifier,, Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining., 1135, 10.1145/2939672.2939778

Ribeiro, 2018, Anchors: high-precision model-agnostic explanations, 32nd AAAI Conference on Artificial Intelligence AAAI., 1527

Rosen, 1986, The role of taxonomy in effective biological control programs, Agric. Ecosyst. Env., 15, 121, 10.1016/0167-8809(86)90085-X

Russell, 1957, Synonyms of Bemisia tabaci (Gennadius) (Homoptera: Aleyrodidae), Bull. Brook Entomol. Soc., 52, 122

Sánchez, 2018, An analysis and implementation of the Harris corner detector, Image Process. On Line, 8, 305, 10.5201/ipol.2018.229

Sanderson, 2002, Estimating absolute rates of molecular evolution and divergence times: A penalized likelihood approach, Mol. Biol. Evol., 19, 101, 10.1093/oxfordjournals.molbev.a003974

Schauff, 1998, The relevance of systematics to biological control: protecting the investment in research, Pest management - future challenges. Proceedings of the sixth Australasian applied entomological research conference, Brisbane, 29 September–2 October 1., 425

Schlick-Steiner, 2010, Integrative taxonomy: a multisource approach to exploring biodiversity, Ann. Rev. Entomol., 55, 421, 10.1146/annurev-ento-112408-085432

Shapley, 1951, The value of an n-person game, Rand Corp. Res. Memo, RM-670, 1

Sidney,, 2002, Applied photographic optics, 3rd ed.

Simonyan, 2014, Deep inside convolutional networks: visualising image classification models and saliency maps, 2nd International Conference on Learning Representations, ICLR 2014 - Work. Track Proceedings., 1

Sirisena, 2013, A modified technique for the preparation of specimens of Sternorryncha for taxonomic studies, Trop. Agric. Res., 24, 139

Smith, 2008, Rates of molecular evolution are linked to life history in flowering plants, Science, 322, 86, 10.1126/science.1163197

Stewart, 2020, Guide to interpretable machine learning, Toward Data Sci., 2020, 1

Szeliski, 2006, Image alignment and stitching, Handbook of mathematical models in computer vision,, 273, 10.1007/0-387-28831-7_17

Tan, 2009, From ‘cryptic specie’ to integrative taxonomy: an iterative process involving DNA sequences, morphology, and behaviour leads to the resurrection of Sepsis pyrrhosoma (Sepsidae: Diptera), Zool. Scrip., 39, 51, 10.1111/j.1463-6409.2009.00408.x

Tay, 2012, Will the real Bemisia tabaci please stand up?, PLoS One, 12, e50550, 10.1371/journal.pone.0050550

Tay, 2017, The trouble with MEAM2: implications of pseudogenes on species delimitation in the globally invasive Bemisia tabaci (Hemiptera: Aleyrodidae) cryptic species complex, Genome Biol. Evol., 9, 2732, 10.1093/gbe/evx173

Ueda, 2019, Accumulation of salicylic acid in tomato plant under biological stress affects oviposition preference of Bemisia tabaci, J. Plant Int., 14, 73

van der Maaten, 2008, Visualizing data using t-SNE, J. Mach. Learn. Res., 9, 2579

van der Maaten, 2009, Dimensionality reduction: a comparative review, Tilburg University Tech Rep: TiCC TR 2009–005

Vogler, 2007, Recent advances in DNA taxonomy, J. Zool. Syst. Evol. Res., 45, 1, 10.1111/j.1439-0469.2006.00384.x

Vyskočeilová, 2018, An integrative approach to discovering cryptic species within the Bemisia tabaci whitefly species complex, Sci. Rep., 8, 10886, 10.1038/s41598-018-29305-w

Vyskočilová, 2019, Relative polyphagy of “Mediterranean” cryptic Bemisia tabaci whitefly species and global pest status implications, J. Pest Manage., 92, 1071, 10.1007/s10340-019-01113-9

Vyskočilová, 2019, Integrative approach to discovering species diversity within the Mediterranean group of the Bemisia tabaci complex, PhD thesis, University of Greenwich.

Wan, 2020, NBDT: neural-backed decision trees, 1, 1

Wattenberg, 2016, How to use t-SNE effectively, Distill, 2016, 1

Wilf, 2016, Computer vision cracks the leaf code, Proc. Natl. Acad. Sci. USA, 113, 3305, 10.1073/pnas.1524473113

Wright, 2006, The road from Santa Rosalia: a faster tempo of evolution in tropical climates, Proc. Natl. Acad. Sci. USA, 103, 7718, 10.1073/pnas.0510383103

Zhu, 2020, On the performance of Matthews correlation coefficient (MCC) for imbalanced dataset, Pattern Recognit. Lett., 136, 71, 10.1016/j.patrec.2020.03.030