PADI-web 3.0: A new framework for extracting and disseminating fine-grained information from the news for animal disease surveillance

One Health - Tập 13 - Trang 100357 - 2021
Sarah Valentin1,2,3, Elena Arsevska1,2, Julien Rabatel1, Sylvain Falala1,2, Alizé Mercier1,2, Renaud Lancelot1,2, Mathieu Roche1,3
1CIRAD, UMR ASTRE / UMR TETIS, F-34398 Montpellier, France
2ASTRE, Univ Montpellier, CIRAD, INRAE, Montpellier, France
3TETIS, Univ Montpellier, AgroParisTech, CIRAD, CNRS, INRAE, Montpellier, France

Tài liệu tham khảo

Keesing, 2010, Impacts of biodiversity on the emergence and transmission of infectious diseases, Nature, 468, 647, 10.1038/nature09575 Ostfeld, 2009, Biodiversity loss and the rise of zoonotic pathogens, Clin. Microbiol. Infect., 15, 40, 10.1111/j.1469-0691.2008.02691.x Langmuir, 1980, The epidemic intelligence Service of the Center for Disease Control, Public Health Rep., 95, 470 Kaiser, 2006, What is epidemic intelligence, and how is it being improved in Europe?, Weekly Releases(1997–2007), 11, 2892 Paquet, 2006, Epidemic intelligence: a new framework for strengthening disease surveillance in Europe, Eurosurveillance, 11, 5, 10.2807/esm.11.12.00665-en WHO, 2014 Alomar, 2016, Development and testing of the media monitoring tool MedISys for the monitoring, early identification and reporting of existing and emerging plant health threats, EFSA Supporting Publications, 13, 10.2903/sp.efsa.2016.EN-1118 Arsevska, 2018, Web monitoring of emerging animal infectious diseases integrated in the French animal health epidemic intelligence system, PLoS One, 13, 10.1371/journal.pone.0199960 Lyon, 2013, Using AquaticHealth.net to detect emerging trends in aquatic animal health, Agriculture, 3, 299, 10.3390/agriculture3020299 Lyon, 2013, Using internet intelligence to manage biosecurity risks: a case study for aquatic animal health, Divers. Distrib., 19, 640, 10.1111/ddi.12057 Barboza, 2013, On behalf of the early alerting, reporting project of the Global Health security initiative, evaluation of epidemic intelligence systems integrated in the early alerting and reporting project for the detection of A/H5N1 influenza events, PLoS One, 8, 10.1371/journal.pone.0057252 Rotureau, 2007, International epidemic intelligence at the Institut de Veille Sanitaire, France, Emerg. Infect. Dis., 13, 1590, 10.3201/eid1310.070522 Baker, 2007, The new international health regulations: a revolutionary change in global health security, The New Zealand Med. J., 120, U2872 Valentin, 2020, PADI-web: a multilingual event-based surveillance system for monitoring animal infectious diseases, Comput. Electron. Agric., 169, 105163, 10.1016/j.compag.2019.105163 Valentin, 2020, Padi-web: An event-based surveillance system for detecting, classifying and processing online news, 87 Valentin, 2021, Monitoring online media reports for early detection of unknown diseases: insight from a retrospective study of COVID-19 emergence, Transbound. Emerg. Dis., 68, 981, 10.1111/tbed.13738 Arsevska, 2017, PADI-web: platform for automated extraction of animal disease information from the web, 241 Mantero, 2011 Steinberger, 2008, Text mining from the web for medical intelligence Carter, 2020 Mooney, 2005, Mining knowledge from text using information extraction, ACM SIGKDD, 7, 3, 10.1145/1089815.1089817 Guarino, 2009, What is an ontology?, 1 Chanlekha, 2010, A framework for enhancing spatial and temporal granularity in report-based health surveillance systems, BMC Med. Informat. Dec. Making, 10, 1, 10.1186/1472-6947-10-1 Amitay, 2004, Web-a-where: geotagging web content, 273 Lafferty, 2001, Conditional random fields: probabilistic models for segmenting and labeling sequence data, 282 Manning, 2014, 55 Bird, 2004, NLTK: the natural language toolkit, 214 Song, 2019, Named entity recognition based on conditional random fields, Clust. Comput., 22, 1, 10.1007/s10586-017-1146-3 Inkpen, 2017, Location detection and disambiguation from twitter messages, J. Intell. Inf. Syst., 49, 237, 10.1007/s10844-017-0458-3 Honnibal, 2017, spaCy 2: natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing Li, 2003, Info Xtract location normalization: A hybrid approach to geographic references in information extraction, 39 Martins, 2008, Extracting and exploring the geo-temporal semantics of textual resources, 1 Arsevska, 2016, Identification of terms for detecting early signals of emerging infectious disease outbreaks on the web, Comput. Electron. Agric., 123, 104, 10.1016/j.compag.2016.02.010 Richardson, 2007, Beautiful soup documentation M. Research Conway, 2009, Classifying disease outbreak reports using N-grams and semantic features, Int. J. Med. Inform., 78, e47, 10.1016/j.ijmedinf.2009.03.010 Doan, 2007, The role of roles in classifying annotated biomedical text, 17 Torii, 2011, An exploratory study of a text classification framework for internet-based surveillance of emerging epidemics, Int. J. Med. Inform., 80, 56, 10.1016/j.ijmedinf.2010.10.015 Zhang, 2009, Automatic online news monitoring and classification for syndromic surveillance, Decis. Support. Syst., 47, 508, 10.1016/j.dss.2009.04.016 Valentin, 2019, Annotation of epidemiological information in animal disease-related news articles: guidelines and manually labelled corpus Rabatel, 2019, PADI-web corpus: labeled textual data in animal health domain, Data in Brief, 22, 643, 10.1016/j.dib.2018.12.063 Ahlers, 2013, Assessment of the accuracy of geonames gazetteer data, 74 Lossio-Ventura, 2016, Biomedical term extraction: overview and a new methodology, Informat. Ret. J., 19, 59, 10.1007/s10791-015-9262-2 Levenshtein, 1966, 10, 707 Lin, 1998, An information-theoretic definition of similarity, 296 Uysal, 2014, The impact of preprocessing on text classification, Inf. Process. Manag., 50, 104, 10.1016/j.ipm.2013.08.006 Salton, 1988, Term-weighting approaches in automatic text retrieval, Inf. Process. Manag., 24, 513, 10.1016/0306-4573(88)90021-0 Valentin, 2020 Velasco, 2014, Social media and internet-based data in global systems for public health surveillance: a systematic review, The Milbank Quart., 92, 7, 10.1111/1468-0009.12038 Cui, 2019, Regular expression based medical text classification using constructive heuristic approach, IEEE Access, 7, 147892, 10.1109/ACCESS.2019.2946622