Topic detection using paragraph vectors to support active learning in systematic reviews

Journal of Biomedical Informatics - Tập 62 - Trang 59-65 - 2016
Kazuma Hashimoto1, Georgios Kontonatsios2, Makoto Miwa3, Sophia Ananiadou2
1Graduate School of Engineering, University of Tokyo, Tokyo, Japan
2School of Computer Science, National Centre for Text Mining, University of Manchester, Manchester, United Kingdom
3Department of Advanced Science and Technology, Toyota Technological Institute, Nagoya, Japan

Tóm tắt

Từ khóa


Tài liệu tham khảo

Gough, 2012

Chalmers, 2002, A brief history of research synthesis, Eval. Health Prof., 25, 12, 10.1177/0163278702025001003

O’Mara-Eves, 2015, Using text mining for study identification in systematic reviews: a systematic review of current approaches, Syst. Rev., 4, 5, 10.1186/2046-4053-4-5

Miwa, 2014, Reducing systematic review workload through certainty-based screening, J. Biomed. Inform., 51, 242, 10.1016/j.jbi.2014.06.005

Wallace, 2010, Semi-automated screening of biomedical citations for systematic reviews, BMC Bioinform., 11, 55, 10.1186/1471-2105-11-55

Cohen, 2006, Reducing workload in systematic review preparation using automated citation classification, J. Am. Med. Inform. Assoc., 13, 206, 10.1197/jamia.M1929

Beahler, 2000, Information retrieval in systematic reviews: challenges in the public health arena, Am. J. Prev. Med., 18, 6, 10.1016/S0749-3797(00)00135-5

Blei, 2003, Latent Dirichlet allocation, J. Mach. Learn. Res., 3, 993

Hofmann, 1999, Probabilistic latent semantic indexing, 50

Le, 2014, Distributed representations of sentences and documents, 1188

Mikolov, 2013, Distributed representations of words and phrases and their compositionality, 3111

Collobert, 2011, Natural language processing (almost) from scratch, J. Mach. Learn. Res., 12, 2493

Turian, 2010, Word representations: a simple and general method for semi-supervised learning, 384

Turney, 2010, From frequency to meaning: vector space models of semantics, J. Artif. Intell. Res., 37, 141, 10.1613/jair.2934

Dai, 2014, Document embedding with paragraph vectors

Wallach, 2006, Topic modeling: beyond bag-of-words, 977

Wang, 2007, Topical n-grams: phrase and topic discovery, with an application to information retrieval, 697

Dhillon, 2001, Efficient clustering of very large document collections, 357

Fu, 2011, Certainty-enhanced active learning for improving imbalanced data classification, 405

A.K. McCallum, Mallet: A Machine Learning for Language Toolkit. <http://mallet.cs.umass.edu>.

Fan, 2008, Liblinear: a library for large linear classification, J. Mach. Learn. Res., 9, 1871

Wallace, 2010, Modeling annotation time to reduce workload in comparative effectiveness reviews, 28

Yu, 2008, GAPscreener: an automatic tool for screening human genetic association literature in PubMed using the support vector machine technique, BMC Bioinform., 9, 205, 10.1186/1471-2105-9-205

US National Library of Medicine National Institutes of Health. <http://www.ncbi.nlm.nih.gov/> (accessed 2016-01-05).

Embase: Biomedical Answers. <http://www.embase.com/> (accessed 2016-01-05).

Fineout-Overholt, 2005, Transforming health care from the inside out: advancing evidence-based practice in the 21st century, J. Prof. Nurs., 21, 335, 10.1016/j.profnurs.2005.10.005