Induced lexico-syntactic patterns improve information extraction from online medical forums

Gupta, Sonal (1); MacLean, Diana L (1); Heer, Jeffrey (1); Manning, Christopher D (1)
(1) Department of Computer Science, Stanford University, Stanford, California, USA

Abstract

Keywords

