Advances in natural language processing

American Association for the Advancement of Science (AAAS) - Tập 349 Số 6245 - Trang 261-266 - 2015
Julia Hirschberg1, Christopher D. Manning2,3
1Department of Computer Science, Columbia University, New York, NY 10027, USA
2Department of Computer Science, Stanford University, Stanford, CA 94305-9020, USA.
3Department of Linguistics, Stanford University, Stanford, CA 94305-2150, USA

Tóm tắt

Natural language processing employs computational techniques for the purpose of learning, understanding, and producing human language content. Early computational approaches to language research focused on automating the analysis of the linguistic structure of language and developing basic technologies such as machine translation, speech recognition, and speech synthesis. Today’s researchers refine and make use of such tools in real-world applications, creating spoken dialogue systems and speech-to-speech translation engines, mining social media for information about health or finance, and identifying sentiment and emotion toward products and services. We describe successes and challenges in this rapidly advancing area.

Từ khóa


Tài liệu tham khảo

C. D. Manning M. Surdeanu J. Bauer J. Finkel S. J. Bethard D. McClosky “The Stanford CoreNLP Natural Language Processing Toolkit ” in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics System Demonstrations (Association for Computational Linguistics Stroudsburg PA 2014) pp. 55–60.

Linguistic Data Consortium www.ldc.upenn.edu/.

CoNLL Shared Tasks http://ifarm.nl/signll/conll/.

Kaggle www.kaggle.com.

Brown P. F., Della Pietra S. A., Della Pietra V. J., Mercer R. L., The mathematics of statistical machine translation: Parameter estimation. Comput. Linguist. 19, 263–311 (1993).

P. Koehn F. J. Och D. Marcu “Statistical phrase-based translation ” in Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (Association for Computational Linguistics Stroudsburg PA 2003) pp. 48–54.

D. Chiang “A hierarchical phrase-based model for statistical machine translation ” Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics (Association for Computational Linguistics Stroudsburg PA 2005) pp. 263–270.

M. Galley M. Hopkins K. Knight D. Marcu “What’s in a translation rule?” in Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL 2004) (Association for Computational Linguistics Stroudsburg PA 2004).

B. Jones J. Andreas D. Bauer K. M. Hermann K. Knight “Semantics-based machine translation with hyperedge replacement grammars ” in Proceedings of COLING 2012 (Technical Papers The COLING 2012 Organizing Committee Mumbai India 2012) pp. 1359–1376.

I. Sutskever O. Vinyals Q. V. Le “Sequence to sequence learning with neural networks ” in Advances in Neural Information Processing Systems 27 (NIPS 2014) Z. Ghahramani M. Welling C. Cortes N. D. Lawrence K. Q. Weinberger Eds. (Curran Associates Red Hook NY 2014) pp. 3104–3112.

D. Bahdanau K. Cho Y. Bengio “Neural machine translation by jointly learning to align and translate ” http://arxiv.org/abs/1409.0473 (2015).

M.-T. Luong I. Sutskever Q. V. Le O. Vinyals W. Zaremba “Addressing the rare word problem in neural machine translation ” http://arxiv.org/abs/1410.8206 (2015).

S. Jean K. Cho R. Memisevic Y. Bengio “On using very large target vocabulary for neural machine translation ” http://arxiv.org/abs/1412.2007 (2015).

S. Stymne C. Hardmeier J. Tiedemann J. Nivre “Feature weight optimization for discourse-level SMT ” in Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT) (Association for Computational Linguistics Stroudsburg PA 2013) pp. 60–69.

S. Green J. Chuang J. Heer C. D. Manning “Predictive translation memory: A mixed-initiative system for human language translation ” in Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology Honolulu HI 5 to 8 October 2014 (Association for Computing Machinery New York 2014) pp. 177–187.

S. Rosenthal J. Biswas M. Veloso “An effective personal mobile robot agent through symbiotic human-robot interaction ” in Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010) Toronto Canada 10 to 14 May 2010 (International Foundation for Autonomous Agents and Multiagent Systems Richland SC 2010) pp. 915–922.

10.5898/JHRI.2.2.Fasola

M. Core H. C. Lane D. Traum “Intelligent tutoring support for learners interacting with virtual humans ” in Design Recommendations for Intelligent Tutoring Systems (U.S. Army Research Laboratory Orlando FL 2014) vol. 2 pp. 249–257.

D. DeVault R. Artstein G. Benn T. Dey E. Fast A. Gainer K. Georgila J. Gratch A. Hartholt M. Lhommet G. Lucas S. Marsella F. Morbini A. Nazarian S. Scherer G. Stratou A. Suri D. Traum R. Wood Y. Xu A. Rizzo L.-P. Morency “SimSensei Kiosk: A virtual human interviewer for healthcare decision support ” in Proceedings of the 13th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2014) Paris France 5 to 9 May 2014 (International Foundation for Autonomous Agents and Multiagent Systems Richland SC 2014) pp. 1061–1068; http://aamas2014.lip6.fr/proceedings/aamas/p1061.pdf.

10.1109/MSP.2012.2205597

10.1145/365153.365168

Y. Nonaka Y. Sakai K. Yasuda Y. Nakano “Towards assessing the communication responsiveness of people with dementia ” in 12th International Conference on Intelligent Virtual Agents (IVA'12) (Springer Berlin 2012) pp. 496–498.

10.1006/ijhc.1995.1042

H. Giles A. Mulac J. J. Bradac P. Johnson “Speech accommodation theory: The next decade and beyond ” in Communication Yearbook (Sage Newbury Park CA 1987) vol. 10 pp. 13–48.

10.1109/JPROC.2012.2225812

Wikipedia www.wikipedia.org/.

10.1016/j.molcel.2006.02.012

A. Culotta J. Sorensen “Dependency tree kernels for relation extraction ” in Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (Association for Computational Linguistics Stroudsburg PA 2004) pp. 423–429.

10.1093/bioinformatics/btl616

10.1111/j.1467-8640.2011.00399.x

10.1371/journal.pone.0055814

10.1038/75556

PaleoBiology Database https://paleobiodb.org/.

10.1016/j.jbi.2012.08.001

10.1006/ijhc.1995.1042

Freebase www.freebase.com/.

dbpedia http://dbpedia.org/.

Wikidata www.wikidata.org/.

M. Mintz S. Bills R. Snow D. Jurafsky “Distant supervision for relation extraction without labeled data ” in Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (Association for Computational Linguistics Stroudsburg PA 2009) vol. 2 pp. 1003–1011.

M. Surdeanu J. Tibshirani R. Nallapati C. D. Manning “Multi-instance multi-label learning for relation extraction ” in Proceedings of the 2012 Conference on Empirical Methods in Natural Language Processing and Natural Language Learning (EMNLP-CoNLL) Jeju Island South Korea 12 to 14 July 2012 (Association for Computational Linguistics Stroudsburg PA 2012) pp. 455–465.

B. Min R. Grishman L. Wan C. Wang D. Gondek “Distant supervision for relation extraction with an incomplete knowledge base ” in Proceedings of NAACL-HLT 2013 Atlanta GA 9 to 14 June 2013 (Association for Computational Linguistics Stroudsburg PA 2013) pp. 777–782.

DeepDive http://deepdive.stanford.edu/.

10.1371/journal.pone.0113523

E. Etzioni M. Banko M. J. Cafarella “Machine reading ” in Proceedings of the 21st National Conference on Artificial Intelligence (AAAI 2006) Boston MA 16 to 20 July 2006 (AAAI Press Menlo Park CA 2006) vol. 2 pp. 1517–1519.

M. Banko M. J. Cafarella S. Soderland M. Broadhead O. Etzioni “Open information extraction from the web ” in Proceedings of the 20th International Joint Conference on Artifical Intelligence (IJCAI 2007) (Morgan Kaufmann San Francisco 2007) pp. 2670–2676.

O. Etzioni A. Fader J. Christensen S. Soderland Mausam “Open information extraction: The second generation ” in Proceedings of the 22nd International Joint Conference on Artificial Intelligence Barcelona Spain 16 to 22 July 2011 (AAAI Press Menlo Park CA 2011) pp. 3–10.

S. Riedel L. Yao A. McCallum B. M. Marlin “Relation extraction with matrix factorization and universal schemas ” in Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics (HLT NAACL 2013) (Stroudsburg PA 2013) pp. 74–84.

G. Angeli C. D. Manning “NaturalLI: Natural logic inference for common sense reasoning ” in Proceedings of the 2014 Conference on Emprical Methods in Natural Language Processing Doha Qatar 25 to 29 October 2014 (Association for Computational Linguistics Stroudsburg PA 2014) pp. 534–545.

J. Berant V. Srikumar P.-C. Chen A. Vander Linden B. Harding B. Huang P. Clark C. D. Manning “Modeling biological processes for reading comprehension ” in Proceedings of the 2014 Conference on Emprical Methods in Natural Language Processing Doha Qatar 25 to 29 October 2014 (Association for Computational Linguistics Stroudsburg PA 2014) pp. 1499–1510.

A. Fader L. Zettlemoyer O. Etzioni “Open question answering over curated and extracted knowledge bases ” in Proceedings of the Conference on Knowledge Discovery and Data Mining (KDD) (Association for Computing Machinery New York 2014) pp. 1156–1165.

M. A. Russell Mining the Social Web: Data Mining Facebook Twitter LinkedIn Google+ GitHub and More (O’Reilly Media Sebastopol CA ed. 2 2013).

N. Elhadad L. Gravano D. Hsu S. Balter V. Reddy H. Waechter “Information extraction from social media for public health ” in KDD at Bloomberg Workshop Data Frameworks Track (KDD 2014) (Association for Computing Machinery New York 2014).

M. Ott C. Cardie J. T. Hancock “Estimating the prevalence of deception in online review communities.” in Proceedings of the 21st International Conference on World Wide Web Conference Lyon France 16 to 20 April 2012 (Association for Computing Machinery New York 2012) pp. 201–210.

J. Liscombe thesis Columbia University (2007).

10.1007/s10579-005-7880-9

C. Whissell “The dictionary of affect in language ” in Emotion: Theory Research and Experience R. Plutchik H. Kellerman Eds. (Academic Press London 1989).

10.1177/0261927X09351676

10.1109/TASL.2010.2041113

B. Pang L. Lee S. Vaithyanathan “Thumbs up? Sentiment classification using machine learning techniques ” in Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing Philadelphia PA July 2002 (Association for Computational Linguistics Stroudsburg PA 2002) vol. 10 pp. 79–86.

H. Wang M. Ester “A sentiment-aligned topic model for product aspect rating prediction ” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing Doha Qatar 25 to 29 October 2014 (Association for Computational Linguistics Stroudsburg PA 2014) pp. 1192–1202.

M. Thomas Bo Pang L. Lee “Get out the vote: Determining support or opposition from Congressional floor-debate transcripts ” in Proceedings of the 2006 Conference on Emprical Methods in Natural Language Processing Sydney Australia 22 to 23 July 2006 (Association for Computational Linguistics Stroudsburg PA 2006) pp. 327–335.

J. Bollen H. Mao A. Pepe “Modeling public mood and emotion: Twitter sentiment and socio-economic phenomena ” Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media Barcelona Spain 17 to 21 July 2011 (AAAI Press Menlo Park 2011) pp. 450–453.

R. Gonzalez-Ibanez S. Muresan N. Wacholder “Identifying sarcasm in Twitter: A closer look ” in Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics Portland Oregon 19 to 24 June 2011 (Association for Computational Linguistics Stroudsburg PA 2011) pp. 581–586.

O. Biran S. Rosenthal J. Andreas K. McKeown O. Rambow “Detecting influencers in written online conversations ” in Proceedings of the 2012 Workshop on Language in Social Media Montreal Canada 7 June 2012 (Association for Computational Linguistics Stroudsburg PA 2012) pp. 37–45.

L.-C. Yu C.-Y. Ho “Identifying emotion labels from psychiatric social texts using independent component analysis ” in Proceedings of COLING 2014 (Technical Papers Association for Computational Linguistics Stroudsburg PA 2014) pp. 837–847.

10.1017/S0952675706000765

10.1016/j.cognition.2007.05.006

N. D. Goodman D. Lassiter “Probabilistic semantics and pragmatics: Uncertainty in language and thought ” in Handbook of Contemporary Semantics C. Fox S. Lappin Eds. (Blackwell Hoboken NJ ed. 2 2015).