Structured language modeling

Computer Speech & Language - Tập 14 - Trang 283-332 - 2000
Ciprian Chelba1, Frederick Jelinek1
1Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD, U.S.A.

Tài liệu tham khảo

Abney, 1999, Relating probabilistic grammars and automata Bellegarda, 1997, A latent semantic analysis framework for large-span language modeling Berger, 1996, A maximum entropy approach to natural language processing, Computational Linguistics, 22, 39 Brown, 1997, Class-based n -gram models of natural language, Computational Linguistics, 18, 467 W. Byrne, A. Gunawardana, S. Khudanpur, 1998, Department of Electical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, U.S.A Charniak, 1997, Statistical parsing with a context-free grammar and word statistics Chelba, 1997, A structured language model Chelba, 1997, Structure and performance of a dependency language model Chelba, 1998, Exploiting syntactic structure for language modeling CLSP, 1997, WS97 Collins, 1996, A new statistical parser based on bigram lexical dependencies S. Della Pietra, V. Della Pietra, J. Gillet, J. Lafferty, H. Printz, L. Ures, 1994, School of Computer Science, Carnegie Mellon University, Pittsburg, PA, U.S.A Dempster, 1977, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, 39B, 1 Doug, 1992, The design for the Wall Street Journal-based CSR corpus Godfrey, 1992, SWITCHBOARD telephone speech corpus for research and development Grenadier, 1996, Constrained stochastic language models, 131 Haegeman, 1994 Jelinek, 1998 Jelinek, 1999 Jelinek, 1980, pp. 381 Jelinek, 1994, Decision tree parsing using a hidden derivational model Jelinek, 1977, Perplexity—a measure of difficulty of speech recognition tasks, Journal of the Acoustic Society of America, 62, S63, Supplement 1 Jelinek, 1980 Marcus, 1995, Building a large annotated corpus of English: the Penn Treebank, Computational Linguistics, 19, 313 Ney, 1994, Large vocabulary continuous speech recognition of Wall Street Journal data Nilsson, 1971 Ratnaparkhi, 1997, A linear observed time statistical parser based on maximum entropy models R. Rosenfeld, 1994 Saul, 1997, Aggregate and mixed-order markov models for statistical language processing Viterbi, 1967, Error bounds for convolutional codes and an asymmetrically optimum decoding algorithm, IEEE Transactions on Information Theory, IT-13, 260, 10.1109/TIT.1967.1054010 Wu, 1999, Combining nonlocal, syntactic and n -gram dependencies in language modeling