Structured language modeling
References
Abney, 1999, Relating probabilistic grammars and automata
Bellegarda, 1997, A latent semantic analysis framework for large-span language modeling
Berger, 1996, A maximum entropy approach to natural language processing, Computational Linguistics, 22, 39
Brown, 1992, Class-based n-gram models of natural language, Computational Linguistics, 18, 467
W. Byrne, A. Gunawardana, S. Khudanpur, 1998, Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, U.S.A
Charniak, 1997, Statistical parsing with a context-free grammar and word statistics
Chelba, 1997, A structured language model
Chelba, 1997, Structure and performance of a dependency language model
Chelba, 1998, Exploiting syntactic structure for language modeling
CLSP, 1997, WS97
Collins, 1996, A new statistical parser based on bigram lexical dependencies
S. Della Pietra, V. Della Pietra, J. Gillet, J. Lafferty, H. Printz, L. Ures, 1994, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, U.S.A
Dempster, 1977, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, 39B, 1
Paul, 1992, The design for the Wall Street Journal-based CSR corpus
Godfrey, 1992, SWITCHBOARD telephone speech corpus for research and development
Grenadier, 1996, Constrained stochastic language models, 131
Haegeman, 1994
Jelinek, 1998
Jelinek, 1999
Jelinek, 1980, pp. 381
Jelinek, 1994, Decision tree parsing using a hidden derivational model
Jelinek, 1977, Perplexity—a measure of difficulty of speech recognition tasks, Journal of the Acoustical Society of America, 62, S63, Supplement 1
Jelinek, 1980
Marcus, 1993, Building a large annotated corpus of English: the Penn Treebank, Computational Linguistics, 19, 313
Ney, 1994, Large vocabulary continuous speech recognition of Wall Street Journal data
Nilsson, 1971
Ratnaparkhi, 1997, A linear observed time statistical parser based on maximum entropy models
R. Rosenfeld, 1994
Saul, 1997, Aggregate and mixed-order Markov models for statistical language processing
Viterbi, 1967, Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Transactions on Information Theory, IT-13, 260, 10.1109/TIT.1967.1054010
Wu, 1999, Combining nonlocal, syntactic and n-gram dependencies in language modeling