A unified language model for large vocabulary continuous speech recognition of Turkish

Signal Processing - Tập 86 - Trang 2844-2862 - 2006
Ebru Arısoy1, Helin Dutağacı1, Levent M. Arslan1
1Electrical and Electronic Engineering Department, Boğaziçi University, 34342 Bebek, İstanbul, Turkey

Tài liệu tham khảo

Oflazer, 1994, Turkish natural language processing initiative Oflazer, 1994, Spelling correction in agglutinative languages, 194 Oflazer, 1994, Two-level description of Turkish morphology, Literary Linguistic Comput., 9, 137, 10.1093/llc/9.2.137 Oflazer, 1996, Error-tolerant finite-state recognition with applications to morphological analysis and spelling correction, Comput. Linguistics, 22, 73 Hakkani-Tür, 2000, Statistical morphological disambiguation for agglutinative languages, vol. 1, 285 Kwon, 2003, Korean large vocabulary continuous speech recognition with morpheme-based recognition units, Speech Commun., 39, 287, 10.1016/S0167-6393(02)00031-6 Siivola, 2003, Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner, 2293 Kneissler, 2001, Speech recognition for huge vocabularies by using optimized sub-word units, 69 Arslan, 1999, Türkçe sürekli konuşma tanıma sisteminin sayı tanıma uygulaması, 64 Hacioglu, 2003, On lexicon creation for Turkish LVCSR, 1165 Carkı, 2000, Turkish LVCSR: Towards better speech recognition for agglutinative languages, 1563 Byrne, 2001, On large vocabulary continuous speech recognition of highly inflectional language-Czech, 487 Kanevsky, 1998, Statistical language model for inflected languages, US Patent No. 5,835,888,1998. Mengusoglu, 2001, Turkish LVCSR Fromkin, 2003 E. Erguvanlı, The function of word order in Turkish grammar, Ph.D. Thesis, University of California, Los Angeles, USA, 1979. O. Cetinoğlu, Prolog based natural language processing infrastructure for Turkish, M.Sc. Thesis, Boğaziçi University, İstanbul, Turkey, 2000. H. Dutağacı, Statistical language models for large vocabulary speech recognition, Turkish M.Sc. Thesis, Boğaziçi University, İstanbul, Turkey, 2002. Carter, 1996, Handling compound nouns in a Swedish speech-understanding system, 26 Ando, 2000, Mostly unsupervised statistical segmentation of Japanese, 241 Jurafsky, 2000 McCrum, 1992 Sproat, 1992 Clarkson, 1997, Statistical language modeling using the CMU-Cambridge toolkit, 2707 S. Young, D. Ollason, V. Valtchev, P. Woodland, The HTK Book (for HTK version 3.2), Entropic Cambridge Research Laboratory, 2002. J. Garofolo, C. Auzanne, E. Voorhees, The TREC spoken document retrieval track: A success story, in: NIST Special Publication 500-246: the Eighth Text Retrieval Conference (TREC 8), Gaithersburg, MD, 1999, pp. 107–130. Choi, 1998, SCAN—Speech content based audio navigator, 2867 Tuerk, 2000, The Cambridge University multimedia document retrieval demo system, 394 Abberley, 1999, The THISL broadcast news retrieval system, 19 Jelinek, 1997