A one-pass decoder based on polymorphic linguistic context assignment

H. Soltau1,2, F. Metze1,2, C. Fugen1,2, A. Waibel1,2
1Interactive Systems Laboratories, Carnegie Mellon University, USA
2Interactive Systems Laboratories, University of Karlsruhe, Germany

Tóm tắt

In this study, we examine how fast decoding of conversational speech with large vocabularies profits from efficient use of linguistic information, i.e. language models and grammars. Based on a re-entrant single pronunciation prefix tree, we use the concept of linguistic context polymorphism to allow an early incorporation of language model information. This approach allows us to use all available language model information in a one-pass decoder, using the same engine to decode with statistical n-gram language models as well as context free grammars or re-scoring of lattices in an efficient way. We compare this approach to our previous decoder, which needed three passes to incorporate all available information. The results on a very large vocabulary task show that the search can be speeded up by almost a factor of three, without introducing additional search errors.

Từ khóa

#Decoding #Context modeling #Automatic speech recognition #Vocabulary #Acoustic beams #Speech recognition #History #Interactive systems #Laboratories #Natural languages

Tài liệu tham khảo

alex, 2001, Advances in meeting recogniton, Proceedings of the First International Conference on Human Language Technology Conference aubert, 1999, One pass cross word decoding for large vocabularies based. on a lexical tree search organization, Proceedings of the Eurospeech ravishankar, 1996, Efficient Algorithms for Speech Recognition 10.1109/ICASSP.1996.543251 soltau, 2001, The ISL evaluation system for Verbmobil-IL, Proceedings of the ICASSP finke, 1999, Modeling and efficient decoding of large vocabulary conversational speech, Proc EU-ROSPEECH 10.1109/ICASSP.1996.540309 odell, 1995, The use of context in large vocabulary speech recognition ney, 1992, Improvements in beam search for 100000-word continuous speech recognition, Proceedings of the ICASSP 10.1109/ICASSP.1995.479666 10.1109/ICASSP.1998.675390 10.1109/ICASSP.1996.540308