A one-pass decoder based on polymorphic linguistic context assignment

IEEE Workshop on Automatic Speech Recognition and Understanding, 2001 (ASRU '01), pp. 214-217

H. Soltau¹,², F. Metze¹,², C. Fugen¹,², A. Waibel¹,²

¹Interactive Systems Laboratories, Carnegie Mellon University, USA

²Interactive Systems Laboratories, University of Karlsruhe, Germany

Abstract

In this study, we examine how fast decoding of large-vocabulary conversational speech benefits from efficient use of linguistic information, i.e., language models and grammars. Based on a re-entrant single pronunciation prefix tree, we use the concept of linguistic context polymorphism to incorporate language model information early in the search. This approach allows us to use all available language model information in a one-pass decoder, and the same engine efficiently handles decoding with statistical n-gram language models, decoding with context-free grammars, and re-scoring of lattices. We compare this approach to our previous decoder, which needed three passes to incorporate all available information. Results on a very large vocabulary task show that the search can be sped up by almost a factor of three without introducing additional search errors.
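The abstract gives no implementation details, so the following Python sketch only illustrates the general idea and should not be read as the authors' decoder. It assumes a toy lexicon, a hypothetical bigram table, and a simple language model look-ahead rule (the best score of any word still reachable from a tree node). It shows how a single shared pronunciation prefix tree can be queried under several linguistic contexts without copying the tree for each context.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node of a pronunciation prefix tree (illustrative structure)."""
    children: dict = field(default_factory=dict)  # phone -> Node
    words: list = field(default_factory=list)     # words ending exactly here
    reachable: set = field(default_factory=set)   # words at or below this node


def build_tree(lexicon):
    """Build one prefix tree in which words share common phone prefixes."""
    root = Node()
    for word, phones in lexicon.items():
        node = root
        node.reachable.add(word)
        for phone in phones:
            node = node.children.setdefault(phone, Node())
            node.reachable.add(word)
        node.words.append(word)
    return root


def count_nodes(node):
    """Count all nodes in the tree, including the root."""
    return 1 + sum(count_nodes(child) for child in node.children.values())


def lookahead(node, context, bigram, floor=-10.0):
    """Best LM log-probability of any word reachable from `node`,
    given the linguistic context (here: just the previous word)."""
    return max(bigram.get((context, word), floor) for word in node.reachable)


# Hypothetical toy data, not taken from the paper.
LEXICON = {
    "speech": ["S", "P", "IY", "CH"],
    "speed": ["S", "P", "IY", "D"],
    "spin": ["S", "P", "IH", "N"],
}
BIGRAM = {
    ("fast", "speech"): math.log(0.2),
    ("fast", "speed"): math.log(0.5),
    ("fast", "spin"): math.log(0.3),
    ("conversational", "speech"): math.log(0.9),
    ("conversational", "speed"): math.log(0.05),
    ("conversational", "spin"): math.log(0.05),
}

root = build_tree(LEXICON)

# The node reached after the shared prefix S-P-IY exists only once.
node = root.children["S"].children["P"].children["IY"]

# The same node carries a different LM score for each linguistic context,
# so language model information applies before a word end is reached.
scores = {ctx: lookahead(node, ctx, BIGRAM)
          for ctx in ("fast", "conversational")}
```

In this sketch the per-context scores live in a dictionary keyed by context while the tree itself is built once. A real decoder would also track acoustic scores, pruning beams, and longer n-gram histories, all of which are omitted here.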

Keywords

#Decoding #Context modeling #Automatic speech recognition #Vocabulary #Acoustic beams #Speech recognition #History #Interactive systems #Laboratories #Natural languages

