A comparative study of model-based adaptation techniques for a compact speech recognizer

IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01. - Trang 29-32

F. Thiele¹, R. Bippus¹

¹Philips Research Laboratories, Aachen, Germany

Tóm tắt

Many techniques for speaker adaptation have been successfully applied to automatic speech recognition. This paper compares the performance of several adaptation methods with respect to their memory need and processing demand. For adaptation of a compact acoustic model with 4k densities, eigenvoices and structural MAP (SMAP) are investigated next to the well-known techniques of MAP (maximum a posteriori) and MLLR (maximum likelihood linear regression) adaptation. Experimental results are reported for unsupervised on-line adaptation on different amounts of adaptation data ranging from 4 to 500 words per speaker. The results show that for small amounts of adaptation data it might be more efficient to employ a larger baseline acoustic model without adaptation. Eigenvoices achieve the lowest word error rates of all adaptation techniques but SMAP presents a good compromise between memory requirement and accuracy.

Từ khóa

#Adaptation model #Speech recognition #Loudspeakers #Automatic speech recognition #Maximum likelihood linear regression #Laboratories #Error analysis #Command and control systems #Degradation #Regression tree analysis

Tài liệu tham khảo

huang, 1991, Im-proved Acoustic Modeling with the SPHINX Speech Recognition System, Proc ICASSP, 1, 345 gao, 1997, Speaker Adaptation Based on Pre-Clustering Training Speakers, 5th Eurospeech, 4, 2091, 10.21437/Eurospeech.1997-553 10.1109/ICASSP.2001.940840 10.1109/89.279278 10.1109/89.906001 yamaguchi, 1994, Speaker-Consistent Parsing for Speaker-Independent Continuous Speech Recognition, Proc ICSLP, 2, 791 siohan, 2000, Structural Maximum A Posteriori Linear Regression for Fast HMM Adaptation, Proc ISCA ITRW ASR2000 Automatic Speech Recognition Challenges for the Next Millenium, 120 10.1109/ICASSP.1999.759781 10.1006/csla.1995.0010 10.1109/ICASSP.1999.759778 kuhn, 1998, Eigenvoices for speaker adaptation, Proc ICSLP, 5, 1771 10.1109/89.650310

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Về chúng tôi

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích các bài báo, công bố khoa học Việt Nam. Công cụ trợ giúp người nghiên cứu, tạp chí, đơn vị nghiên cứu tra cứu, phân tích và thống kê dữ liệu nghiên cứu khoa học tại Việt Nam và quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia vào Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Hệ thống CSDL Khoa học & Công nghệ

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA