Searching for the missing piece [speech recognition]
Tóm tắt
The tree-trellis forward-backward algorithm has been widely used for N-best searching in continuous speech recognition. In conventional approaches, the heuristic score used for the A* backward search is derived from the partial-path scores recorded during the forward pass. The inherently delayed use of a language model in the lexical tree structure leads to inefficient pruning and the partial-path score recorded is an underestimated heuristic score. This paper presents a novel method of computing the heuristic score that is more accurate than the partial-path score. The goal is to recover high-score sentence hypotheses that may have been pruned halfway during the forward search due to the delayed use of the LM. For the application of Hong Kong stock information inquiries, the proposed technique shows a noticeable performance improvement. In particular, a relative error-rate reduction of 12% has been achieved for top-1 sentences.
Từ khóa
#Lattices #Delay estimation #Speech #Tree data structures #Viterbi algorithmTài liệu tham khảo
10.1109/ICASSP.1991.150436
choi, 2000, Lexical tree decoding with a class-based language model for Chinese speech recognition, Proc ICSLP 2000, 1, 174
wong, 2000, Large Vocabulary Continuous Speech Recognition for Cantonese
ström, 0, Continuous speech recognition in the WAXHOLM dialogue system, Quarterly Progress and Status Report Department of Speech Music and Hearing KTH No 4/1996, 67
10.1109/ICASSP.1991.150437
10.1006/csla.1994.1002