An improved union model for continuous speech recognition with partial duration corruption

Ji Ming1
1School of Computer Science, Queen’s University Belfast, Belfast, UK

Tóm tắt

The probabilistic union model is improved for continuous speech recognition involving partial duration corruption, assuming no knowledge about the corrupting noise. The new developments include: an n-best rescoring strategy for union based continuous speech recognition; a dynamic segmentation algorithm for reducing the number of corrupted segments in the union model; a combination of the union model with conventional noise-reduction techniques to accommodate the mixtures of stationary noise (e.g. car) and random, abrupt noise (e.g. a car horn). The proposed system has been tested for connected-digit recognition, subjected to various types of noise with unknown, time-varying characteristics. The results have shown significant robustness for the new model.

Từ khóa

#Speech recognition #Acoustic noise #Noise reduction #Speech enhancement #Signal to noise ratio #Time varying systems #Redundancy #Computer science #Heuristic algorithms #System testing

Tài liệu tham khảo

vizinho, 0, Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: an integrated study, Eurospeech'99, 2407 drygajlo, 0, Speaker verification in noisy environment with combined spectral subtraction and missing data theory, ICASSP'98, 121 10.1016/S0167-6393(00)00045-5 seltzer, 0, Classifier-based mask estimate for missing feature method of robust speech recognition, ICSLP - 2000 0 ming, 0, A probabilistic union model for partial and temporal corruption of speech, IEEE ASRU Workshop, 43 lippmann, 0, Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise, Eurospeech 97, 37 10.1109/ICASSP.1997.596072