An improved union model for continuous speech recognition with partial duration corruption
Abstract
The probabilistic union model is improved for continuous speech recognition involving partial duration corruption, assuming no knowledge about the corrupting noise. The new developments include: an n-best rescoring strategy for union-based continuous speech recognition; a dynamic segmentation algorithm for reducing the number of corrupted segments in the union model; and a combination of the union model with conventional noise-reduction techniques to handle mixtures of stationary noise (e.g. car noise) and random, abrupt noise (e.g. a car horn). The proposed system has been tested on connected-digit recognition under various types of noise with unknown, time-varying characteristics. The results show significant robustness for the new model.
Keywords
#Speech recognition #Acoustic noise #Noise reduction #Speech enhancement #Signal to noise ratio #Time varying systems #Redundancy #Computer science #Heuristic algorithms #System testing