Linear input network for neural network automata model adaptation

F. Mana, R. Gemello
Loquendo S.p.A., Via Nole, Torino, Italy

Abstract

The paper describes an experimental investigation of linear input networks (LIN) as a channel and noise adaptation technique for the Loquendo neural-network-based speech recognizer in a car environment. The application considered is an automated call center that provides traffic information through a voice dialogue system. The call center is reached through a commercial in-car device consisting of a microphone placed in front of the driver and equipped with an echo canceller and built-in noise reduction; the connection is set up over a GSM link. In the experiments, the LIN technique is used to adapt the general neural network speech recognizer to this new environment. Some variants aimed at reducing the number of estimated parameters are also introduced. The LIN technique is also compared with classical denoising techniques based on noise spectral subtraction. The results confirm the validity of LIN for channel and noise adaptation, while the introduced variants are a valid alternative when a reduced model size is important. The best performance in this application, a 57.14% error reduction relative to the general acoustic models, was achieved by the joint use of a LIN and noise spectral subtraction.
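The general idea of a linear input network can be illustrated with a minimal NumPy sketch. The abstract does not give the network sizes, the training procedure, or the reduced-parameter variants, so everything below is an assumption made for illustration: `FrozenMLP` is a small random stand-in for the general acoustic model (not the Loquendo recognizer), `LinearInputNetwork`, `adapt` and `run_demo` are hypothetical names, and the "car" data is a synthetic gain-plus-offset distortion. The sketch shows only the core mechanism: a linear layer initialized to the identity is placed in front of a network whose weights stay fixed, and only the linear layer is trained on the adaptation data.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(probs, targets):
    return -np.mean(np.log(probs[np.arange(len(targets)), targets] + 1e-12))

class FrozenMLP:
    """Hypothetical stand-in for the general acoustic model; never updated."""
    def __init__(self, n_in, n_hid, n_out, rng):
        self.W1 = rng.normal(0.0, 0.3, (n_in, n_hid))
        self.b1 = np.zeros(n_hid)
        self.W2 = rng.normal(0.0, 0.3, (n_hid, n_out))
        self.b2 = np.zeros(n_out)

    def forward(self, x):
        h = np.tanh(x @ self.W1 + self.b1)
        return h, softmax(h @ self.W2 + self.b2)

    def input_grad(self, h, probs, targets):
        # Gradient of the mean cross-entropy with respect to the network input.
        n = probs.shape[0]
        dz2 = probs.copy()
        dz2[np.arange(n), targets] -= 1.0
        dz2 /= n
        dz1 = (dz2 @ self.W2.T) * (1.0 - h ** 2)
        return dz1 @ self.W1.T

class LinearInputNetwork:
    """Trainable linear layer y = x A + c, initialized to the identity."""
    def __init__(self, n_in):
        self.A = np.eye(n_in)
        self.c = np.zeros(n_in)

    def forward(self, x):
        return x @ self.A + self.c

def adapt(lin, net, x, targets, lr=0.005, epochs=400):
    # Plain gradient descent on the LIN parameters only; net is left untouched.
    for _ in range(epochs):
        y = lin.forward(x)
        h, probs = net.forward(y)
        dy = net.input_grad(h, probs, targets)
        lin.A -= lr * (x.T @ dy)
        lin.c -= lr * dy.sum(axis=0)
    return lin

def run_demo(seed=0):
    rng = np.random.default_rng(seed)
    net = FrozenMLP(8, 16, 4, rng)
    x_clean = rng.normal(size=(200, 8))
    targets = net.forward(x_clean)[1].argmax(axis=1)  # labels from clean data
    x_car = 1.3 * x_clean + 0.5  # assumed synthetic channel gain and offset
    lin = LinearInputNetwork(8)
    before = cross_entropy(net.forward(lin.forward(x_car))[1], targets)
    adapt(lin, net, x_car, targets)
    after = cross_entropy(net.forward(lin.forward(x_car))[1], targets)
    return before, after

if __name__ == "__main__":
    before, after = run_demo()
    print(f"cross-entropy before adaptation: {before:.4f}, after: {after:.4f}")
```

Because the layer starts as the identity, the adapted system initially behaves exactly like the unadapted one, and the number of adaptation parameters depends only on the input feature dimension rather than on the size of the recognizer.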

Keywords

#Neural networks #Automata #Adaptation model #Noise reduction #Working environment noise #Acoustic noise #Speech recognition #Speech enhancement #Automatic speech recognition #Telecommunication traffic
