Recognition experiments with the SpeechDat-Car Aurora Spanish database using 8 kHz- and 16 kHz-sampled signals

C. Nadeu¹, M. Tolos¹
¹TALP Research Center, Universitat Politècnica de Catalunya, Barcelona, Spain

Abstract

Like the other SpeechDat-Car databases, the Spanish one was collected at a 16 kHz sampling frequency, with several microphone positions and under several environmental noise conditions. We aim to clarify whether processing the 16 kHz-sampled signals, instead of the usual 8 kHz-sampled ones, offers any advantage in recognition performance. Recognition tests were carried out within the Aurora experimental framework, which includes signals from both a close-talking microphone and a distant microphone. Our preliminary results indicate that the increased bandwidth can yield a performance improvement in the noisy car environment.

Keywords

#Databases #Microphones #Frequency #Working environment noise #Testing #Sampling methods #Bandwidth #Speech recognition #Telecommunication standards #Standards development
