Recognition of negative emotions from the speech signal

C.M. Lee1, S. Narayanan1, R. Pieraccini2
1Department of Electrical Engineering and IMSC, University of Southern California, Los Angeles, CA, USA
2Speechworks International, Inc., NY, USA

Tóm tắt

This paper reports on methods for automatic classification of spoken utterances based on the emotional state of the speaker. The data set used for the analysis comes from a corpus of human-machine dialogues recorded from a commercial application deployed by SpeechWorks. Linear discriminant classification with Gaussian class-conditional probability distribution and k-nearest neighbors methods are used to classify utterances into two basic emotion states, negative and non-negative The features used by the classifiers are utterance-level statistics of the fundamental frequency and energy of the speech signal. To improve classification performance, two specific feature selection methods are used; namely, promising first selection and forward feature selection. Principal component analysis is used to reduce the dimensionality of the features while maximizing classification accuracy. Improvements obtained by feature selection and PCA are reported. We also report the results.

Từ khóa

#Emotion recognition #Speech recognition #Principal component analysis #Automatic speech recognition #Speech analysis #Man machine systems #Linear discriminant analysis #Probability distribution #Statistical distributions #Frequency

Tài liệu tham khảo

petrushin, 1999, Emotion in Speech: Recognition and Application to Call Centers, Artif Neu Net Engr (ANNIE), 7 scherer, 2000, A Cross-Cultural Investigation of Emotion Inferences from Voice and Speech: Implications for Speech Technology, Proc ICSLP Beijing China 2000 10.1109/ICSLP.1996.608022 batliner, 0, Desperately Seeking Emotions: Actors, Wizards, and Human Beings, Proceedings of the ISCA Workshop on Speech and Emotion mcgilloway, 2000, Approaching Automatic Recognition of Emotion from Voice: A Rough Benchmark, ISCA Workshop on Speech and Emotion 10.1109/AFGR.1996.557292 10.1017/CBO9780511571299 duda, 1999, Pattern Classification cowie, 2001, Emotion Recoginition in Human-Computer Interaction, IEEE Signal Proc Mag, 18, 32, 10.1109/79.911197