Primitives-based evaluation and estimation of emotions in speech

Speech Communication - Tập 49 Số 10-11 - Trang 787-800 - 2007

Michael Grimm¹, Kristian Kroschel¹, Emily Mower Provost², Shrikanth Narayanan²

¹Institut für Nachrichtentechnik (INT) (Allemagne)

²USC - University of Southern California (Los Angeles, CA, 90089-0484, USA - États-Unis)

Tóm tắt

Từ khóa

Tài liệu tham khảo

Banse, 1996, Acoustic profiles in vocal emotion expression, J. Personality Social Psychol., 70, 614, 10.1037/0022-3514.70.3.614

Batliner, A., Fischer, K., Huber, R., Spilker, J., Nöth, E., 2000. Desperately seeking emotions or: Actors, wizards, and human beings. In: Proc. ICASSP, pp. 195–200.

Bulut, M., Narayanan, S., Syrdal, A., 2002. Expressive speech synthesis using a concatenative synthesizer. In: Proc. ICSLP, Denver, CO.

Carletta, 1996, Assessing agreement on classification tasks: the kappa statistic, Comput. Linguist., 22, 249

Cowie, 2003, Describing the emotional states that are expressed in speech, Speech Communication, 40, 5, 10.1016/S0167-6393(02)00071-7

Cowie, R., Douglas-Cowie, E., Savvidou, S, McMahon, E., Sawey, M., Schröder, M., 2000. ‘FEELTRACE’: an instrument for recording perceived emotion in real time. In: Douglas-Cowie, E., Cowie, R., Schröder, M. (Eds.), Proc. ISCA Workshop on Speech and Emotion: A Conceptual Framework for Research, Textflow, Belfast, pp. 19–24.

Cowie, 2001, Emotion recognition in human–computer interaction, IEEE Signal Process. Mag., 18, 32, 10.1109/79.911197

Dellaert, F., Polzin, T., Waibel, A., 1996. Recognizing emotion in speech. In: Proc. ICSLP, Vol. 3, Philadelphia, PA, USA, pp. 1970–1973.

Douglas-Cowie, E., Cowie, R., Schröder, M., 2003. The description of naturally occurring emotional speech. In: Proc. 15th Internat. Conf. on Phonetic Sciences, Barcelona, Spain, pp. 2877–2880.

Fischer, 2002

Fragopanagos, 2005, Emotion recognition in human–computer interaction, Neural Networks, 18, 389, 10.1016/j.neunet.2005.03.006

Grimm, M., Kroschel, K., 2005a. Rule-based emotion classification using acoustic features. In: Proc. 3rd Internat. Conf. on Telemedicine and Multimedia Communication, Kajetany, Poland.

Grimm, M., Kroschel, K., 2005b. Evaluierung von natürlichen Emotionen in Sprachsignalen. In: Proceedings 31. Deutsche Jahrestagung für Akustik, DAGA’05, München, Germany, pp. 731–732.

Grimm, M., Kroschel, K., 2005c. Evaluation of natural emotions using self assessment manikins. In: Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), San Juan, Puerto Rico, pp. 381–385.

Grimm, M., Mower, E., Narayanan, S., Kroschel, K., 2006a. Combining categorical and primitives-based emotion recognition. In: Proc. 14th European Signal Processing Conference (EUSIPCO), Florence, Italy.

Grimm, M., Kroschel, K., Narayanan, S., 2006b. Modeling emotion expression and perception behavior in auditive emotion evaluation. In: Proc. ISCA 3rd Internat. Conf. on Speech Prosody, Dresden, Germany, pp. 9–12.

Hammal, Z., Bozkurt, B., Couvreur, L., Unay, D., Caplier, A., Dutoit, T., 2005. Passive versus active: Vocal classification system. In: Proc. Eusipco, Antalya, Turkey.

Hernandez, C., 2005. Einsatz von Fuzzy Logic zur Erkennung von Emotionen in der Sprache, Studienarbeit, Universität Karlsruhe (TH), Germany.

Huang, C.-F, Akagi, M., 2005. A multi-layer fuzzy logical model for emotional speech perception. In: Proc. Eurospeech, Lisbon, Portugal, pp. 417–420.

Kehrein, R., 2002. The prosody of authentic emotions. In: Proc. Speech Prosody Conference, pp. 423–426.

Kroschel, 2004

Lang, 1980, Behavioral treatment and bio-behavioral assessment, 119

Lee, C., Narayanan, S., 2003. Emotion recognition using a data-driven fuzzy inference system. In: Proc. Eurospeech, Geneva, pp. 157–160.

Lee, 2005, Toward detecting emotions in spoken dialogs, IEEE Trans. Speech Audio Process., 13, 293, 10.1109/TSA.2004.838534

Lee, C., Narayanan, S., Pieraccini, R., 2001. Recognition of negative emotions from the speech signal. In: Proc. IEEE ASRU, Trento, Italy, pp. 240–243.

Lee, S., Yildirim, S., Kazemzadeh, A., Narayanan, S, 2005. An articulatory study of emotional speech production. In: Proc. Eurospeech, pp. 497–500.

Murray, 1993, Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion, J. Acoust. Soc. Amer., 93, 1097, 10.1121/1.405558

Nagel, A., 2005. Robuste Pitch-Extraktion für die Erkennung von Emotionen in der Sprache. Diploma Thesis, Universität Karlsruhe (TH), Germany.

Nwe, 2003, Speech emotion recognition using hidden markov models, Speech Communication, 41, 603, 10.1016/S0167-6393(03)00099-2

Oudeyer, 2003, The production and recognition of emotions in speech: features and algorithms, Int. J. Hum. Comput. Stud., 59, 157

Russell, 1977, Evidence for a three-factor theory of emotions, J. Res. Personality, 11, 273, 10.1016/0092-6566(77)90037-X

Scherer, 2005, What are emotions? And how can they be measured?, Social Sci. Inf., 44, 693, 10.1177/0539018405058216

Schölkopf, 2002

Schröder, M., Cowie, R., Douglas-Cowie, E., Westerdijk, M., Gielen, S., 2001. Acoustic correlates of emotion dimensions in view of speech synthesis. In: Proc. Eurospeech, Vol. 1, Aalborg, pp. 87–90.

Schuller, B., Lang, M., Rigoll, G., 2005. Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles. In: Proc. Interspeech, Lisbon, Portugal, pp. 805–808.

Schuller, B., Lang, M., Rigoll, G., 2006. Recognition of spontaneous emotions by speech within automotive environment. In: Proc. 32. Deutsche Jahrestagung für Akustik, DAGA’06, Braunschweig, Germany, pp. 57–58.

Ververidis, D., Kotropoulos, C., Pitas, I., 2004. Automatic emotional speech classification. In: Proc. ICASSP, Montreal, Canada, pp. 593–596.

Vidrascu, L., Devillers, L., 2005. Real-life emotion representation and detection in call centers data. In: Proc. First Internat. Conf. on Affective Computing and Intelligent Interaction (ACII), Beijing, China, pp. 739–746.

Vidrascu, L., Devillers, L., 2005. Detection of real-life emotions in call centers. In: Proc. Eurospeech, pp. 1841–1844.

Wundt, 1896

Yu, Y., Chang, E., Li, C., 2002. Computer recognition of emotion in speech. In: Proc. Intel Internat. Science and Engineering Fair.

Yu, C., Aoki, P.M., Woodruff, A., 2004. Detecting user engagement in everyday conversations. In: Proc. 8th Internat. Conf. on Spoken Language Processing (ICSLP), Vol. 2, Jeju Island, Korea, pp. 1329–1332.

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Về chúng tôi

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích các bài báo, công bố khoa học Việt Nam. Công cụ trợ giúp người nghiên cứu, tạp chí, đơn vị nghiên cứu tra cứu, phân tích và thống kê dữ liệu nghiên cứu khoa học tại Việt Nam và quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia vào Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Hệ thống CSDL Khoa học & Công nghệ

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA