The role of voice quality and prosodic contour in affective speech perception

Speech Communication - Tập 54 - Trang 414-429 - 2012
Ioulia Grichkovtsova1, Michel Morel1, Anne Lacheret2,3
1Laboratoire CRISCO, EA 4255, Université de Caen Basse-Normandie, Esplanade de la Paix, 14032 Caen, France
2UFR LLPHI, Département de Sciences du Langage, Laboratoire MODYCO, UMR CNRS 7114, Université Paris Ouest Nanterre la Défense, 200, avenue de la République, 92001 Nanterre, France
3Institut Universitaire de France, 103 Bd Saint-Michel, 75005 Paris, France

Tài liệu tham khảo

Aubergé, 2003, Can we hear the prosody of smile?, Speech Comm., 40, 87, 10.1016/S0167-6393(02)00077-8 Baayen, 2004, Statistics in psycholinguistics: a critique of some current gold standards, Mental Lexicon Working Papers, 1, 1 Bachorowski, 1999, Vocal expression and perception of emotion, Curr. Dir. Psychol. Sci., 8, 53, 10.1111/1467-8721.00013 Banse, 1996, Acoustic profiles in vocal emotion expression, J. Pers. Soc. Psychol., 70, 614, 10.1037/0022-3514.70.3.614 Bänziger, 2005, The role of intonation in emotional expressions, Speech Comm., 46, 252, 10.1016/j.specom.2005.02.016 Barkhuysen, 2010, Crossmodal and incremental perception of audiovisual cues to emotional speech, Lang. Speech, 53, 3, 10.1177/0023830909348993 Beck, 2009, Multiple focus, J. Semantics, 26, 159, 10.1093/jos/ffp001 Campbell, N., Mokhtari, P., 2006. Voice quality: the 4th prosodic dimension. In: Proc. XVth Internat. Congress of Phonetic Sciences, Barcelona, Spain, pp. 2417–2420. Chen, A.J., 2005. Universal and language-specific perception of paralinguistic intonational meaning. Ph.D. Thesis. d’Alessandro, 2006, Voice source parameters and prosodic analysis, 63 Dromey, 2005, Recognition of affective prosody by speakers of English as a first or foreign language, Speech Comm., 47, 351, 10.1016/j.specom.2004.09.010 Dutoit, T., Pagel, V., Pierret, N., Bataille, F., van der Vrecken, O., 1996. The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes. In: ICSLP, Philadelphia, pp. 1393–1396. Ekman, 1999, Basic emotions Elfenbein, 2003, Universal and cultural differences in recognizing emotions, Curr. Dir. Psychol. Sci., 12, 159, 10.1111/1467-8721.01252 Erickson, D., 2010. Perception by Japanese, Korean and American listeners to a Korean speaker’s recollection of past emotional events: some acoustic cues. In: Speech Prosody 2010, Chicago. Garcia, M.N., d’Alessandro, C., Bailly, G., de Mareuil, P.B., Morel, M., 2006. A joint prosody evaluation of French text-to-speech systems. In: Proc. LREC, pp. 307–310. Ghio, 2007, PERCEVAL: une station automatisée de tests de PERCeption et d’EVALuation auditive et visuelle, TIPA, 22, 115 Gobl, 2003, The role of voice quality in communicating emotion, mood and attitude, Speech Comm., 40, 189, 10.1016/S0167-6393(02)00082-1 Grichkovtsova, I., Lacheret, A., Morel, M., 2007. The role of intonation and voice quality in the affective speech perception. In: Proc. Interspeech. Grichkovtsova, 2009, Perception of affective prosody in natural and synthesized speech: which methodological approach?, 371 Hammerschmidt, 2007, Acoustical correlates of affective prosody, J. Voice, 21, 531, 10.1016/j.jvoice.2006.03.002 2009 Hox, 2002 Izard, 1971 Izdebski, 2007 Jaeger, 2008, Categorical data analysis: away from ANOVAs (transformation or not) and towards logit mixed models, Mem. Lang., 59, 434, 10.1016/j.jml.2007.11.007 Johnstone, 2000, Vocal communication of emotion, 220 Lakshminarayanan, 2003, The effect of spectral manipulations on the identification of affective and linguistic prosody, Brain Lang., 84, 250, 10.1016/S0093-934X(02)00516-3 Laver, 1980 Laver, 1994 Mejvaldova, J., Horak, P., 2002. Synonymie et homonymie attitudinale en tchèque et en français. In: Proc. Internat. Conf. on Speech Prosody 2002, Aix-en-Provence, France. Moineddin, 2007, A simulation study of sample size for multilevel logistic regression models, BMC Med. Res. Methodol., 7, 1, 10.1186/1471-2288-7-34 Morel, 2004, Le rôle de l’intonation dans la communication vocale des émotions: test par la synthèse, Cah. Inst. Ling. Louvain, 30, 207, 10.2143/CILL.30.1.519219 Morel, 2001, Kali, synthèse vocale à partir du texte: de la conception à la mise en oeuvre, Trait. Automat. Lang., 42, 1 Murray, 1993, Towards the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion, J. Acoust. Soc. Amer., 93, 1097, 10.1121/1.405558 Murray, 2008, Applying an analysis of acted vocal emotions to improve the simulation of synthetic speech, Comput. Speech Lang., 22, 107, 10.1016/j.csl.2007.06.001 Oatley, 1987, Towards a cognitive theory of emotions, Cognition Emotion, 1, 29, 10.1080/02699938708408362 Ortony, 1990, What’s basic about basic emotions?, Cognition Emotion, 97, 315 Pell, 2009, Recognizing emotions in a foreign language, J. Nonverb. Behav., 33, 107, 10.1007/s10919-008-0065-7 Pell, 2009, Factors in the recognition of vocally expressed emotions: a comparison of four languages, J. Phonetics, 37, 417, 10.1016/j.wocn.2009.07.005 Plutchik, 1993, Emotions and their vicissitudes: emotions and psychopathology Power, 2008 Prudon, 2004, Unit selection synthesis of prosody: evaluation using diphone transplantation, 203 R Development Core Team, 2008. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Rilliard, 2009, Multimodal indices to Japanese and French prosodically expressed social affects, Lang. Speech, 52, 223, 10.1177/0023830909103171 Rodero, 2011, Intonation and emotion: influence of pitch levels and contour type on creating emotions, J. Voice, 25, 25, 10.1016/j.jvoice.2010.02.002 Scherer, 2003, Vocal communication of emotions: a review of research paradigms, Speech Comm., 40, 227, 10.1016/S0167-6393(02)00084-5 Scherer, 2001, Emotion inferences from vocal expression correlate across languages and cultures, J. Cross-Cult. Psychol., 32, 76, 10.1177/0022022101032001009 Schröder, 2008, Approaches to emotional expressivity in synthetic speech, 307 Shochi, T., Aubergé, V., Rillard, A., 2006. How prosodic attitudes can be false friends: Japanese vs. French social affects. In: Proc. Internat. Conf. on Speech Prosody 2006, Dresden, Germany. Shochi, 2009, Intercultural perception of social affective prosody, 31 Thompson, 2004, Decoding speech prosody: do music lessons help?, Emotion, 4, 46, 10.1037/1528-3542.4.1.46 Williams, 1972, Emotions and speech: some acoustic correlates, J. Acoust. Soc. Amer., 52, 1238, 10.1121/1.1913238 Yanushevskaya, I., Gobl, C., Ni Chasaide, A., 2006. Mapping voice to affect: Japanese listeners. In: Proc. 3rd Internat. Conf. on Speech Prosody 2006, Dresden, Germany.