Emotion recognition in human-computer interaction

IEEE Signal Processing Magazine - Tập 18 Số 1 - Trang 32-80 - 2001
Roddy Cowie1, Ellen Douglas‐Cowie2, Nicolas Tsapatsoulis3, G. Votsis4, Stefanos Kollias, Winfried A. Fellenz, John G. Taylor
1School of Psychology,
2School of English
3Department of Electrical and Computer Engineering, National Technical University of Athens.
4National Technical University of Athens

Tóm tắt

Từ khóa


Tài liệu tham khảo

10.1044/jshr.1103.481

10.1007/BF00869311

coleman, 1979, Identification of emotional states using perceptual and acoustic analyses, Care of the Professional Voice, 1

10.1037/h0022151

10.1001/archpsyc.1986.01800030098011

sule, 1977, Emotional changes in human voice, Activitas Nervosa Superior, 19, 215

utsuki, 1976, Relationship between emotional state and fundamental frequency of speech, Rep Aeromedical Laboratory Japan Air Self-Defense Force, 16, 179

10.1515/9783111357447

10.1037/h0027355

oster, 1986, The identification of the mood of a speaker by hearing impaired listeners, Speech Transmission Lab Quarterly Progress Status Report 4 Stockholm, 79

sedlacek, 1963, Die melodie als faktor des emotionellen ausdrucks [Speech melody as a means of emotional expression], Folia Phoniatrica, 15, 89, 10.1159/000262956

o'connor, 1973, Intonation of Colloquial English

crystal, 1975, The English Tone of Voice

lynch, 1934, A phonophotographic study of trained and untrained voices reading factual and dramatic material, Arch Speech, 1, 9

10.1080/03637753909374863

cowan, 1936, Pitch and intensity characteristics of stage speech, Arch Speech

harre, 1986, Emotion talk across times, The Social Construction of Emotions, 220

crystal, 1969, Prosodic Systems and Intonation in English

schubiger, 1958, English Intonation Its Form and Function

10.1121/1.405558

10.1037/0033-2909.97.3.412

jaffe, 1960, Rhythms of Dialogue

hicks, 1978, An acoustical/temporal analysis of emotional stress in speech, Dissertation Abstracts International, 41

breclder, 1989, On defining attitude theory: Once more with feeling, inAttitude Structure and Function

fishbein, 1975, Belief Attitude Intention and Behavior An Introduction to Theory and Research

cowie, 1999, Changing emotional tone in dialogue and its prosodic correlates, Proc ESCA Workshop on Dialogue and Prosody

rolls, 1999, The Brain and Emotion

10.1111/j.1600-0447.1988.tb05118.x

10.1016/S0163-6383(96)90035-1

10.1007/978-1-4613-2892-6_18

10.1006/csla.1994.1020

averill, 1986, Acquisition of emotions in adulthood, The Social Construction of Emotions, 100

fox, 1992, If it's not left it's right, Amer Psychol, 46, 863, 10.1037/0003-066X.46.8.863

tench, 1996, The Intonation Systems of English

morsbach, 1986, A Japanese emotion: Amae, The Social Construction of Emotions, 289

10.1080/02699938708408362

10.1016/B978-0-12-558704-4.50011-6

10.1002/0470013494.ch3

10.1017/CBO9780511659911

plutchik, 1980, Emotion A Psychoevolutionary Synthesis

tomkins, 1982, Affect theory, Emotion in the Human Face

10.1002/0470013494

10.1037/h0041133

10.1017/S0025100300006277

ginsberg, 1987, Readings in Nonmonotonic Reasoning

10.1088/0954-898X_3_1_007

davitz, 1964, The Communication of Emotional Meaning

osgood, 1957, The Measurement of Meaning

1999, Confidence estimation and on-line retraining of neural networks, EC TMR Project PHYSTA Report

ekman, 1975, Pictures of Facial Affect

1999, Test material format and availability, EC TMR Project PHYSTA

10.1109/CVPR.1997.609381

10.1109/76.611173

parke, 1996, Computer Facial Animation

lam, 1998, An analytic to holistic approach for face recognition based on a single frontal view, IEEE Trans Pattern Anal Machine Intell, 20

10.1016/S0892-1997(05)80200-0

suzuki, 1995, Extraction of precise fundamental frequency based on harmonic structure of speech, Proc 15th Int Congr Acoustics, 3, 161

mousset, 0, A comparison of recent several methods of fundamental frequency estimation, Proc ICSLP 96, 1273

1998, Hybrid systems for feature to symbol extraction, EC TMR Project PHYSTA Report

izzo, 1998, Multiresolution techniques and emotional speech, PHYSTA Project Report

mcgilloway, 1997, Negative symptoms and speech parameters in schizophrenia

mcgilloway, 1995, Prosodic signs of emotion in speech: Preliminary results from a new technique for automatic statistical analysis, Proc 13th ICPhS, 1989

10.1515/9783110869125

10.1121/1.1918222

trojan, 1952, Der Ausdruck der Sprechstimme

fonagy, 1963, Emotional patterns in intonation and music, Z Phonet Sprachwiss Kommunikationsforsch, 16, 293

10.1121/1.1913238

fonagy, 1978, Emotions, voice and music, Language and Speech, 21, 34, 10.1177/002383097802100102

muller, 1960, Experimentelle unterusuchungen zur stimmlichen darstellung von gefuehlen [Experimental studies on vocal portrayal of emotion]

kotlyar, 1976, Acoustic correlates of the emotional content of vocalized speech, J Acoust Academy of Sciences of the USSR, 22, 208

10.1515/9783110850390

havrdova, 1979, Changes of the voice expression during suggestively influenced states of experiencing, Activitas Nervosa Superior, 21, 33

1987, DSM III&#x2014 Diagnostic and Statistical Manual of Mental Disorders III

sverts, 1998, Prosody and conversation, Language and Speech (Special Issue), 41

10.1037/0033-2909.115.1.102

mcgilloway, 1995, Prosodic signs of emotion in speech: Preliminary results from a new technique for automatic statistical analysis, Proc XIIIth Int Congr Phonetic Sciences, i, 250

hoffe, 1960, Ueber beziehung von sprachmelodie und lautstarke [On the relation between speech melody and intensity], Phonetica, 5, 129

10.1037/0022-3514.70.3.614

10.1080/03637754109374888

10.1109/ICSLP.1996.608027

fonagy, 1978, A new method of investigating the perception of prosodic features, Language and Speech, 21, 34, 10.1177/002383097802100102

williams, 1969, On determining the emotional state of pilots during flight: An exploratory study, Aerospace Medicine, 40, 1369

douglas-cowie, 1998, International settings as markers of discourse units in telephone conversations, Language and Speech (Special Issue Prosody and Conversation), 41, 351

10.1111/j.1460-2466.1959.tb00286.x

ladd, 1996, Intonational Phonology

10.1121/1.1916060

10.1126/science.88.2286.382

10.1121/1.392466

uldall, 1960, Attitudinal meanings conveyed by intonational contours, Language and Speech, 3, 223, 10.1177/002383096000300403

couper kuhlen, 1986, An Introduction to English Prosody, 176

10.1121/1.391450

cutler, 1986, On the analysis of prosodic turn-taking cues, Intonation in Discourse, 139

10.3758/BF03206502

keller, 1995, A statistical timing model for French, Proc 13th Int Congr Phonetic Sciences, 3, 302

liberman, 1977, On stress and linguistic rhythm, Linguistic Inquiry, 8, 249

10.1037/10001-000

ekman, 1973, Darwin and Facial Expressions

brown, 1980, Questions of Intonation

bruce, 1998, In the Eye of the Beholder The Science of Face Perception

davis, 1975, Recognition of Facial Expressions

scherer, 1984, Approaches to Emotion

10.1109/ICASSP.1997.596153

reichl, 0, Syllable segmentation of continuous speech with artificial neural networks, Proc Eurospeech 93, 3, 1771

10.1109/ICSLP.1996.607838

bengio, 0, Phonetically motivated acoustic parameters for continuos speech recognition using neural networks, Proc Eurospeech-91, 551

esposito, 0, Preprocessing and neural classification of the English stops [b, d, g, p, t, kJ, Proc ICSLP 96, 2, 1249

esposito, 1996, A Rasta-PLP and TDNN based automatic system for recognizing stop consonants: Performance studies, Vietri sul Mare (SA)

laver, 1980, The Phonetic Description of Voice Quality

10.3109/00016488009131746

10.1109/ICSLP.1996.607837

klasmeyer, 1995, Objective voice parameters to characterise the emotional content in speech, Proc 13th Int Congr Phonetic Sciences, 2, 182

cowie, 1999, What a neural net needs to know about emotion words, Proc 3rd World Multiconf on Circuits Systems Comms and Computers

10.1037/0003-066X.40.3.355

10.1097/00006842-195309000-00007

roseman, 1984, Cognitive determinants of emotion, Review of Personality and Social Psychology Vol 5 Emotions Relationships and Health

10.1037/0022-3514.59.5.899

10.1017/CBO9780511571299

frijda, 1986, The Emotions

10.1111/1467-8659.1410035

10.1016/0167-8655(95)00089-5

cowie, 1999, The prosodic correlates of expressive reading, Proc 14th Int Congr Phonetic Sciences, 2327

scalaidhe, 1997, Science, 278, 1135, 10.1126/science.278.5340.1135

kamachi, 1999, The dynamics of facial expression judgment, Perception, 28 s, 54

10.1037/0033-2909.95.1.52

10.1038/385254a0

10.1109/CVPR.1994.323813

10.1038/39051

ekman, 1978, The Facial Action Coding System

10.1109/34.216726

shibui, 1999, Categorical perception and semantic information processing of facial expressions, Perception, 28 s, 114

1996, MPEG4 SNHC Face and Body Definition and Animation Parameters ISO/IEC JTCl/SC29/WG11MPEG96/N1365

bruce, 1988, Recognizing Faces

ekman, 1992, NSF planning workshop on facial expression understanding, Tech Rep

gabrieli, 1998, The role of the left prefontal cortex in language and memory, Proc Natl Academy Sciences USA, 95, 906, 10.1073/pnas.95.3.906

jaynes, 1976, The Breakdown of the Bicameral Mind

young, 1989, Handbook of Research on Face Processing

lucas, 0, An iterative image registration technique with an application to stereo vision, Proc 7th Intl Joint Conf onAl

10.1017/S0048577299971664

ekman, 0, FACSAID: A computer database for predicting affective phenomena from facial movement

cabeza, 1997, Cognitive Neuroscience, 9, 1, 10.1162/jocn.1997.9.1.1

lien, 1998, Subtly different facial expression recognition and emotion expression intensity estimation, Proc IEEE CVPR, 853

silverman, 1992, ToBI: A standard for labelling English prosody, Proc Int Conf Spoken Language Processing, 286

mase, 1991, Recognition of facial expression from optical flow, IEICE Trans, e74, 3474

roivainen, 1993, 3-D motion estimation in model-based facial image coding, IEEE Trans Pattern Anal Machine Intell, 15, 545, 10.1109/34.216724

katz, 1996, A combination of vocal F0 dynamic and summary features discriminates between pragmatic categories of in-fant-directed speech, Child Development, 67, 205

cahn, 1990, The generation of affect in synthesised speech, J American Voice I/O Society, 8, 1

10.1145/965161.806812

essa, 1993, Physically-based modeling for graphics and vision, Directions in Geometric Computing Information Geometers

10.1109/MNRAO.1994.346257

essa, 1995, Coding, analysis, interpretation and recognition of facial expressions, Tech Rep, 325

10.1109/AFGR.1996.557276

10.1109/MMSP.1998.738918

10.1109/CVPR.1997.609393

10.1109/ACSSC.1994.471664

10.1109/ICPR.1996.547019

10.1109/AFGR.1996.557248

10.1109/AFGR.1996.557278

10.1109/AFGR.1998.670934

fieguth, 0, Color-based tracking of image regions with changes in geometry and illumination, Proceedings CVPR 1996, 403

10.1109/AFGR.1996.557261

10.1109/AFGR.1998.670953

kruger, 1999, Affine face tracking using a wavelet network, Proc Int Workshop on Recognition Analysis and Tracking of Faces and Gestures in Real-time Systems

10.1109/AFGR.1996.557275

10.1109/AFGR.1996.557290

10.1017/S0048577299971184

10.1109/AFGR.1996.557289

10.1109/AFGR.1996.557266

giles, 1979, Accommodation theory: Optimal levels of convergence, Language and Social Psychology, 45

cowie, 1995, Speakers and hearers are people: Reflections on speech deterioration as a consequence of acquired deafness, Profound Deafness and Speech Communication, 510

herpers, 1995, An attentional processing strategy to detect and analyse the prominent facial regions, Proc Int Conf on Automatic Face and Gesture Recognition, 214

10.1016/0004-3702(81)90024-2

10.1037/0022-3514.37.11.2049

golomb, 1991, Sexnet: A neural net identifies sex from human faces, NIPS 3, 572

padgett, 1996, Categorical perception in facial emotion classification, inProc Cognitive Science Conf, 18, 249

padgett, 1997, Representing face images for emotion classification, Advances in neural information processing systems, 9, 894

ekman, 1975, Unmasking the Face

ekman, 1993, Final report to NSF of the planning workshop on facial expression understanding, Tech Rep

pelachaud, 1994, Final report to NSF of the standards for facial animation workshop, Tech Rep

10.1109/CVPR.1993.340962

10.1007/BF00158167

10.1007/BF00133568

10.1109/ICSLP.1996.607983

wallzer, 1997, Improvising linguistic style: Social and affect bases for agent personality, Proc Int Conf Autonomous Agents ACM SIGART, 96

cornelius, 1996, The Science of Emotion

oatley, 1995, Communicative theory of emotions: Empirical tests, mental models & implications for social interaction, Goals and Affect

plutchik, 1994, The Psychology and Biology of Emotion, 58

10.1037//0033-2909.99.2.143

sakaguchi, 1995, Facial expression recognition from image sequence using hidden Markov model, VLBV95, a 5

scherer, 1984, On the nature and function of emotion: A component process approach, Approaches to Emotion

wu, 1998, Optical flow estimation using wavlet motion model, Proc Int Conf Computer Vision (ICCV)

oatley, 1996, Understanding Emotions

arnold, 1960, Emotion and Personality Vol 2 Physiological Aspects

10.1109/ROMAN.1996.568857

lazarus, 1991, Emotion andAdaptation, 10.1093/oso/9780195069945.001.0001

tsapatsoulis, 1999, On the use of radon transform for facial expression recognition, Proc Int l Conf Information Systems Analysis and Synthesis

10.1109/CVPR.1994.323812

black, 1993, The robust estimation of optical flow, Proc Int Conf Computer Vision, 231

10.1109/34.506414

10.1109/ICIP.1997.638829

yacoob, 1994, Recognizing human facial expressions, Proc 2nd Workshop on Visual Form, 584

10.1109/72.536309

10.1109/5.664277