Voice Identity Recognition: Functional Division of the Right STS and Its Behavioral Relevance

Journal of Cognitive Neuroscience - Tập 27 Số 2 - Trang 280-291 - 2015
Sonja Schall1, Stefan J. Kiebel1,2,3, Burkhard Maeß1, Katharina von Kriegstein1,4
11Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig
22University Clinic Jena
33Technical University, Dresden
44Humboldt University of Berlin

Tóm tắt

Abstract

The human voice is the primary carrier of speech but also a fingerprint for person identity. Previous neuroimaging studies have revealed that speech and identity recognition is accomplished by partially different neural pathways, despite the perceptual unity of the vocal sound. Importantly, the right STS has been implicated in voice processing, with different contributions of its posterior and anterior parts. However, the time point at which vocal and speech processing diverge is currently unknown. Also, the exact role of the right STS during voice processing is so far unclear because its behavioral relevance has not yet been established. Here, we used the high temporal resolution of magnetoencephalography and a speech task control to pinpoint transient behavioral correlates: we found, at 200 msec after stimulus onset, that activity in right anterior STS predicted behavioral voice recognition performance. At the same time point, the posterior right STS showed increased activity during voice identity recognition in contrast to speech recognition whereas the left mid STS showed the reverse pattern. In contrast to the highly speech-sensitive left STS, the current results highlight the right STS as a key area for voice identity recognition and show that its anatomical-functional division emerges around 200 msec after stimulus onset. We suggest that this time point marks the speech-independent processing of vocal sounds in the posterior STS and their successful mapping to vocal identities in the anterior STS.

Từ khóa


Tài liệu tham khảo

Altmann, 2010, Processing of spectral and amplitude envelope of animal vocalizations in the human auditory cortex., Neuropsychologia, 48, 2824, 10.1016/j.neuropsychologia.2010.05.024

Andics, 2010, Neural mechanisms for voice recognition., Neuroimage, 52, 1528, 10.1016/j.neuroimage.2010.05.048

Belin, 2004, Thinking the voice: Neural correlates of voice perception., Trends in Cognitive Sciences, 8, 129, 10.1016/j.tics.2004.01.008

Belin, 2003, Adaptation to speaker's voice in right anterior temporal lobe., NeuroReport, 14, 2105, 10.1097/00001756-200311140-00019

Belin, 2002, Human temporal-lobe response to vocal sounds., Brain Research, Cognitive Brain Research, 13, 17, 10.1016/S0926-6410(01)00084-2

Belin, 2000, Voice-selective areas in human auditory cortex., Nature, 403, 309, 10.1038/35002078

Bestelmeyer, 2011, Right temporal TMS impairs voice detection., Current Biology, 21, R838, 10.1016/j.cub.2011.08.046

Boemio, 2005, Hierarchical and asymmetric temporal sensitivity in human auditory cortices., Nature Neuroscience, 8, 389, 10.1038/nn1409

Bonte, 2009, Dynamic and task-dependent encoding of speech and voice by phase reorganization of cortical oscillations., Journal of Neuroscience, 29, 1699, 10.1523/JNEUROSCI.3694-08.2009

Capilla, 2013, The early spatio-temporal correlates and task independence of cerebral voice processing studied with MEG., Cerebral Cortex, 23, 1388, 10.1093/cercor/bhs119

Charest, 2009, Electrophysiological evidence for an early processing of human voices., BMC Neuroscience, 10, 127, 10.1186/1471-2202-10-127

Crowley, 2004, A review of the evidence for P2 being an independent component process: Age, sleep and modality., Clinical Neurophysiology: Official Journal of the International Federation of Clinical Neurophysiology, 115, 732, 10.1016/j.clinph.2003.11.021

Davis, 2003, Hierarchical processing in spoken language comprehension., Journal of Neuroscience, 23, 3423, 10.1523/JNEUROSCI.23-08-03423.2003

De Lucia, 2010, A temporal hierarchy for conspecific vocalization discrimination in humans., Journal of Neuroscience, 30, 11210, 10.1523/JNEUROSCI.2239-10.2010

Ellis, 1997, Intra- and inter-modal repetition priming of familiar faces and voices., British Journal of Psychology, 88, 14, 10.1111/j.2044-8295.1997.tb02625.x

Fischl, 1999, Cortical surface-based analysis. II: Inflation, flattening, and a surface-based coordinate system., Neuroimage, 9, 195, 10.1006/nimg.1998.0396

Fischl, 1999, High-resolution intersubject averaging and a coordinate system for the cortical surface., Human Brain Mapping, 8, 272, 10.1002/(SICI)1097-0193(1999)8:4<272::AID-HBM10>3.0.CO;2-4

Formisano, 2008, “Who” is saying “what”? Brain-based decoding of human voice and speech., Science, 322, 970, 10.1126/science.1164318

Gaudrain, 2009, The role of glottal pulse rate and vocal tract length in the perception of speaker identity, 10.21437/Interspeech.2009-54

Giraud, 2007, Endogenous cortical rhythms determine cerebral specialization for speech perception and production., Neuron, 56, 1127, 10.1016/j.neuron.2007.09.038

Hanson, 2010, MEG: An introduction to methods, 10.1093/acprof:oso/9780195307238.001.0001

Hausfeld, 2012, Pattern analysis of EEG responses to speech and voice: Influence of feature grouping., Neuroimage, 59, 3641, 10.1016/j.neuroimage.2011.11.056

Hickok, 2007, The cortical organization of speech processing., Nature Reviews Neuroscience, 8, 393, 10.1038/nrn2113

Hsiao, 2013, Hemispheric asymmetry in perception: A differential encoding account., Journal of Cognitive Neuroscience, 25, 998, 10.1162/jocn_a_00377

Jamison, 2006, Hemispheric specialization for processing auditory nonspeech stimuli., Cerebral Cortex, 16, 1266, 10.1093/cercor/bhj068

Kriegstein, 2004, Distinct functional substrates along the right superior temporal sulcus for the processing of voices., Neuroimage, 22, 948, 10.1016/j.neuroimage.2004.02.020

Lang, 2009, Voice recognition in aphasic and non-aphasic stroke patients., Journal of Neurology, 256, 1303, 10.1007/s00415-009-5118-2

Lavner, 2000, The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels., Speech Communication, 30, 18, 10.1016/S0167-6393(99)00028-X

Leff, 2008, The cortical dynamics of intelligible speech., Journal of Neuroscience, 28, 13209, 10.1523/JNEUROSCI.2903-08.2008

Maris, 2007, Nonparametric statistical testing of EEG- and MEG-data., Journal of Neuroscience Methods, 164, 177, 10.1016/j.jneumeth.2007.03.024

Martin, 2008, Speech evoked potentials: From the laboratory to the clinic., Ear and Hearing, 29, 285, 10.1097/AUD.0b013e3181662c0e

Matsumoto, 2011, Left anterior temporal cortex actively engages in speech perception: A direct cortical stimulation study., Neuropsychologia, 49, 1350, 10.1016/j.neuropsychologia.2011.01.023

Obleser, 2008, Bilateral speech comprehension reflects differential sensitivity to spectral and temporal features., Journal of Neuroscience, 28, 8116, 10.1523/JNEUROSCI.1290-08.2008

Oostenveld, 2011, FieldTrip: Open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data., Computational Intelligence and Neuroscience, 2011, 156869, 10.1155/2011/156869

Pascual-Marqui, 2002, Standardized low-resolution brain electromagnetic tomography (sLORETA): Technical details., Methods and Findings in Experimental and Clinical Pharmacology, 24(Suppl. D), 5

Perrodin, 2011, Voice cells in the primate temporal lobe., Current Biology, 21, 1408, 10.1016/j.cub.2011.07.028

Petkov, 2008, A voice region in the monkey brain., Nature Neuroscience, 11, 367, 10.1038/nn2043

Poeppel, 2003, The analysis of speech in different temporal integration windows: Cerebral lateralization as “asymmetric sampling in time”., Speech Communication, 41, 245, 10.1016/S0167-6393(02)00107-3

Remedios, 2009, An auditory region in the primate insular cortex responding preferentially to vocal communication sounds., Journal of Neuroscience, 29, 1034, 10.1523/JNEUROSCI.4089-08.2009

Renvall, 2012, Of cats and women: Temporal dynamics in the right temporoparietal cortex reflect auditory categorical processing of vocalizations., Neuroimage, 62, 1877, 10.1016/j.neuroimage.2012.06.010

Rosen, 2011, Hemispheric asymmetries in speech perception: Sense, nonsense and modulations., PLoS One, 6, e24672, 10.1371/journal.pone.0024672

Schall, 2013, Early auditory sensory processing of voices is facilitated by visual mechanisms., Neuroimage, 77, 237, 10.1016/j.neuroimage.2013.03.043

Schweinberger, 2001, Human brain potential correlates of voice priming and voice recognition., Neuropsychologia, 39, 921, 10.1016/S0028-3932(01)00023-9

Scott, 2000, Identification of a pathway for intelligible speech in the left temporal lobe., Brain: A Journal of Neurology, 123, 2400, 10.1093/brain/123.12.2400

Shahin, 2005, Modulation of P2 auditory-evoked responses by the spectral complexity of musical sounds., NeuroReport, 16, 1781, 10.1097/01.wnr.0000185017.29316.63

Stevens, 2004, Dissociating the cortical basis of memory for voices, words and tones., Brain Research, Cognitive Brain Research, 18, 162, 10.1016/j.cogbrainres.2003.10.008

Taulu, 2004, Suppression of interference and artifacts by the signal space separation method., Brain Topography, 16, 269, 10.1023/B:BRAT.0000032864.93890.f9

Treue, 1999, Feature-based attention influences motion processing gain in macaque visual cortex., Nature, 399, 575, 10.1038/21176

Van Lancker, 1987, Voice discrimination and recognition are separate abilities., Neuropsychologia, 25, 829, 10.1016/0028-3932(87)90120-5

Van Lancker, 1982, Impairment of voice and face recognition in patients with hemispheric damage., Brain and Cognition, 1, 185, 10.1016/0278-2626(82)90016-1

Van Lancker, 1988, Phonagnosia: A dissociation between familiar and unfamiliar voices., Cortex, 24, 195, 10.1016/S0010-9452(88)80029-7

Van Lancker, 1989, Voice perception deficits: Neuroanatomical correlates of phonagnosia., Journal of Clinical and Experimental Neuropsychology, 11, 665, 10.1080/01688638908400923

von Kriegstein, 2003, Modulation of neural responses to speech by directing attention to voices or verbal content., Brain Research, Cognitive Brain Research, 17, 48, 10.1016/S0926-6410(03)00079-X

von Kriegstein, 2010, How the human brain recognizes speech in the context of changing speakers., Journal of Neuroscience, 30, 629, 10.1523/JNEUROSCI.2742-09.2010

Watson, 2012, Sound-induced activity in voice-sensitive cortex predicts voice memory ability., Frontiers in Psychology, 3, 89, 10.3389/fpsyg.2012.00089

Zaske, 2009, In the ear of the beholder: Neural correlates of adaptation to voice gender., European Journal of Neuroscience, 30, 527, 10.1111/j.1460-9568.2009.06839.x

Zatorre, 2001, Spectral and temporal processing in human auditory cortex., Cerebral Cortex, 11, 946, 10.1093/cercor/11.10.946