Voice Identity Recognition: Functional Division of the Right STS and Its Behavioral Relevance
Tóm tắt
The human voice is the primary carrier of speech but also a fingerprint for person identity. Previous neuroimaging studies have revealed that speech and identity recognition is accomplished by partially different neural pathways, despite the perceptual unity of the vocal sound. Importantly, the right STS has been implicated in voice processing, with different contributions of its posterior and anterior parts. However, the time point at which vocal and speech processing diverge is currently unknown. Also, the exact role of the right STS during voice processing is so far unclear because its behavioral relevance has not yet been established. Here, we used the high temporal resolution of magnetoencephalography and a speech task control to pinpoint transient behavioral correlates: we found, at 200 msec after stimulus onset, that activity in right anterior STS predicted behavioral voice recognition performance. At the same time point, the posterior right STS showed increased activity during voice identity recognition in contrast to speech recognition whereas the left mid STS showed the reverse pattern. In contrast to the highly speech-sensitive left STS, the current results highlight the right STS as a key area for voice identity recognition and show that its anatomical-functional division emerges around 200 msec after stimulus onset. We suggest that this time point marks the speech-independent processing of vocal sounds in the posterior STS and their successful mapping to vocal identities in the anterior STS.
Từ khóa
Tài liệu tham khảo
Altmann, 2010, Processing of spectral and amplitude envelope of animal vocalizations in the human auditory cortex., Neuropsychologia, 48, 2824, 10.1016/j.neuropsychologia.2010.05.024
Andics, 2010, Neural mechanisms for voice recognition., Neuroimage, 52, 1528, 10.1016/j.neuroimage.2010.05.048
Belin, 2004, Thinking the voice: Neural correlates of voice perception., Trends in Cognitive Sciences, 8, 129, 10.1016/j.tics.2004.01.008
Belin, 2003, Adaptation to speaker's voice in right anterior temporal lobe., NeuroReport, 14, 2105, 10.1097/00001756-200311140-00019
Belin, 2002, Human temporal-lobe response to vocal sounds., Brain Research, Cognitive Brain Research, 13, 17, 10.1016/S0926-6410(01)00084-2
Bestelmeyer, 2011, Right temporal TMS impairs voice detection., Current Biology, 21, R838, 10.1016/j.cub.2011.08.046
Boemio, 2005, Hierarchical and asymmetric temporal sensitivity in human auditory cortices., Nature Neuroscience, 8, 389, 10.1038/nn1409
Bonte, 2009, Dynamic and task-dependent encoding of speech and voice by phase reorganization of cortical oscillations., Journal of Neuroscience, 29, 1699, 10.1523/JNEUROSCI.3694-08.2009
Capilla, 2013, The early spatio-temporal correlates and task independence of cerebral voice processing studied with MEG., Cerebral Cortex, 23, 1388, 10.1093/cercor/bhs119
Charest, 2009, Electrophysiological evidence for an early processing of human voices., BMC Neuroscience, 10, 127, 10.1186/1471-2202-10-127
Crowley, 2004, A review of the evidence for P2 being an independent component process: Age, sleep and modality., Clinical Neurophysiology: Official Journal of the International Federation of Clinical Neurophysiology, 115, 732, 10.1016/j.clinph.2003.11.021
Davis, 2003, Hierarchical processing in spoken language comprehension., Journal of Neuroscience, 23, 3423, 10.1523/JNEUROSCI.23-08-03423.2003
De Lucia, 2010, A temporal hierarchy for conspecific vocalization discrimination in humans., Journal of Neuroscience, 30, 11210, 10.1523/JNEUROSCI.2239-10.2010
Ellis, 1997, Intra- and inter-modal repetition priming of familiar faces and voices., British Journal of Psychology, 88, 14, 10.1111/j.2044-8295.1997.tb02625.x
Fischl, 1999, Cortical surface-based analysis. II: Inflation, flattening, and a surface-based coordinate system., Neuroimage, 9, 195, 10.1006/nimg.1998.0396
Fischl, 1999, High-resolution intersubject averaging and a coordinate system for the cortical surface., Human Brain Mapping, 8, 272, 10.1002/(SICI)1097-0193(1999)8:4<272::AID-HBM10>3.0.CO;2-4
Formisano, 2008, “Who” is saying “what”? Brain-based decoding of human voice and speech., Science, 322, 970, 10.1126/science.1164318
Gaudrain, 2009, The role of glottal pulse rate and vocal tract length in the perception of speaker identity, 10.21437/Interspeech.2009-54
Giraud, 2007, Endogenous cortical rhythms determine cerebral specialization for speech perception and production., Neuron, 56, 1127, 10.1016/j.neuron.2007.09.038
Hausfeld, 2012, Pattern analysis of EEG responses to speech and voice: Influence of feature grouping., Neuroimage, 59, 3641, 10.1016/j.neuroimage.2011.11.056
Hickok, 2007, The cortical organization of speech processing., Nature Reviews Neuroscience, 8, 393, 10.1038/nrn2113
Hsiao, 2013, Hemispheric asymmetry in perception: A differential encoding account., Journal of Cognitive Neuroscience, 25, 998, 10.1162/jocn_a_00377
Jamison, 2006, Hemispheric specialization for processing auditory nonspeech stimuli., Cerebral Cortex, 16, 1266, 10.1093/cercor/bhj068
Kriegstein, 2004, Distinct functional substrates along the right superior temporal sulcus for the processing of voices., Neuroimage, 22, 948, 10.1016/j.neuroimage.2004.02.020
Lang, 2009, Voice recognition in aphasic and non-aphasic stroke patients., Journal of Neurology, 256, 1303, 10.1007/s00415-009-5118-2
Lavner, 2000, The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels., Speech Communication, 30, 18, 10.1016/S0167-6393(99)00028-X
Leff, 2008, The cortical dynamics of intelligible speech., Journal of Neuroscience, 28, 13209, 10.1523/JNEUROSCI.2903-08.2008
Maris, 2007, Nonparametric statistical testing of EEG- and MEG-data., Journal of Neuroscience Methods, 164, 177, 10.1016/j.jneumeth.2007.03.024
Martin, 2008, Speech evoked potentials: From the laboratory to the clinic., Ear and Hearing, 29, 285, 10.1097/AUD.0b013e3181662c0e
Matsumoto, 2011, Left anterior temporal cortex actively engages in speech perception: A direct cortical stimulation study., Neuropsychologia, 49, 1350, 10.1016/j.neuropsychologia.2011.01.023
Obleser, 2008, Bilateral speech comprehension reflects differential sensitivity to spectral and temporal features., Journal of Neuroscience, 28, 8116, 10.1523/JNEUROSCI.1290-08.2008
Oostenveld, 2011, FieldTrip: Open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data., Computational Intelligence and Neuroscience, 2011, 156869, 10.1155/2011/156869
Pascual-Marqui, 2002, Standardized low-resolution brain electromagnetic tomography (sLORETA): Technical details., Methods and Findings in Experimental and Clinical Pharmacology, 24(Suppl. D), 5
Perrodin, 2011, Voice cells in the primate temporal lobe., Current Biology, 21, 1408, 10.1016/j.cub.2011.07.028
Poeppel, 2003, The analysis of speech in different temporal integration windows: Cerebral lateralization as “asymmetric sampling in time”., Speech Communication, 41, 245, 10.1016/S0167-6393(02)00107-3
Remedios, 2009, An auditory region in the primate insular cortex responding preferentially to vocal communication sounds., Journal of Neuroscience, 29, 1034, 10.1523/JNEUROSCI.4089-08.2009
Renvall, 2012, Of cats and women: Temporal dynamics in the right temporoparietal cortex reflect auditory categorical processing of vocalizations., Neuroimage, 62, 1877, 10.1016/j.neuroimage.2012.06.010
Rosen, 2011, Hemispheric asymmetries in speech perception: Sense, nonsense and modulations., PLoS One, 6, e24672, 10.1371/journal.pone.0024672
Schall, 2013, Early auditory sensory processing of voices is facilitated by visual mechanisms., Neuroimage, 77, 237, 10.1016/j.neuroimage.2013.03.043
Schweinberger, 2001, Human brain potential correlates of voice priming and voice recognition., Neuropsychologia, 39, 921, 10.1016/S0028-3932(01)00023-9
Scott, 2000, Identification of a pathway for intelligible speech in the left temporal lobe., Brain: A Journal of Neurology, 123, 2400, 10.1093/brain/123.12.2400
Shahin, 2005, Modulation of P2 auditory-evoked responses by the spectral complexity of musical sounds., NeuroReport, 16, 1781, 10.1097/01.wnr.0000185017.29316.63
Stevens, 2004, Dissociating the cortical basis of memory for voices, words and tones., Brain Research, Cognitive Brain Research, 18, 162, 10.1016/j.cogbrainres.2003.10.008
Taulu, 2004, Suppression of interference and artifacts by the signal space separation method., Brain Topography, 16, 269, 10.1023/B:BRAT.0000032864.93890.f9
Treue, 1999, Feature-based attention influences motion processing gain in macaque visual cortex., Nature, 399, 575, 10.1038/21176
Van Lancker, 1987, Voice discrimination and recognition are separate abilities., Neuropsychologia, 25, 829, 10.1016/0028-3932(87)90120-5
Van Lancker, 1982, Impairment of voice and face recognition in patients with hemispheric damage., Brain and Cognition, 1, 185, 10.1016/0278-2626(82)90016-1
Van Lancker, 1988, Phonagnosia: A dissociation between familiar and unfamiliar voices., Cortex, 24, 195, 10.1016/S0010-9452(88)80029-7
Van Lancker, 1989, Voice perception deficits: Neuroanatomical correlates of phonagnosia., Journal of Clinical and Experimental Neuropsychology, 11, 665, 10.1080/01688638908400923
von Kriegstein, 2003, Modulation of neural responses to speech by directing attention to voices or verbal content., Brain Research, Cognitive Brain Research, 17, 48, 10.1016/S0926-6410(03)00079-X
von Kriegstein, 2010, How the human brain recognizes speech in the context of changing speakers., Journal of Neuroscience, 30, 629, 10.1523/JNEUROSCI.2742-09.2010
Watson, 2012, Sound-induced activity in voice-sensitive cortex predicts voice memory ability., Frontiers in Psychology, 3, 89, 10.3389/fpsyg.2012.00089
Zaske, 2009, In the ear of the beholder: Neural correlates of adaptation to voice gender., European Journal of Neuroscience, 30, 527, 10.1111/j.1460-9568.2009.06839.x