Dynamically adapted context-specific hyper-articulation: Feedback from interlocutors affects speakers’ subsequent pronunciations

Journal of Memory and Language - Volume 89 - Pages 68-86 - 2016
Esteban Buz (1), Michael K. Tanenhaus (1, 2), T. Florian Jaeger (1, 2, 3)
(1) Department of Brain and Cognitive Sciences, University of Rochester, United States
(2) Department of Linguistics, University of Rochester, United States
(3) Department of Computer Science, University of Rochester, United States
