Distributional social semantics: Inferring word meanings from communication patterns

Cognitive Psychology - Tập 131 - Trang 101441 - 2021
Brendan T. Johns1
1Department of Psychology, McGill University, 2001 McGill College Avenue, Montreal, Quebec H3A 1G1, Canada

Tài liệu tham khảo

Abbot-Smith, 2006, Exemplar-learning and schematization in a usage-based account of syntactic acquisition, The Linguistic Review, 23, 275, 10.1515/TLR.2006.011 Adelman, 2006, Contextual diversity, not word frequency, determines word-naming and lexical decision times, Psychological Science, 17, 814, 10.1111/j.1467-9280.2006.01787.x Adelman, 2008, Modeling lexical decision: The form of frequency and diversity effects, Psychological Review, 115, 214, 10.1037/0033-295X.115.1.214 Andrews, 2009, Integrating experiential and distributional data to learn semantic representations, Psychological Review, 116, 463, 10.1037/a0016261 Balota, 2004, Visual word recognition of single-syllable words, Journal of Experimental Psychology: General, 133, 283, 10.1037/0096-3445.133.2.283 Balota, 2007, The English Lexicon Project, Behavior Research Methods, 39, 445, 10.3758/BF03193014 Bannard, 2009, Modeling children's early grammatical knowledge, Proceedings of the National Academy of Sciences, 106, 17284, 10.1073/pnas.0905638106 Baroni, 2014, Don’t count, predict! a systematic comparison of context-counting vs. context-predicting semantic vectors, 238 Barsalou, 1999, Perceptual symbol systems, Behavioral and Brain Sciences, 22, 577, 10.1017/S0140525X99002149 Barsalou, 2008, Grounded cognition, Annual Review of Psychology, 59, 617, 10.1146/annurev.psych.59.103006.093639 Baumgartner, J., Zannettou, S., Keegan, B., Squire, M., & Blackburn, J. (2020, May). The pushshift reddit dataset. In Proceedings of the International AAAI Conference on Web and Social Media (Vol. 14, pp. 830-839). Beckner, 2009, Language is a complex adaptive system: Position paper, Language Learning, 59, 1, 10.1111/j.1467-9922.2009.00533.x Bhatia, 2017, Associative judgment and vector space semantics, Psychological Review, 124, 1, 10.1037/rev0000047 Bhatia, 2019, Distributed semantic representations for modelling human judgment, Current Opinion in Behavioral Sciences, 29, 31, 10.1016/j.cobeha.2019.01.020 Bhatia, 2018, Naturalistic multiattribute choice, Cognition, 179, 71, 10.1016/j.cognition.2018.05.025 Bock, 1986, Syntactic persistence in language production, Cognitive Psychology, 18, 355, 10.1016/0010-0285(86)90004-6 Borghi, 2014, Vol. 2 Bruni, 2014, Multimodal distributional semantics, Journal of Artificial Intelligence Research, 49, 1, 10.1613/jair.4135 Brysbaert, 2009, Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English, Behavior Research Methods, 41, 977, 10.3758/BRM.41.4.977 Brysbaert, 2016, How many words do we know? Practical estimates of vocabulary size dependent on word definition, the degree of language input and the participant’s age, Frontiers in Psychology, 7, 1116, 10.3389/fpsyg.2016.01116 Brysbaert, 2019, Word prevalence norms for 62,000 English lemmas, Behavior Research Methods, 51, 467, 10.3758/s13428-018-1077-9 Buchanan, 2001, Characterizing semantic space: Neighborhood effects in word recognition, Psychonomic Bulletin & Review, 8, 531, 10.3758/BF03196189 Bullinaria, 2007, Extracting semantic representations from word co-occurrence statistics: A computational study, Behavior Research Methods, 39, 510, 10.3758/BF03193020 Bullinaria, 2012, Extracting semantic representations from word co-occurrence statistics: Stop-lists, stemming, and SVD, Behavior Research Methods, 44, 890, 10.3758/s13428-011-0183-8 Bruni, E., Boleda, G., Baroni, M., & Tran, N. K. (2012). Distributional semantics in technicolor. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1 (pp. 136-145). Association for Computational Linguistics. Caliskan, 2017, Semantics derived automatically from language corpora contain human-like biases, Science, 356, 183, 10.1126/science.aal4230 Christiansen, 2008, Language as shaped by the brain, Behavioral and Brain Sciences, 31, 489, 10.1017/S0140525X08004998 Coltheart, 2001, DRC: A dual route cascaded model of visual word recognition and reading aloud, Psychological Review, 108, 204, 10.1037/0033-295X.108.1.204 Crump, 2020, Controlling the retrieval of general versus specific semantic knowledge in the instance theory of semantic memory De Deyne, 2021, Visual and Affective Multimodal Models of Word Meaning in Language and Mind, Cognitive Science, 45, 10.1111/cogs.12922 Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan, Z., Wolfman, G., & Ruppin, E. (2001). Placing search in context: The concept revisited. In Proceedings of the 10th international conference on World Wide Web (pp. 406-414). ACM. Firth, 1957 Gärdenfors, 2018, Levels of communication and lexical semantics, Synthese, 195, 549, 10.1007/s11229-014-0493-3 Glenberg, 2000, Symbol grounding and meaning: A comparison of high-dimensional and embodied theories of meaning, Journal of Memory and Language, 43, 379, 10.1006/jmla.2000.2714 Goh, 2020, The Auditory English Lexicon Project: A multi-talker, multi-region psycholinguistic database of 10,170 spoken words and nonwords, Behavior Research Methods, 52, 2202, 10.3758/s13428-020-01352-0 Goldberg, Y., Levy, O., 2014. word2vec Explained: deriving Mikolov et al.’s negative-sampling word-embedding method. arXiv preprint arXiv:1402.3722. Goldinger, 1998, Echoes of echoes? An episodic trace theory of lexical access, Psychological Review, 105, 251, 10.1037/0033-295X.105.2.251 Griffiths, 2007, Topics in semantic representation, Psychological Review, 114, 211, 10.1037/0033-295X.114.2.211 Günther, 2019, Vector-space models of semantic representation from a cognitive perspective: A discussion of common misconceptions, Perspectives on Psychological Science, 14, 1006, 10.1177/1745691619861372 Halawi, G., Dror, G., Gabrilovich, E., & Koren, Y. (2012). Large-scale learning of word relatedness with constraints. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and Data Mining (pp. 1406-1414). ACM. Herdağdelen, 2017, Social media and language processing: How Facebook and Twitter provide the best frequency estimates for studying word recognition, Cognitive Science, 41, 976, 10.1111/cogs.12392 Hills, 2012, Optimal foraging in semantic memory, Psychological Review, 119, 431, 10.1037/a0027373 Hintzman, 1986, “Schema abstraction” in a multiple-trace memory model, Psychological Review, 93, 411, 10.1037/0033-295X.93.4.411 Hoffman, 2013, Semantic diversity: A measure of semantic ambiguity based on variability in the contextual usage of words, Behavior Research Methods, 45, 718, 10.3758/s13428-012-0278-x Hollis, 2017, Estimating the average need of semantic knowledge from distributional semantic models, Memory and Cognition, 45, 1350, 10.3758/s13421-017-0732-1 Hollis, 2020, Delineating linguistic contexts, and the validity of context diversity as a measure of a word’s contextual variability, Journal of Memory and Language, 114, 104146, 10.1016/j.jml.2020.104146 Hsiao, 2018, Semantic diversity, frequency and the development of lexical quality in children’s word reading, Journal of Memory and Language, 103, 114, 10.1016/j.jml.2018.08.005 Jamieson, 2018, An instance theory of semantic memory, Computational Brain & Behavior, 1, 119, 10.1007/s42113-018-0008-2 Johns, 2010, Evaluating the random representation assumption of lexical semantics in cognitive models, Psychonomic Bulletin & Review, 17, 662, 10.3758/PBR.17.5.662 Johns, 2012, A synchronization account of false recognition, Cognitive Psychology, 65, 486, 10.1016/j.cogpsych.2012.07.002 Johns, 2012, Perceptual Inference through global lexical similarity, Topics in Cognitive Science, 4, 103, 10.1111/j.1756-8765.2011.01176.x Johns, 2015, Generating structure from experience: A retrieval-based model of language processing, Canadian Journal of Experimental Psychology, 69, 233, 10.1037/cep0000053 Johns, 2016, The influence of contextual diversity on word learning, Psychonomic Bulletin & Review, 23, 1214, 10.3758/s13423-015-0980-7 Johns, 2021, Content matters: Measures of contextual diversity must consider semantic content, PsyArXiv. Johns, 2018, Cognitive modeling as an interface between brain and behavior: Measuring the semantic decline in mild cognitive impairment, Canadian Journal of Experimental Psychology, 72, 117, 10.1037/cep0000132 Johns, 2018, A large-scale analysis of variance in written language, Cognitive Science, 42, 1360, 10.1111/cogs.12583 Johns, 2019, The influence of time and place on lexical behavior: A distributional analysis, Behavior Research Methods, 51, 2438, 10.3758/s13428-019-01289-z Johns, 2019, The role of negative information in distributional semantic learning, Cognitive Science, 43, e12730, 10.1111/cogs.12730 Johns, 2019, Using experiential optimization to build lexical representations, Psychonomic Bulletin & Review, 26, 103, 10.3758/s13423-018-1501-2 Johns, 2021, A continuous source reinstatement model of true and false recollection, Canadian Journal of Experimental Psychology, 75, 1, 10.1037/cep0000237 Johns, 2020, Estimating the prevalence and diversity of words in written language, Quarterly Journal of Experimental Psychology, 73, 841, 10.1177/1747021819897560 Johns, 2020, The continued importance of theory: Lessons from big data approaches to cognition Johns, 2020, Production without rules: Using an instance memory model to exploit structure in natural language, Journal of Memory and Language, 115, 104165, 10.1016/j.jml.2020.104165 Johns, 2019, Mining a crowdsourced dictionary to understand consistency and preference in word meanings, Frontiers in Psychology, 10, 10.3389/fpsyg.2019.00268 Johns, 2021, Disentangling contextual diversity: Communicative need as a lexical organizer, Psychological Review, 128, 525, 10.1037/rev0000265 Johns, B. T. (in press). Accounting for item-level variance in recognition memory: Comparing word frequency and contextual diversity. Memory & Cognition. Jones, 2007, Representing word meaning and order information in a composite holographic lexicon, Psychological Review, 114, 1, 10.1037/0033-295X.114.1.1 Jones, 2012, The role of semantic diversity in lexical organization, Canadian Journal of Experimental Psychology, 66, 115, 10.1037/a0026727 Jones, M. N., Willits, J., & Dennis, S. (2014). Models of semantic memory. In J. R. Busemeyer & J. T. Townsend (Eds.) Oxford Handbook of Mathematical and Computational Psychology. Jones, M. N., Dye, M., & Johns, B. T. (2017). Context as an organizational principle of the lexicon. In B. Ross (Ed.), The Psychology of Learning and Motivation, 67:43. Jones, 2018, When does abstraction occur in semantic memory: Insights from distributional models, Language, Cognition and Neuroscience, 34, 1338, 10.1080/23273798.2018.1431679 Keuleers, 2012, The British Lexicon Project: Lexical decision data for 28,730 monosyllabic and disyllabic English words, Behavior Research Methods, 44, 287, 10.3758/s13428-011-0118-4 Kumar, 2020, Semantic memory: A review of methods, models, and current challenges, Psychonomic Bulletin & Review, 1 Kwantes, 2005, Using context to build semantics, Psychonomic Bulletin & Review, 12, 703, 10.3758/BF03196761 Landauer, 1997, A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge, Psychological Review, 104, 211, 10.1037/0033-295X.104.2.211 Lazaridou, 2017, Multimodal word meaning induction from minimal exposure to natural text, Cognitive Science, 41, 677, 10.1111/cogs.12481 Levy, 2014, Neural word embedding as implicit matrix factorization, 2177 Levy, 2015, Improving distributional similarity with lessons learned from word embedding, Transactions of the Association for Computational Linguistics, 3, 211, 10.1162/tacl_a_00134 Lieven, 1997, Lexically-based learning and early grammatical development, Journal of Child Language, 24, 187, 10.1017/S0305000996002930 Lynott, 2019, The Lancaster Sensorimotor Norms: Multidimensional measures of perceptual and action strength for 40,000 English words, Behavior Research Methods, 52, 1271, 10.3758/s13428-019-01316-z Mak, 2021, Anchoring and contextual variation in the early stages of incidental word learning during reading, Journal of Memory and Language, 118, 104203, 10.1016/j.jml.2020.104203 Mandera, 2020, Recognition times for 62 thousand English words: Data from the English Crowdsourcing Project, Behavior Research Methods, 52, 741, 10.3758/s13428-019-01272-8 Marelli, 2015, Affixation in semantic space: Modeling morpheme meanings with compositional distributional semantics, Psychological Review, 122, 485, 10.1037/a0039267 Marelli, 2017, Compounding as Abstract Operation in Semantic Space: Investigating relational effects through a large-scale, data-driven computational model, Cognition, 166, 207, 10.1016/j.cognition.2017.05.026 McRae, 2005, Semantic feature production norms for a large set of living and nonliving things, Behavior Research Methods, 37, 547, 10.3758/BF03192726 Mewhort, 2018, Release from PI: An analysis and a model, Psychonomic Bulletin & Review, 25, 932, 10.3758/s13423-017-1327-3 Mikolov, 2013, Distributed representations of words and phrases and their compositionality, 3111 Murray, 2004, Serial mechanisms in lexical access: The rank hypothesis, Psychological Review, 111, 721, 10.1037/0033-295X.111.3.721 Osth, 2020, Global semantic similarity effects in recognition memory: Insights from BEAGLE representations and the diffusion decision model, Journal of Memory and Language, 111, 104071, 10.1016/j.jml.2019.104071 van Overschelde, 2004, Category norms: An updated and expanded version of the norms, Journal of Memory and Language, 50, 289, 10.1016/j.jml.2003.10.003 Park, 2018, Examining thematic similarity, difference, and membership in three online mental health communities from Reddit: A text mining and visualization approach, Computers in Human Behavior, 78, 98, 10.1016/j.chb.2017.09.001 Pickering, 2008, Structural priming: A critical review, Psychological Bulletin, 134, 427, 10.1037/0033-2909.134.3.427 Radinsky, K., Agichtein, E., Gabrilovich, E., & Markovitch, S. (2011, March). A word at a time: Computing word relatedness using temporal semantic analysis. In Proceedings of the 20th International Conference on World Wide Web (pp. 337-346). ACM. Recchia, 2009, More data trumps smarter algorithms: Comparing pointwise mutual information to latent semantic analysis, Behavior Research Methods, 41, 657, 10.3758/BRM.41.3.647 Recchia, 2015, Reproducing affective norms with lexical co-occurrence statistics: Predicting valence, arousal, and dominance, Quarterly Journal of Experimental Psychology, 68, 1584, 10.1080/17470218.2014.941296 Recchia, 2015, Encoding sequential information in semantic space models: Comparing holographic reduced representation and random permutation, Computational Intelligence and Neuroscience, 2015, 1, 10.1155/2015/986574 Riordan, 2011, Redundancy in perceptual and linguistic experience: Comparing feature-based and distributional models of semantic representation, Topics in Cognitive Science, 3, 303, 10.1111/j.1756-8765.2010.01111.x Rubenstein, 1965, Contextual correlates of synonymy, Communications of the ACM, 8, 627, 10.1145/365628.365657 Schwartz, 2013, Personality, gender, and age in the language of social media: The open-vocabulary approach, PLoS ONE, 8, e73791, 10.1371/journal.pone.0073791 Shaoul, 2010, Exploring lexical co-occurrence space using HiDEx, Behavior Research Methods, 42, 393, 10.3758/BRM.42.2.393 Shaoul, 2010 Shiffrin, 2008, A survey of model evaluation approaches with a tutorial on hierarchical Bayesian methods, Cognitive Science, 32, 1248, 10.1080/03640210802414826 Shiffrin, 1997, A model for recognition memory: REM—retrieving effectively from memory, Psychonomic Bulletin & Review, 4, 145, 10.3758/BF03209391 Snefjella, 2020, How emotion is learned: Semantic learning of novel words in emotional contexts, Journal of Memory and Language, 115, 104171, 10.1016/j.jml.2020.104171 Taler, 2013, A computational analysis of semantic structure in bilingual fluency, Journal of Memory and Language, 69, 607, 10.1016/j.jml.2013.08.004 Taler, 2020, A large scale semantic analysis of verbal fluency across the aging spectrum: Data from the Canadian longitudinal study on aging, Journal of Gerontology: Psychological Sciences, 75, e221, 10.1093/geronb/gbz003 Tomasello, 2003 Tomasello, 2009 Westbury, 2019, Conceptualizing syntactic categories as semantic categories: Unifying part-of-speech identification and semantics using co-occurrence vector averaging, Behavior Research Methods, 51, 1371, 10.3758/s13428-018-1118-4 Yap, 2011, Is more always better? Effects of semantic richness on lexical decision, speeded pronunciation, and semantic classification, Psychonomic Bulletin & Review, 18, 742, 10.3758/s13423-011-0092-y