The Large‐Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth

Cognitive Science - Tập 29 Số 1 - Trang 41-78 - 2005
Mark Steyvers1, Joshua B. Tenenbaum2
1Department of Cognitive Sciences, University of California, Irvine
2Department of Brain & Cognitive Sciences Massachusetts Institute of Technology

Tóm tắt

Abstract

We present statistical analyses of the large‐scale structure of 3 types of semantic networks: word associations, WordNet, and Roget's Thesaurus. We show that they have a small‐world structure, characterized by sparse connectivity, short average path lengths between words, and strong local clustering. In addition, the distributions of the number of connections follow power laws that indicate a scale‐free pattern of connectivity, with most nodes having relatively few connections joined together through a small number of hubs with many connections. These regularities have also been found in certain other complex natural networks, such as the World Wide Web, but they are not consistent with many conventional models of semantic organization, based on inheritance hierarchies, arbitrarily structured networks, or high‐dimensional vector spaces. We propose that these structures reflect the mechanisms by which semantic networks grow. We describe a simple model for semantic growth, in which each new word or concept is connected to an existing network by differentiating the connectivity pattern of an existing node. This model generates appropriate small‐world statistics and power‐law connectivity distributions, and it also suggests one possible mechanistic basis for the effects of learning history variables (age of acquisition, usage frequency) on behavioral performance in semantic processing tasks.

Từ khóa


Tài liệu tham khảo

10.1007/3-540-48155-9_27

Adamic L. A.(2000).Zipf power‐laws and pareto‐A ranking tutorial. Retrieved July 1 2003 fromhttp:www.hpl.hp.comresearchidlpapersrankingranking.html.

10.1103/RevModPhys.74.47

10.1038/43601

10.1038/35019019

10.1073/pnas.200327197

Anderson J. R., 2000, Learning and memory: An integrated approach

Balota D. A., 1999, Abstracts of the 40th Annual Meeting of the Psychonomics Society, 44

10.1126/science.286.5439.509

Block N.(1998).Semantics conceptual role.Routledge Encyclopedia of Philosophy Online. Retrieved fromhttp:www.rep.routledge.com.

Bollobas B., 1985, Random graphs

Brand M. ReyA. Peereman R. &Spieler D.(2001).Naming bisyllabic words: A large scale study. Paper presented at the 12th Conference of the European Society for Cognitive Psychology (ESCOP 2001) Edinburgh Scotland.

10.1016/S0169-7552(98)00110-X

10.1037/h0041727

Brown R., 1958, Words and things

10.1016/S0001-6918(00)00021-4

Carey S., 1978, Linguistic theory and psychological reality, 264

Carey S., 1985, Neonate cognition, 381

10.1080/14640747308400325

10.1515/9783112316009

Chomsky N., 1959, A review of B. F. Skinner's, Verbal behavior. Language, 35, 26

10.1017/CBO9780511554377

Clark E. V., 2001, Stanford papers in semantics, 23

10.1037/0033-295X.82.6.407

10.1016/S0022-5371(69)80069-1

Cormen T., 1990, Introduction to algorithms

Deese J., 1965, The structure of associations in language and thought

10.1037/0278-7393.26.5.1103

Erdüs P., 1960, On the evolution of random graphs, Publications of the Mathematical Institute of the Hungarian Academy of Sciences, 5, 17

Fellbaum C., 1998, WordNet, an electronic lexical database, 10.7551/mitpress/7287.001.0001

10.1093/0198236360.001.0001

10.1080/016909698386537

Gentner D., 1981, Some interesting differences between nouns and verbs, Cognition and Brain Theory, 4, 161

10.3758/BF03201693

10.1073/pnas.0307752101

Griffiths T. L., 2002, Proceedings of the Twenty‐Fourth Annual Conference of the Cognitive Science Society, 381

10.1080/09658219508253161

10.1006/brln.1998.1960

10.1038/35036627

Kandel E. R., 1991, Principles of neural science

10.4159/harvard.9780674181816

10.1038/35022643

Kleinberg J. M., 2002, Advances in neural information processing systems, 431

Kucera H., 1967, Computational analysis of present‐day American English

10.1016/S0028-3932(97)00169-3

10.1037/0033-295X.104.2.211

10.1080/01638539809545028

Levin B., 1993, English verb classes and alternations: A preliminary investigation

10.1016/S0010-0277(00)00117-7

Macnamara J., 1982, Names for things: A study in human learning

Manning C. D., 1999, Foundations of statistical natural language processing

Milgram S., 1967, The small‐world problem, Psychology Today, 2, 60

Miller G.A., 1995, WordNet: Anon‐line lexical database [Special issue], International Journal of Lexicography, 3, 4

10.1080/027249897392017

Nelson D. L. McEvoy C. L. &Schreiber T. A.(1999).The University of South Florida word association norms. Retrieved fromhttp:w3.usf.eduFreeAssociation.

10.1037/0033-295X.105.2.299

10.1073/pnas.98.2.404

10.1023/A:1005319718167

10.1073/pnas.96.14.8028

10.7551/mitpress/2014.001.0001

10.1037/e412952005-009

10.1037/0033-295X.103.1.56

10.1016/S0022-5371(73)80056-8

10.7551/mitpress/6161.001.0001

Roget P. M.(1911).Roget's Thesaurus of English Words and Phrases(1911 ed.). Retrieved October 28 2004 fromhttp:www.gutenberg.orgetext10681.

10.1093/biomet/42.3-4.425

Skinner B. F., 1937, The distribution of associated words, Psychological Record, 1, 71, 10.1007/BF03393192

Slobin D. I., 1973, Studies of child language development, 173

10.1006/cogp.1997.0672

10.1207/s15516709cog2202_2

Smith M. A., 2001, Advances in neural information processing systems, 52

10.1007/BF02378925

Stoyan D., 1995, Stochastic geometry and its applications

10.1038/35065725

10.3758/BF03201200

10.1037/0033-295X.84.4.327

10.1037/0033-295X.93.1.3

10.1080/14640747508400525

10.1515/9780691188331

10.1038/30918

Zipf G. K., 1965, Human behavior and the principle of least effort