Higher-order structures of local collaboration networks are associated with individual scientific productivity

Wenlong Yang, Yang Wang1
1School of Public Policy and Administration, Xi’an Jiaotong University, Xi’an, China

Tóm tắt

The prevalence of teamwork in contemporary science has raised new questions about collaboration networks and the potential impact on research outcomes. Previous studies primarily focused on pairwise interactions between scientists when constructing collaboration networks, potentially overlooking group interactions among scientists. In this study, we introduce a higher-order network representation using algebraic topology to capture multi-agent interactions, i.e., simplicial complexes. Our main objective is to investigate the influence of higher-order structures in local collaboration networks on the productivity of the focal scientist. Leveraging a dataset comprising more than 3.7 million scientists from the Microsoft Academic Graph, we uncover several intriguing findings. Firstly, we observe an inverted U-shaped relationship between the number of disconnected components in the local collaboration network and scientific productivity. Secondly, there is a positive association between the presence of higher-order loops and individual scientific productivity, indicating the intriguing role of higher-order structures in advancing science. Thirdly, these effects hold across various scientific domains and scientists with different impacts, suggesting strong generalizability of our findings. The findings highlight the role of higher-order loops in shaping the development of individual scientists, thus may have implications for nurturing scientific talent and promoting innovative breakthroughs.

Từ khóa


Tài liệu tham khảo

Fortunato S, Bergstrom C, Borner K, Evans J, Helbing D, Milojevic S, Petersen A, Radicchi F, Sinatra R, Uzzi B, Vespignani A, Waltman L, Wang D, Barabasi A (2018) Science of science. Science 359(1):6379 Zeng A, Shen Z, Zhou J, Wu J, Fan Y, Wang Y, Stanley H (2017) The science of science: from the perspective of complex systems. Phys Rep 714–715:1–73 Shrum W, Genuth J, Chompalov I (2007) Structures of scientific collaboration. MIT Press, Cambridge Katz J, Martin B (1997) What is research collaboration? Res Policy 26(1):1–18 de Solla Price D (1963) Little science, big science. Columbia University Press, New York Jones B (2009) The burden of knowledge and the “death of the renaissance man”: is innovation getting harder? Rev Econ Stud 76(1):283–317 Newman M (2001) The structure of scientific collaboration networks. Proc Natl Acad Sci USA 98(2):404–409 Wuchty S, Jones B, Uzzi B (2007) The increasing dominance of teams in production of knowledge. Science 316(5827):1036–1039 Jones B, Wuchty S, Uzzi B (2008) Multi-university research teams: shifting impact, geography, and stratification in science. Science 322(5905):1259–1262 Adams J (2013) The fourth age of research. Nature 497(7451):557–560 Newman M (2001) Scientific collaboration networks. I. Network construction and fundamental results. Phys Rev E 64(1):016131 Newman M (2001) Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Phys Rev E 64(1):016132 Newman M (2002) Assortative mixing in networks. Phys Rev Lett 89(20):208701 Ke Q, Ahn Y (2014) Tie strength distribution in scientific collaboration networks. Phys Rev E 90(3):032804 Pan R, Saramaki J (2012) The strength of strong ties in scientific collaboration networks. Europhys Lett 97(1):18007 Martin T, Ball B, Karrer B, Newman M (2013) Coauthorship and citation patterns in the physical review. Phys Rev E 88(1):012814 Ding Y (2011) Scientific collaboration and endorsement: network analysis of coauthorship and citation networks. J Informetr 5(1):187–203 Abbasi A, Hossain L, Uddin S, Rasmussen K (2011) Evolutionary dynamics of scientific collaboration networks: multi-levels and cross-time analysis. Scientometrics 89(2):687–710 Menichetti G, Remondini D, Panzarasa P, Mondragon R, Bianconi G (2014) Weighted multiplex networks. PLoS ONE 9(6):e97857 Tahmooresnejad L, Beaudry C, Mirnezami S (2021) The study of network effects on research impact in Africa. Sci Public Policy 48(4):462–473 Tahmooresnejad L, Beaudry C (2018) The importance of collaborative networks in Canadian scientific research. Ind Innov 25(10):990–1029 Wang J (2016) Knowledge creation in collaboration networks: effects of tie configuration. Res Policy 45(1):68–80 Gonzalez-Brambila C, Veloso F, Krackhardt D (2013) The impact of network embeddedness on research output. Res Policy 42(9):1555–1567 Li E, Liao C, Yen H (2013) Co-authorship networks and research impact: a social capital perspective. Res Policy 42(9):1515–1530 Abbasi A, Altmann J, Hossain L (2011) Identifying the effects of co-authorship networks on the performance of scholars: a correlation and regression analysis of performance measures and social network analysis measures. J Informetr 5(4):594–607 Guan J, Pang L (2018) Bidirectional relationship between network position and knowledge creation in scientometrics. Scientometrics 115(1):201–222 Guan J, Zhang J, Yan Y (2015) The impact of multilevel networks on innovation. Res Policy 44(3):545–559 Fronczak A, Mrowinski M, Fronczak P (2022) Scientific success from the perspective of the strength of weak ties. Sci Rep 12(1):5074 AlShebli B, Rahwan T, Woon W (2018) The preeminence of ethnic diversity in scientific collaboration. Nat Commun 9(1):5163 Dong Y, Ma H, Tang J, Wang K (2018) Collaboration diversity and scientific impact. Preprint. arXiv:1806.03694 Freeman R, Huang W (2014) Strength in diversity. Nature 513(7518):305 Wagner C, Whetsell T, Mukherjee S (2019) International research collaboration: novelty, conventionality, and atypicality in knowledge recombination. Res Policy 48(5):1260–1270 Chen W, Yan Y (2023) New components and combinations: the perspective of the internal collaboration networks of scientific teams. J Informetr 17(2):101407 Liu M, Jaiswal A, Bu Y, Min C, Yang S, Liu Z, Acuna D, Ding Y (2022) Team formation and team impact: the balance between team freshness and repeat collaboration. J Informetr 16(4):101337 Petersen A (2015) Quantifying the impact of weak, strong, and super ties in scientific careers. Proc Natl Acad Sci USA 112(34):E4671–E4680 Xu F, Wu L, Evans J (2022) Flat teams drive scientific innovation. Proc Natl Acad Sci USA 119(23):e2200927119 Yang Y, Tian T, Woodruff T, Jones B, Uzzi B (2022) Gender-diverse teams produce more novel and higher-impact scientific ideas. Proc Natl Acad Sci USA 119(36):e2200841119 Zeng A, Fan Y, Di Z, Wang Y, Havlin S (2021) Fresh teams are associated with original and multidisciplinary research. Nat Hum Behav 5(10):1314–1322 Lin Y, Frey CB, Wu L (2023) Remote collaboration fuses fewer breakthrough ideas. Nature 623(7989):987–991 Horak D, Jost J (2013) Spectra of combinatorial Laplace operators on simplicial complexes. Adv Math 244(2):303–336 Jiang B, Omer I (2007) Spatial topology and its structural analysis based on the concept of simplicial complex. Trans GIS 11(6):943–960 Cooper J, Dutle A (2012) Spectra of uniform hypergraphs. Linear Algebra Appl 436(9):3268–3292 Ghoshal G, Zlatic V, Caldarelli G, Newman M (2009) Random hypergraphs and their applications. Phys Rev E 79(6):066118 Gao T, Li F (2018) Studying the utility preservation in social network anonymization via persistent homology. Comput Secur 77:49–64 Saggar M, Sporns O, Gonzalez-Castillo J, Bandettini P, Carlsson G, Glover G, Reiss A (2018) Towards a new approach to reveal dynamical organization of the brain using topological data analysis. Nat Commun 9(1):1399 Santos F, Raposo E, Coutinho M, Copelli M, Stam C, Douw L (2019) Topological phase transitions in functional brain networks. Phys Rev E 100(3):032414 Mariani M, Ren Z, Bascompte J, Tessone C (2019) Nestedness in complex networks: observation, emergence, and implications. Phys Rep 813:1–90 Valverde S, Vidiella B, Montanez R, Fraile A, Sacristan S, Garcia-Arenal F (2020) Coexistence of nestedness and modularity in host-pathogen infection networks. Nat Ecol Evol 4(4):568–577 Sanchez A (2019) Defining higher-order interactions in synthetic ecology: lessons from physics and quantitative genetics. Cell Syst 9(6):519–520 Guerrero R, Scarpino S, Rodrigues J, Hartl D, Ogbunugafor C (2019) Proteostasis environment shapes higher-order epistasis operating on antibiotic resistance. Genetics 212(2):565–575 Carstens C, Horadam K (2013) Persistent homology of collaboration networks. Math Probl Eng 2013(1):815035 Gebhart T, Funk R (2020) The emergence of higher-order structure in scientific and technological knowledge networks. Preprint. arXiv:2009.13620 Juul J, Benson A, Kleinberg J (2022) Hypergraph patterns and collaboration structure. Preprint. arXiv:2210.02163 Patania A, Petri G, Vaccarino F (2017) The shape of collaborations. EPJ Data Sci 6:18 Salnikov V, Cassese D, Lambiotte R (2018) Co-occurrence simplicial complexes in mathematics: identifying the holes of knowledge. Appl Netw Sci 31(1):37 Reimann M, Nolte M, Scolamiero M, Turner K, Perin R, Chindemi G, Dlotko P, Levi R, Hess K, Markram H (2017) Cliques of neurons bound into cavities provide a missing link between structure and function. Front Comput Neurosci 11:48 Sizemore A, Giusti C, Kahn A, Vettel J, Betzel R, Bassett D (2018) Cliques and cavities in the human connectome. J Comput Neurosci 44(1):115–145 Milojevic S (2014) Principles of scientific research team formation and evolution. Proc Natl Acad Sci USA 111(11):3984–3989 Wu L, Wang D, Evans J (2019) Large teams develop and small teams disrupt science and technology. Nature 566(7744):378 Wang Y, Li N, Zhang B, Huang Q, Wu J, Wang Y (2023) The effect of structural holes on producing novel and disruptive research in physics. Scientometrics 128(3):1801–1823 Wang C, Rodan S, Fruin M, Xu XY (2014) Knowledge networks, collaboration networks, and exploratory innovation. Acad Manag J 57(2):484–514 Liu F, Holme P, Chiesa M, AlShebli B, Rahwan T (2023) Gender inequality and self-publication are common among academic editors. Nat Hum Behav 7(3):353–364 Liu F, Rahwan T, AlShebli B (2023) Non-white scientists appear on fewer editorial boards, spend more time under review, and receive fewer citations. Proc Natl Acad Sci USA 120(13):e2215324120 AlShebli B, Makovi K, Rahwan T (2020) The association between early career informal mentorship in academic collaborations and junior author performance. Nat Commun 11(1):6446 Sun Y, Livan G, Ma A, Latora V (2021) Interdisciplinary researchers attain better long-term funding performance. Commun Phys 4(1):263 Xie Y, Lin XH, Li J, He Q, Huang JM (2023) Caught in the crossfire: fears of Chinese-American scientists. Proc Natl Acad Sci USA 120(27):e2216248120 Huang J, Gates A, Sinatra R, Barabasi A (2020) Historical comparison of gender inequality in scientific careers across countries and disciplines. Proc Natl Acad Sci USA 117(9):4609–4616 Zeng A, Fan Y, Di ZG, Wang YG, Havlin S (2022) Impactful scientists have higher tendency to involve collaborators in new topics. Proc Natl Acad Sci USA 119(33):e2207436119 Wang K, Shen Z, Huang C, Wu C-H, Dong Y, Kanakia A (2020) Microsoft academic graph: when experts are not enough. Quant Sci Stud 1(1):396–413 Zhang L, Lu W, Yang J (2021) LAGOS-AND: a large gold standard dataset for scholarly author name disambiguation. Preprint. arXiv:2104.01821 Battiston F, Cencetti G, Iacopini I, Latora V, Lucas M, Patania A, Young JG, Petri G (2020) Networks beyond pairwise interactions: structure and dynamics. Phys Rep 874(1):1–92 Bianconi G (2021) Higher-order networks: an introduction to simplicial complexes. Cambridge University Press, Cambridge Carlsson G (2009) Topology and data. Bull Am Math Soc 46(2):255–308 Horak D, Maletic S, Rajkovic M (2009) Persistent homology of complex networks. J Stat Mech Theory Exp 2009(3):P03034 Otter N, Porter M, Tillmann U, Grindrod P, Harrington H (2017) A roadmap for the computation of persistent homology. EPJ Data Sci 6:17 Blau DM, Weinberg BA (2017) Why the US science and engineering workforce is aging rapidly. Proc Natl Acad Sci USA 114(15):3879–3884 Fronczak P, Fronczak A, Holyst JA (2007) Analysis of scientific productivity using maximum entropy principle and fluctuation-dissipation theorem. Phys Rev E 75(2):026103 Gourieroux C, Monfort A, Trognon A (1984) Pseudo maximum-likelihood methods – applications to Poisson models. Econometrica 52(3):701–720 Angrist J, Pischke J (2009) Mostly harmless econometrics: an empiricist’s companion. Princeton University Press, Princeton Dehaan E (2021) Using and interpreting fixed effects models. Working paper, University of Washington. www.ssrn.com/abstract_id=3699777 Granovetter M (1973) The strength of weak ties. Am J Sociol 78(6):1360–1380 Rodan S, Galunic C (2004) More than network structure: how knowledge heterogeneity influences managerial performance and innovativeness. Strateg Manag J 25(6):541–562 Eagle N, Macy M, Claxton R (2010) Network diversity and economic development. Science 328(5981):1029–1031 Burt R (2004) Structural holes and good ideas. Am J Sociol 110(2):349–399 Hargadon A, Sutton R (1997) Technology brokering and innovation in a product development firm. Adm Sci Q 42(4):716–749 Ugander J, Backstrom L, Marlow C, Kleinberg J (2012) Structural diversity in social contagion. Proc Natl Acad Sci USA 109(16):5962–5966 Arora A, Gambardella A (1990) Complementarity and external linkages: the strategies of the large firms in biotechnology. J Ind Econ 38(4):361–379 Berg S, Duncan J, Friedman P (1982) Joint venture strategies and corporate innovation. Oelgeschlager, Gunn & Hain. xvi, 192 pages: illustrations Richardson G (1972) The organisation of industry. Econ J 82(327):883–896 Ahuja G (2000) Collaboration networks, structural holes, and innovation: a longitudinal study. Adm Sci Q 45(3):425–455 Jaffe A, Trajtenberg M, Henderson R (1993) Geographic localization of knowledge spillovers as evidenced by patent citations. Q J Econ 108(3):577–598 Bikard M, Murray F, Gans J (2015) Exploring trade-offs in the organization of scientific work: collaboration and scientific reward. Manag Sci 61(7):1473–1495 Leahey E (2016) From sole investigator to team scientist: trends in the practice and study of research collaboration. Annu Rev Sociol 42(1):81–100 Petersen AM (2015) Quantifying the impact of weak, strong, and super ties in scientific careers. Proc Natl Acad Sci 112(34):E4671–E4680 Lorenz J, Rauhut H, Schweitzer F, Helbing D (2011) How social influence can undermine the wisdom of crowd effect. Proc Natl Acad Sci USA 108(22):9020–9025 Amason A, Sapienza H (1997) The effects of top management team size and interaction norms on cognitive and affective conflict. J Manag 23(4):495–516 Iacopini I, Petri G, Barrat A, Latora V (2019) Simplicial models of social contagion. Nat Commun 10:2485 Jabbehdari S, Walsh J (2017) Authorship norms and project structures in science. Sci Technol Human Values 42(5):872–900 Shapin S (1989) The invisible technician. Am Sci 77(6):554–563 Conroy G (2023) Surge in number of ‘extremely productive’ authors concerns scientists. Nature. https://doi.org/10.1038/d41586-023-03865-y Aksnes DW, Langfeldt L, Wouters P (2019) Citations, citation indicators, and research quality: an overview of basic concepts and theories. SAGE Open 9(1):1–17 Wang J, Veugelers R, Stephan P (2017) Bias against novelty in science: a cautionary tale for users of bibliometric indicators. Res Policy 46(8):1416–1436 Uzzi B, Mukherjee S, Stringer M, Jones B (2013) Atypical combinations and scientific impact. Science 342(6157):468–472 Schumpeter J (1934) The theory of economic development. Harvard University Press, Cambridge Funk R, Owen-Smith J (2017) A dynamic network measure of echnological change. Manag Sci 63(3):791–817 Stirling A (2007) A general framework for analysing diversity in science, technology and society. J R Soc Interface 4(15):707–719