The economics analysis of a Q-learning model of cooperation with punishment and risk taking preferences

Nazaria Solferino1, Viviana Solferino2, Serena Fiona Taurino1
1Economics Department, University of Rome “Tor Vergata”, Rome, Italy
2Mathematics and Computer Science Department, University of Calabria, Rende, Italy

Tóm tắt

Từ khóa


Tài liệu tham khảo

Andreoni J (1989) Giving with impure altruism: applications to charities and Ricardian equivalence. J Polit Econ 97(6):1447–1458

Andreoni J (1990) Impure altruism and donations to public goods: a theory of warm glow giving. Econ J 100(401):464–477

Becchetti L, Pelligra V, Taurino SF (2015) Other regarding preferences and reciprocity: insights from experimental findings and satisfaction. Data CEIS working paper no. 363

Beggs AW (2005) On the convergence of reinforcement learning. J Econ Theory 122:1–36

Boyd R, Richerson PJ (1988) The evolution of reciprocity in sizeable groups. J Theor Biol 131:337–356

Boyd R, Richerson PJ (1992) Punishment allows the evolution of cooperation (or anything else) in sizable groups. Ethol Sociobiol 13(5):171–195

Brandt H, Hauert C, Sigmund K (2006) Punishing and abstaining for public goods. Proc Natl Acad Sci 103:495–497

Bruni L (2006) Reciprocità. Dinamiche di cooperazione, economia e società civile. Mondadori Eds, Milan

Burton-Chellew MN, Nax HH, West SA (2015) Payoff-based learning explains the decline in cooperation in public goods games. Proc R Soc Lond Ser B 282:20142678

Dawes RM (1980) Social dilemmas. Annu Rev Psychol 31:169–193

Dercole F, DeCarli M, Della Rossa F, Papadopoulos AV (2013) Overpunishing is not necessary to fix cooperation in voluntary public goods games. J Theor Biol 326:70–81

Efferson C, Roca CP, Vogt S, Helbing D (2016) Sustained cooperation by running away from bad behavior. Evol Hum Behav 37(1):1–9

Egas M, Riedl A (2008) The economics of altruistic punishment and the maintenance of cooperation. Proc Biol Sci 275:871–878

Fehr E, Gachter S (2002) Altruistic punishment in humans. Nature 415:137–140

Fowler JH (2005) Altruistic punishment and the origin of cooperation. Proc Natl Acad Sci 102:7047–7049

Gómez-Gardeñes J, Poncela J, Floria LM, Moreno YJ (2008) Natural selection of cooperation and degree hierarchy in heterogeneous populations. J Theor Biol 253:296–301

Hardin G (1968) The tragedy of the commons. Science 162:1243–1248

Hauert C, Schuster P (1998) Extending the iterated prisoner’s dilemma without synchrony. J Theor Biol 192(2):155–166

Hauert C, Traulsen A, Brandt H, Nowak MA, Sigmund K (2007) Via freedom to coercion: the emergence of costly punishment. Science 316:1905–1907

Hauert C, Traulsen A, Brandt H, Nowak MA, Sigmund K (2008) Public goods with punishment and abstaining in finite and infinite populations. Biol Theory 3(2):114–122

Kagel J, Roth A (1997) The handbook of experimental economics. Princeton University Press, Princeton

Kianercy A, Galstyan A (2012) Dynamics of Boltzmann Q learning in two-player two-action games. Phys Rev E 85:041145

Lewin AY, Long CP, Carroll TN (1999) The coevolution of new organizational forms. Organ Sci 10(5):535–550

Nakamaru M, Dieckmann U (2009) Runaway selection for cooperation and strict-and-severe punishment. J Theor Biol 257(1):1–8

Nax HH, Perc M (2015) Directional learning and the provisioning of public goods. Sci Rep 5:8010

Nax HH, Perc M, Szolnoki A, Helbing D (2015) Stability of cooperation under image scoring in group interactions. Sci Rep 5:12145

Nax HH, Burton-Chellew MN, West SA, Young PH (2016) Learning in a black box. J Econ Behav Organ 127:1–15

Perc M, Gómez-Gardeñes J, Szolnoki A, Floría LM, Moreno Y (2013) Evolutionary dynamics of group interactions on structured populations: a review. J R Soc Interface 10:20120997

Raub W, Snijders C (1997) Gains, losses and cooperation in social dilemmas and collective action: the effects of risk preferences. J Math Sociol 22:263–291

Santos FC, Pacheco JM (2005) Scale-free networks provide a unifying framework for the emergence of cooperation. Phys Rev Lett 95:098104

Santos FC, Santos MD, Pacheco JM (2008) Social diversity promotes the emergence of cooperation in public goods games. Nature 454:213–216

Sasaki T, Brannstrom A, Dieckmanna U, Sigmund K (2012) The take-it-or-leave-it option allows small penalties to overcome social dilemmas. Proc Natl Sci USA 109:1165–1169

Schuster P, Sigmund K (1983) Replicator dynamics. J Theor Biol 100:533–538

Sigmund K, DeSilva H, Traulsen A, Hauert C (2010) Social learning promotes institutions for governing the commons. Nature 466:861–863

Solferino N, Taurino SF (2016) The evolution of cooperation with punishment and second chance. Homo Oecon 32(3/4):387–400

Strogatz SH (2001) Nonlinear dynamics and chaos: with applications to physics, biology, chemistry, and engineering. Westview Press, Boulder

Van Assen MALM, Snijders C (2004) Effects of risk preferences in social dilemmas: a game-theoretical analysis and evidence from two experiments. Cambridge University Press, Cambridge

Waltman L, Kaymak U (2008) $$Q$$ Q -learning agents in a Cournot oligopoly model. J Econ Dyn Control 32(10):3275–3293

West SA, Griffin AS, Gardner A (2007) Evolutionary explanations for cooperation. Curr Biol 17:R661–R672

Xie MC, Tachibana A (2007) Cooperative behavior acquisition for multi-agent systems by Q-learning. In: Proceeding of the 2007 IEEE symposium on foundations of computational intelligence, FOCI 2007, 1–5 April 2007. IEEE, pp 424–428

Young HP (2009) Learning by trial and error. Games Econ Behav 65:626–643