Skip to main content

Showing 1–2 of 2 results for author: Wheelwright, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:1903.08082  [pdf, other

    cs.MA cs.LG

    Learning Reciprocity in Complex Sequential Social Dilemmas

    Authors: Tom Eccles, Edward Hughes, János Kramár, Steven Wheelwright, Joel Z. Leibo

    Abstract: Reciprocity is an important feature of human social interaction and underpins our cooperative nature. What is more, simple forms of reciprocity have proved remarkably resilient in matrix game social dilemmas. Most famously, the tit-for-tat strategy performs very well in tournaments of Prisoner's Dilemma. Unfortunately this strategy is not readily applicable to the real world, in which options to c… ▽ More

    Submitted 19 March, 2019; originally announced March 2019.

  2. arXiv:1812.07019  [pdf, other

    cs.NE cs.MA q-bio.PE

    Malthusian Reinforcement Learning

    Authors: Joel Z. Leibo, Julien Perolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel

    Abstract: Here we explore a new algorithmic framework for multi-agent reinforcement learning, called Malthusian reinforcement learning, which extends self-play to include fitness-linked population size dynamics that drive ongoing innovation. In Malthusian RL, increases in a subpopulation's average return drive subsequent increases in its size, just as Thomas Malthus argued in 1798 was the relationship betwe… ▽ More

    Submitted 3 March, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

    Comments: 9 pages, 2 tables, 4 figures