Social reward shaping in the prisoner's dilemma

  title={Social reward shaping in the prisoner's dilemma},
  author={Monica Babes-Vroman and Enrique Munoz de Cote and Michael L. Littman},
Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to nearoptimal behavior. In this paper, we introduce social reward shaping, which is reward shaping applied in the multiagentlearning framework. We present preliminary experiments in the iterated Prisoner’s dilemma setting that show that agents using social reward shaping appropriately can behave more effectively than other classical learning and nonlearning strategies. In particular, we… CONTINUE READING
Highly Cited
This paper has 50 citations. REVIEW CITATIONS
33 Citations
7 References
Similar Papers


Publications citing this paper.

fewer than 50 Citations

Citations per Year
Semantic Scholar estimates that this publication has 50 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…