Reward prediction error

@article{Schultz2017RewardPE,
  title={Reward prediction error},
  author={Wolfram Schultz},
  journal={Current Biology},
  year={2017},
  volume={27},
  pages={R369-R371}
}
  • W. Schultz
  • Published 22 May 2017
  • Psychology, Biology
  • Current Biology
Dopamine role in learning and action inference
TLDR
Dopaminergic neurons projecting to different parts of the striatum encode errors in predictions made by the corresponding systems within the basal ganglia, which enable learning about rewards resulting from actions and habit formation.
Beyond Simple Tests of Value: A neuroeconomic, translational, disease-relevant, and circuit-based approach to resolve the computational complexity of decision making
TLDR
This dissertation presents a meta-anatomy of the central nervous system and its role in reward and punishment and describes its role as a “spatially aggregating force” in the response to disease.
The biological and behavioral computations that influence dopamine responses
  • WR Stauffer
  • Psychology, Biology
    Current Opinion in Neurobiology
  • 2018
Reward Prediction Errors Reflect an Underlying Learning Process That Parallels Behavioural Adaptations: A Trial-to-Trial Analysis
TLDR
It is found that neural measures did diminish with trial-to-trial changes in performance and that they were predictive of behavioural adaptations in both simulated and empirical data.
Is the encoding of Reward Prediction Error reliable during development?
An economic decision-making model of anticipated surprise with dynamic expectation
When making decisions under risk, people often exhibit behaviors that classical economic theories cannot explain. Newer models that attempt to account for these ‘irrational’ behaviors often lack
Feature Specific Prediction Errors and Surprise across Macaque Fronto-Striatal Circuits during Attention and Learning
TLDR
It is shown that neurons in all areas of the medial and lateral fronto-striatal networks encode prediction errors that are specific to separate features of attended multidimensional stimuli, with the most ubiquitous prediction error occurring for the reward relevant features.
VTA dopamine neuron activity encodes social interaction and promotes reinforcement learning through social prediction error
TLDR
This study shows that dopamine neurons of the ventral tegmental area of the VTA increase their activity during interactions with an unfamiliar conspecific and display heterogeneous responses, and suggests that VTA DA neurons are a neural substrate for a social learning signal that drives motivated behavior.
Behavioural variability and cortical electrophysiological signals depend on recent outcomes during human reinforcement motor learning
TLDR
The results suggest that S+ experiences ‘overwrite’ previous motor states to a greater extent than S- experiences and that modulations in neural oscillations in the prefrontal cortex play a potential role in encoding the (changes in) movement variability state during reinforcement motor learning.
...
...

References

SHOWING 1-10 OF 36 REFERENCES
Dopamine reward prediction-error signalling: a two-component response
  • W. Schultz
  • Psychology, Biology
    Nature Reviews Neuroscience
  • 2016
TLDR
It is suggested that an initial brief, unselective and highly sensitive increase in activity unspecifically detects a wide range of environmental stimuli, then quickly evolves into the main response component, which reflects subjective reward value and utility.
Introduction to Reinforcement Learning
TLDR
In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.
Computational roles for dopamine in behavioural control
TLDR
The application of such quantitative models has opened up new fields, ripe for attack by young synthesizers and theoreticians, to describe the biological algorithms at play in their brains when the authors form value judgements and make choices.
Getting Formal with Dopamine and Reward
A Neural Correlate of Reward-Based Behavioral Learning in Caudate Nucleus: A Functional Magnetic Resonance Imaging Study of a Stochastic Decision Task
TLDR
Functional magnetic resonance imaging of a stochastic decision task involving monetary rewards, in which subjects had to learn behaviors involving different task difficulties that were controlled by probability, found that activity in the caudate nucleus was correlated with short-term reward and paralleled the magnitude of a subject's behavioral change during learning.
Importance of unpredictability for reward responses in primate dopamine neurons.
TLDR
D dopamine neurons responded to liquid reward during learning but not when task performance was established, at which time the neuronal response occurred to the conditioned, reward-predicting, movement-triggering stimulus.
...
...