• Corpus ID: 197679667

Score ! ! Time ? ? ? ? ? ? ? ? ?

  title={Score ! ! Time ? ? ? ? ? ? ? ? ?},
  • Published 2013
Take a brief pause from reality and imagine that you are the first-year coach of a professional basketball team. It is the first game of the season, and the score is tied with only seconds remaining. You call a timeout, consider your multitude of strategic options, and then decide to set up a play for the guy that everyone just calls C. The play begins, C gets the ball, and though double-covered by two hard-nosed defenders, gets off a long shot. Swish! The ball goes through the net, you win the… 


Temporal difference learning and TD-Gammon
The domain of complex board games such as Go, chess, checkers, Othello, and backgammon has been widely regarded as an ideal testing ground for exploring a variety of concepts and approaches in artificial intelligence and machine learning.
Solving the Distal Reward Problem with Rare Correlations
The hypothesis that short-term synaptic plasticity may implement eligibility traces and thereby serve as a selection mechanism in promoting candidate synapses for long-term storage is suggested.
This paper presents and proves in detail a convergence theorem forQ-learning based on that outlined in Watkins (1989), showing that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action- values are represented discretely.
Reinforcement Learning: An Introduction
This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Solving the distal reward problem through linkage of STDP and dopamine signaling
This study emphasizes the importance of precise firing patterns in brain dynamics and suggests how a global diffusive reinforcement signal in the form of extracellular DA can selectively influence the right synapses at the right time.
What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
  • K. Doya
  • Computer Science, Medicine
    Neural Networks
  • 1999
This paper investigates how the learning modules specialized for these three kinds of learning can be assembled into goal-oriented behaving systems and presents a novel view that their computational roles can be characterized by asking what are the "goals" of their computation.
Interactions between learning and evolution
Artificial Life (AL) test bed Results Decision Making Project Applicability Motivation This combines two completely different time scales Learning is a individual behaviour Evolution operates at the
Evolution of Reinforcement Learning in Uncertain Environments: A Simple Explanation for Complex Foraging Behaviors
Reinforcement learning is a fundamental process by which organisms learn to achieve goals from their interactions with the environment. Using evolutionary computation techniques we evolve
A New Factor in Evolution
  • J. Baldwin
  • Biology, Medicine
    The American Naturalist
  • 1896
The object of the present paper is to gather into one sketch an outline of the view of the process of development which these different publications have hinged upon.
Adaptive Critics and the Basal Ganglia
One of the most active areas of research in artificial intelligence is the study of learning methods by which “embedded agents” can improve performance while acting in complex dynamic environments.