Benoit Guilhabert

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
This article considers the problem of learning the correct temporal sequence of discrete behaviors from a finite behavior set that will lead to completion of a complex task, using only stochastic reinforcement from the environment. A trial-and-error learning algorithm is proposed that is inspired by backward chaining from the animal training discipline. The(More)
  • 1