#### Filter Results:

#### Publication Year

1999

2001

#### Key Phrase

#### Publication Venue

Learn More

The asymptotic properties of temporal-difference learning algorithms with linear function approximation are analyzed in this paper. The analysis is carried out in the context of the approximation of a discounted cost-to-go function associated with an uncontrolled Markov chain with an uncountable finite-dimensional state-space. Under mild conditions, the… (More)

The asymptotic properties of temporal-difference learning algorithms with linear function approximation are analyzed in this paper. The analysis is carried out in the context of the approximation of a discounted cost-to-go function associated to an uncontrolled Markov chain with an uncountable finite-dimensional state-space. Under very mild conditions, the… (More)

- ‹
- 1
- ›