#### Filter Results:

- Full text PDF available (153)

#### Publication Year

1985

2017

- This year (3)
- Last 5 years (33)
- Last 10 years (77)

#### Publication Type

#### Co-author

#### Journals and Conferences

#### Data Set Used

#### Key Phrases

Learn More

- Andrew G. Barto, Sridhar Mahadevan
- Discrete Event Dynamic Systems
- 2003

Reinforcement learning is bedeviled by the curse of dimensionality: the number of parameters to be learned grows exponentially with the size of any compact encoding of a state. Recent attempts to combat the curse of dimensionality have turned to principled ways of exploiting temporal abstraction, where decisions are not required at each step, but rather… (More)

- Sridhar Mahadevan, Jonathan H. Connell
- AAAI
- 1991

This paper describes a general approach for automatically programming a behavior-based robot. New behaviors are learned by trial and error using a performance feedback function as reinforcement. Two algorithms for behavior learning are described that combine Q learning, a well known scheme for propagating reinforcement values temporally across actions, with… (More)

- Sridhar Mahadevan
- Machine Learning
- 1996

This paper presents a detailed study of average reward reinforcement learning, an undiscounted optimality framework that is more appropriate for cyclical tasks than the much better studied discounted framework. A wide spectrum of average reward algorithms are described, ranging from synchronous dynamic programming methods to several (provably convergent)… (More)

- Chang Wang, Sridhar Mahadevan
- IJCAI
- 2011

We propose a manifold alignment based approach for heterogeneous domain adaptation. A key aspect of this approach is to construct mappings to link different feature spaces in order to transfer knowledge across domains. The new approach can reuse labeled data from multiple source domains in a target domain even in the case when the input domains do not share… (More)

- Sridhar Mahadevan, Mauro Maggioni
- Journal of Machine Learning Research
- 2007

This paper introduces a novel spectral framework for solving Markov decision processes (MDPs) by jointly learning representations and optimal policies. The major components of the framework described in this paper include: (i) A general scheme for constructing representations or basis functions by diagonalizing symmetric diffusion operators (ii) A specific… (More)

A large class of problems of sequential decision making under uncertainty, of which the underlying probability structure is a Markov process, can be modeled as stochastic dynamic programs (referred to, in general, as Markov decision problems or MDPs). However, the computational complexity of the classical MDP algorithms, such as value iteration and policy… (More)

- Chang Wang, Sridhar Mahadevan
- IJCAI
- 2009

Manifold alignment has been found to be useful in many areas of machine learning and data mining. In this paper we introduce a novel manifold alignment approach, which differs from “semisupervised alignment” and “Procrustes alignment” in that it does not require predetermining correspondences. Our approach learns a projection that maps data instances (from… (More)

- Chang Wang, Sridhar Mahadevan
- ICML
- 2008

In this paper we introduce a novel approach to manifold alignment, based on Procrustes analysis. Our approach differs from "semi-supervised alignment" in that it results in a mapping that is defined everywhere - when used with a suitable dimensionality reduction method - rather than just on the training data points. We describe and evaluate our approach… (More)

- Ivon Arroyo, Kimberly Ferguson, +6 authors Beverly Park Woolf
- AIED
- 2007

We evaluated the impact of a set of interventions to repair students’ disengagement while solving geometry problems in a tutoring system. We present a deep analysis of how a tutor can remediate a student’s disengagement and motivation with self-monitoring feedback. The analysis consists of a between-subjects analyses on students learning and on students’… (More)

- Tom M. Mitchell, Sridhar Mahadevan, Louis I. Steinberg
- IJCAI
- 1985

It is by now well-recognised that a major impediment to developing know ledge-based systems is the knowledge acquisit ion bott leneck the task of bui lding up a complete enough and correct enough knowledge base to provide high-level performance. Th is paper proposes a new class of knowledge-based systems designed to address this knowledge-acquisition… (More)