Christopher R. Mansley

Learn More
In this paper, we present a new algorithm that integrates recent advances in solving continuous bandit problems with sample-based rollout methods for planning in Markov Decision Processes (MDPs). Our algorithm, Hierarchical Optimistic Optimization applied to Trees (HOOT) addresses planning in continuous action MDPs, directing the exploration of the search(More)
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (MDPs), where compact function approximation has to be used. In this paper, we provide a practical solution to exploring large MDPs by integrating a powerful exploration technique,(More)
We describe an inexpensive robot that serves as a physical autonomic element, capable of navigating, mapping and monitoring data centers with little or no human involvement, even ones that it has never seen before. Through a series of real experiments and simulations, we establish that the robot is sufficiently accurate, efficient and robust to be of(More)
We describe an inexpensive autonomous robot capable of navigating previously unseen data centers and monitoring key metrics such as air temperature<sup>1</sup>. The robot provides real-time navigation and sensor data to commercial IBM software, thereby enabling real-time generation of the data center layout, a thermal map and other visualizations of energy(More)
Based on the same principles as a single-rotor helicopter, a quadrotor is a flying vehicle that is propelled by four horizontal blades surrounding a central chassis. Because of this vehicle’s symmetry and propulsion mechanism, a quadrotor is capable of simultaneously moving and steering by simple modulation of motor speeds [1]. This stability and relative(More)
Terrain classification in robotics has heavily focused on determining a region for traversal, while also labeling obstacles. Our work attempts to expand this essentially binary viewpoint and to use terrain classifiers as an indicator for different system dynamics. By learning multiple models of the system dynamics, the robot is able to assess alternative(More)
Abstract: In this paper, we develop a suite of motion planning strategies suitable for largescale sensor networks. These solve the problem of reconfiguring the network to a new shape while minimizing either the total distance traveled by the nodes or the maximum distance traveled by any node. Three network paradigms are investigated: centralized,(More)
In this poster/software demonstration we illustrate the integration of an autonomous mobile robot into a slightly customized version of a commercially available asset and data center energy management application known as Maximo Asset Management for Energy Optimization (MEO), version 7.1.1, through a number of practical scenarios. The scenarios showcase(More)
  • 1