Learn More
We describe our technical approach in competing at the RoboCup 2000 Sony legged robot league. The UNSW team won both the challenge competition and all their soccer matches, emerging the outright winners for this league against eleven other international teams. The main advantage that the UNSW team had was speed. The robots not only moved quickly, due to a(More)
Competing at the RoboCup 2000 Sony legged robot league, the UNSW team won both the challenge competition and all their soccer matches, emerging the outright winners for this league against eleven other international teams. The main advantage that the UNSW team had was speed. A major contributor to the speed was a novel omnidirectional locomotion method(More)
This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The CQ algorithm uses a heuristic which is applicable for problems that can be modelled by a set of state variables that conform to a special ordering, defined in this paper as a "(More)
A challenge in applying reinforcement learning to large problems is how to manage the explosive increase in storage and time complexity. This is especially problematic in multi-agent systems, where the state space grows exponentially in the number of agents. Function approximation based on simple supervised learning is unlikely to scale to complex domains(More)
In this report we present a comprehensive description of the software system developed to compete in the Sony legged robot league competition at RoboCup 2000. The UNSW team won both the challenge competition and all their soccer matches, to take the championship in a field of 12 teams. At RoboCup 2000, the UNSW robots had distinct advantages in locomotion,(More)
Multi-agent robotic competitions such as RoboCup provide the motivation for a developmental research agenda – one that focuses on the evolution of complete working systems and their cognitive architectures. In this paper, we describe the components and integration of one such system – the 2010 RoboCup Standard Platform League entry rUNSWift. The real-time(More)
—We learn a controller for a flat-footed bipedal robot to optimally respond to both (1) external disturbances caused by, for example, stepping on objects or being pushed, and (2) rapid acceleration, such as reversal of demanded walk direction. The reinforcement learning method employed learns an optimal policy by actuating the ankle joints to assert(More)
One of the challenges of rescue robotics is to create robots that can autonomously traverse rough, unstructured terrain. Although mechanical engineering can produce very capable robots, mechanical engineering alone will not drive them. In this paper, we present a terrain feature extractor that can be taught to find significant features in range images of(More)