Learn More
We extend the potential-based shaping method from Markov decision processes to multi-player general-sum stochastic games. We prove that the Nash equilibria in a stochastic game remains unchanged after potential-based shaping is applied to the environment. The property of policy invariance provides a possible way of speeding convergence when learning to play(More)
In this paper we develop a reinforcement fuzzy learning scheme for robots playing a differential game. Differential games are games played in continuous time, with continuous states and actions. Fuzzy controllers are used to approximate the calculation of future reinforcements of the game due to actions taken at a specific time. If an immediate(More)
—Two teams of Unmanned Aerial Vehicles (UAVs) are used in the encirclement of two targets at the same time. Encirclement is defined as the situation in which a target is isolated and surrounded by a group of UAVs. It is a tactic that can be employed by a team of UAVs to neutralize a target by restricting its movement due to a containment motion near the(More)
—An Unmanned Aerial Vehicle (UAV) team formed from two or more UAVs is used in the autonomous encirclement of a stationary target in simulation. The encirclement tactic is defined as the situation in which a target is surrounded by a UAV team in formation. This tactic can be employed by a team of UAVs to neutralize a target by restricting its movement. A(More)