Temporal Difference Learning for the Game Tic-Tac-Toe 3D: Applying Structure to Neural Networks

When reinforcement learning is applied to large state spaces, such as those that arise in board games, a good function approximator for the value function is essential. In previous research, multi-layer perceptrons have often been used quite successfully as function approximators for learning to play particular…