Learn More
This article is part of the field of artificial intelligence (AI), specifically it deals with reinforcement learning techniques to solve a problem modeled by Markov decision processes partially observed (POMDP). The objective in this article is to design an algorithm (based on the algorithm Q-learning) to be implemented on agents, immersed in an environment(More)
  • 1