Ilham El Bouloumi

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
The purpose of this paper is to develop a selfoptimized association algorithm based on Policy Gradient Reinforcement Learning (PGRL), which is both scalable, stable and robust. The term robust means that performance degradation in the learning phase should be forbidden or limited to predefined thresholds. The algorithm is model-free (as opposed to Value(More)
  • 1