Vien Anh Ngo

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
Modeling policies in reproducing kernel Hilbert space (RKHS) offers a very flexible and powerful new family of policy gradient algorithms called RKHS policy gradient algorithms. They are designed to optimize over a space of very high or infinite dimensional policies. As a matter of fact, they are known to suffer from a large variance problem. This critical(More)
  • 1