Matt Oberdorfer

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
We present the first reinforcement-learning model to self-improve its reward-modulated training implemented through a continuously improving " intuition " neural network. An agent was trained how to play the arcade video game Pong with two reward-based alternatives, one where the paddle was placed randomly during training, and a second where the paddle was(More)
  • 1