Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Programming

Abstract

Goal representation globalized dual heuristic dynamic programming (Gr-GDHP) method is proposed in this paper. A goal neural network is integrated into the traditional GDHP method providing an internal reinforcement signal and its derivatives to help the control and learning process. From the proposed architecture, it is shown that the obtained internal reinforcement signal and its derivatives can be able to adjust themselves online over time rather than a fixed or predefined function in literature. Furthermore, the obtained derivatives can directly contribute to the objective function of the critic network, whose learning process is thus simplified. Numerical simulation studies are applied to show the performance of the proposed Gr-GDHP method and compare the results with other existing adaptive dynamic programming designs. We also investigate this method on a ball-and-beam balancing system. The statistical simulation results are presented for both the Gr-GDHP and the GDHP methods to demonstrate the improved learning and controlling performance.

DOI: 10.1109/TCYB.2016.2598282

Cite this paper

@article{Zhong2017GrGDHPAN, title={Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Programming}, author={Xiangnan Zhong and Zhen Ni and Haibo He}, journal={IEEE transactions on cybernetics}, year={2017}, volume={47 10}, pages={3318-3330} }