A Neural Network PID-Like Controller Using a Hybrid of Online Actor-Critic Reinforcement Algorithm with the Square Root Cubature Kalman Filter

@article{Sento2018ANN,
  title={A Neural Network PID-Like Controller Using a Hybrid of Online Actor-Critic Reinforcement Algorithm with the Square Root Cubature Kalman Filter},
  author={Adna Sento and Yuttana Kitjaidure},
  journal={International Journal of Intelligent Engineering and Systems},
  year={2018}
}
  • A. Sento, Y. Kitjaidure
  • Published 31 December 2018
  • Computer Science, Engineering
  • International Journal of Intelligent Engineering and Systems
This paper presents a new model of the Neural Network PID-Like controller using an Actor-Critic reinforcement algorithm, called the Neural Network PID-Like controller using an Actor-Critic reinforcement algorithm (NNPID-AC). The proposed NNPID-AC controller is designed to develop the performances and the speed of calculation under the iterative learning algorithm. In the learning algorithm, the critic algorithm receives the reward value and control input to criticize the current state using the… 
1 Citations

Figures and Tables from this paper

Airborne Computer System Path-Tracking Based Multi-PID-PSO Controller Design
TLDR
The main goal of this research is to design six PID controllers to control the high nonlinear UAV quadcopter system by using an on-line Particle Swarm Optimization (PSO) algorithm that uses to learn and tune the eighteen control gain parameters based on multi-objective function.

References

SHOWING 1-10 OF 28 REFERENCES
Neural network controller based on PID using an extended Kalman filter algorithm for multi-variable non-linear control system
  • A. Sento, Y. Kitjaidure
  • Computer Science, Engineering
    2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)
  • 2016
TLDR
An improvement of recurrent neural network controller based on PID, including a controller structure improvement and a modified extended Kalman filter (EKF) learning algorithm for weight update rule, called ENNPID controller is presented.
Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks
TLDR
This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal control solution for unknown constrained-input systems where two neural networks are tuned online and simultaneously to generate the optimal bounded control policy.
A hybrid CKF-NNPID controller for MIMO nonlinear control system
  • A. Sento, Y. Kitjaidure
  • Engineering, Computer Science
    2016 13th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)
  • 2016
TLDR
This paper presents a detailed study to demonstrate the online tuning dynamic neural network PID controller to improve a joint angle position output performance of 4-joint robotic arm and shows that the proposed controller is a superior control method comparing with the other prior controllers.
PID-Like Neural Network Nonlinear Adaptive Control for Uncertain Multivariable Motion Control Systems
A mix locally recurrent neural network was used to create a proportional-integral-derivative (PID)-like neural network nonlinear adaptive controller for uncertain multivariable
EKF-Based PI-/PD-Like Fuzzy-Neural-Network Controller for Brushless Drives
  • A. Rubaai, P. Young
  • Engineering, Computer Science
    IEEE Transactions on Industry Applications
  • 2011
TLDR
Experimental testing results show that the proposed controller learns and robustly responds to a wide range of operating conditions in real time.
A stable self-learning PID control for multivariable time varying systems
Actor–Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems
TLDR
Two neural networks, namely, actor NN and critic NN, are tuned online and simultaneously to generate the optimal bounded control policy to encode the input constraints into the optimization problem using a nonquadratic performance function.
Actor-Critic Off-Policy Learning for Optimal Control of Multiple-Model Discrete-Time Systems
TLDR
A model-free off-policy reinforcement learning algorithm is developed to solve the optimal tracking control of multiple-model linear discrete-time systems and helps to increase data efficiency and speed of tuning.
Fault-Tolerant Lyapunov-Gain-Scheduled PID Control of a Quadrotor UAV
TLDR
New designs of the Lyapunov gain Scheduled PID controller (LGSPID) were presented to improve the PID control effect, and the obtained results confirm the effectiveness of the proposed method.
Speed Control of DC Motor Using PID and FOPID Controllers Based on Differential Evolution and PSO
TLDR
Simulation results show that the DE-based FOPID controller tuning approach provides improved performance for the setpoint tracking, error minimization, and measurement noise attenuation.
...
...