Optimal Cost Design for Model Predictive Control
@inproceedings{Jain2021OptimalCD, title={Optimal Cost Design for Model Predictive Control}, author={Avik Jain and Lawrence Chan and Daniel S. Brown and Anca D. Dragan}, booktitle={L4DC}, year={2021} }
Many robotics domains use some form of nonconvex model predictive control (MPC) for planning, which sets a reduced time horizon, performs trajectory optimization, and replans at every step. The actual task typically requires a much longer horizon than is computationally tractable, and is specified via a cost function that cumulates over that full horizon. For instance, an autonomous car may have a cost function that makes a desired trade-off between efficiency, safety, and obeying traffic laws…
One Citation
Leveraging Human Input to Enable Robust AI Systems
- Computer Science
- 2021
This work applies rigorous theory and state-of-the-art machine learning techniques to enable AI systems to maintain, be robust to, and actively reduce uncertainty over both the human’s intent and the corresponding optimal policy.
References
SHOWING 1-10 OF 31 REFERENCES
Practical Reinforcement Learning For MPC: Learning from sparse objectives in under an hour on a real robot
- Computer ScienceL4DC
- 2020
The experiments show that the Reinforcement Learning method can learn the cost function from scratch and without human intervention, while reaching a performance level similar to that of an expert-tuned MPC.
Value function approximation and model predictive control
- Mathematics2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
- 2013
Two methods of deriving a descriptive final cost function to assist model predictive control (MPC) in selecting a good policy without having to plan as far into the future or having to fine-tune delicate cost functions are explored.
Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search
- Computer Science2016 IEEE International Conference on Robotics and Automation (ICRA)
- 2016
This work proposes to combine MPC with reinforcement learning in the framework of guided policy search, where MPC is used to generate data at training time, under full state observations provided by an instrumented training environment, and a deep neural network policy is trained, which can successfully control the robot without knowledge of the full state.
Deep Value Model Predictive Control
- Computer ScienceCoRL
- 2019
This paper introduces an actor-critic algorithm called Deep Value Model Predictive Control (DMPC), which combines model-based trajectory optimization with value function estimation and shows that including the value function in the running cost of the trajectory optimizer speeds up the convergence.
Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control
- Computer ScienceICLR
- 2019
A plan online and learn offline (POLO) framework for the setting where an agent, with an internal model, needs to continually act and learn in the world and how trajectory optimization can be used to perform temporally coordinated exploration in conjunction with estimating uncertainty in value function approximation.
MOBILE ROBOT TRAJECTORY TRACKING USING MODEL PREDICTIVE CONTROL
- Engineering, Mathematics
- 2005
This work focus on the application of model-based predictive control (MPC) to the trajectory tracking problem of nonholonomic wheeled mobile robots (WMR). The main motivation of the use of MPC in…
The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
- MathematicsOper. Res.
- 1973
If there are only a finite number of control intervals remaining, then the optimal payoff function is a piecewise-linear, convex function of the current state probabilities of the internal Markov process, and an algorithm for utilizing this property to calculate the optimal control policy and payoff function for any finite horizon is outlined.
Learning-B ased Nonlinear Model Predictive Control to Improve Vision-Based Mobile Robot Path Tracking
- Engineering
- 2017
This paper presents a Learning-based Nonlinear Model Predictive Control (LB-NMPC) algorithm to achieve high-performance path tracking in challenging o↵-road terrain through learning. The LB-NMPC…
ABC-LMPC: Safe Sample-Based Learning MPC for Stochastic Nonlinear Dynamical Systems with Adjustable Boundary Conditions
- Computer ScienceWAFR
- 2021
A novel LMPC algorithm, Adjustable Boundary Condition LMPC (ABC-LMPC), is presented, which enables rapid adaptation to novel start and goal configurations and theoretically shows that the resulting controller guarantees iterative improvement in expectation for stochastic nonlinear systems.
Model predictive control: Theory and practice - A survey
- EngineeringAutom.
- 1989