Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties

Information Sciences - Tập 463 - Trang 307-322 - 2018
Xiong Yang1,2, Haibo He2, Qinglai Wei3, Biao Luo3
1School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
2Department of Electrical, Computer, and Biomedical Engineering, University of Rhode Island, Kingston, RI 02881, USA
3The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

Tài liệu tham khảo

Abu-Khalaf, 2005, Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach, Automatica, 41, 779, 10.1016/j.automatica.2004.11.034 Basar, 1995 Bu, 2016, Neural-approximation-based robust adaptive control of flexible air-breathing hypersonic vehicles with parametric uncertainties and control input constraints, Inf. Sci., 346, 29, 10.1016/j.ins.2016.01.093 Chowdhary, 2011, A singular value maximizing data recording algorithm for concurrent learning, 3547 Fu, 2016, Online solution of two-player zero-sum games for continuous-time nonlinear systems with completely unknown dynamics, IEEE Trans. Neural Netw. Learn. Syst., 27, 2577, 10.1109/TNNLS.2015.2496299 Gao, 2018, Stabilization of nonlinear systems using event-triggered controllers with dwell times, Inf. Sci., 457–458, 156, 10.1016/j.ins.2018.04.002 Hornik, 1990, Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks, Neural Netw., 3, 551, 10.1016/0893-6080(90)90005-6 Ioannou, 2012 Jiang, 2014, Robust adaptive dynamic programming and feedback stabilization of nonlinear systems, IEEE Trans. Neural Netw. Learn. Syst., 25, 882, 10.1109/TNNLS.2013.2294968 Kamalapurkar, 2017, Model-based reinforcement learning for infinite-horizon approximate optimal tracking, IEEE Trans. Neural Netw. Learn. Syst., 28, 753, 10.1109/TNNLS.2015.2511658 Khalil, 2002 Lewis, 1999 Li, 2018, Manifold regularized reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst., 29, 932, 10.1109/TNNLS.2017.2650943 Lin, 2007 Littman, 2015, Reinforcement learning improves behaviour from evaluative feedback, Nature, 521, 445, 10.1038/nature14540 Liu, 2014, Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems, IEEE Trans. Cybern., 44, 2834, 10.1109/TCYB.2014.2357896 Liu, 2017 Liu, 2018, Robust event-triggered control for networked control systems, Inf. Sci., 459, 186, 10.1016/j.ins.2018.02.057 Liu, 2015, Reinforecement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints, IEEE Trans. Cybern., 45, 1372, 10.1109/TCYB.2015.2417170 Luo, 2017, Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems, Inf. Sci., 411, 66, 10.1016/j.ins.2017.05.005 Mahadevan, 2007, Proto-value functions: a Laplacian framework for learning representation and control in Markov decision processes, J. Mach. Learn. Res., 8, 2169 Modares, 2015, h∞ Tracking control of completely unknown continuous-time systems via off-policy reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst., 26, 2550, 10.1109/TNNLS.2015.2441749 Modares, 2014, Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems, Automatica, 50, 193, 10.1016/j.automatica.2013.09.043 Mu, 2017, Adaptive tracking control for a class of continuous-time uncertain nonlinear systems using the approximate solution of HJB equation, Neurocomputing, 260, 432, 10.1016/j.neucom.2017.04.043 Narayanan, 2017, Event-triggered distributed control of nonlinear interconnected systems using online reinforcement learning with exploration, IEEE Trans. Cybern. Narendra, 1987, A new adaptive law for robust adaptation without persistent excitation, IEEE Trans. Automat. Control, 32, 134, 10.1109/TAC.1987.1104543 Song, 2017, Neural-network-based synchronous iteration learning method for multi-player zero-sum games, Neurocomputing, 242, 73, 10.1016/j.neucom.2017.02.051 Stevens, 2015 Vamvoudakis, 2010, Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem, Automatica, 46, 878, 10.1016/j.automatica.2010.02.018 Vamvoudakis, 2017, Game theory-based control system algorithms with real-time reinforcement learning: how to solve multiplayer games online, IEEE Control Syst., 37, 33, 10.1109/MCS.2016.2621461 Vrabie, 2013 Wang, 2016, Data-based robust optimal control of continuous-time affine nonlinear systems with matched uncertainties, Inf. Sci., 366, 121, 10.1016/j.ins.2016.05.034 Wang, 2014, Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming, Inf. Sci., 282, 167, 10.1016/j.ins.2014.05.050 Wang, 2016, Fault-tolerant controller design for a class of nonlinear MIMO discrete-time systems via online reinforcement learning algorithm, IEEE Trans. Syst. Man Cybern., 46, 611, 10.1109/TSMC.2015.2478885 Wei, 2017, Adaptive dynamic programming-based optima control scheme for energy storage systems with solar renewable energy, IEEE Trans. Ind. Electron., 64, 5468, 10.1109/TIE.2017.2674581 Whiteson, 2010 Xu, 2014, A clustering-based graph Laplacian framework for value function approximation in reinforcement learning, IEEE Trans. Cybern., 44, 2613, 10.1109/TCYB.2014.2311578 Xu, 2017, Manifold-based reinforcement learning via locally linear reconstruction, IEEE Trans. Neural Netw. Learn. Syst., 28, 934, 10.1109/TNNLS.2015.2505084 Yang, 2018, Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances, Neural Netw., 99, 19, 10.1016/j.neunet.2017.11.022 Yang, 2017, Event-triggered optimal neuro-controller design with reinforcement learning for unknown nonlinear systems, IEEE Trans. Syst. Man Cybern. Yang, 2017, Adaptive dynamic programming for robust neural control of unknown continuous-time nonlinear systems, IET Control Theory Appl., 11, 2307, 10.1049/iet-cta.2017.0154 Yang, 2016, Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning, Inf. Sci., 369, 731, 10.1016/j.ins.2016.07.051 Yang, 2015, Direct adaptive control for a class of discrete-time unknown nonaffine nonlinear systems using neural networks, Int. J. Robust Nonlinear Control, 25, 1844, 10.1002/rnc.3181 Zhang, 2017, Robust adaptive fault-tolerant control of nonlinear uncertain systems tracking uncertain target trajectory, Inf. Sci., 415, 446, 10.1016/j.ins.2017.06.023 Zhang, 2018, Optimal guaranteed cost sliding mode control for constrained-input nonlinear systems with matched and unmatched disturbances, IEEE Trans. Neural Netw. Learn. Syst., 29, 2112, 10.1109/TNNLS.2018.2791419 Zhang, 2017, Event-triggered H∞ control for continuous-time nonlinear system via concurrent learning, IEEE Trans. Syst. Man Cybern., 47, 1071, 10.1109/TSMC.2016.2531680 Zhao, 2017, Observer based adaptive dynamic programming for fault tolerant control of a class of nonlinear systems, Inf. Sci., 384, 21, 10.1016/j.ins.2016.12.016 Zhao, 2017, Decentralized control for large-scale nonlinear systems with unknown mismatched interconnections via policy iteration, IEEE Trans. Syst. Man Cybern. Zhong, 2017, An event-triggered ADP control approach for continuous-time system with unknown internal states, IEEE Trans. Cybern., 47, 683, 10.1109/TCYB.2016.2523878