Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties
References
Abu-Khalaf, 2005, Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach, Automatica, 41, 779, 10.1016/j.automatica.2004.11.034
Basar, 1995
Bu, 2016, Neural-approximation-based robust adaptive control of flexible air-breathing hypersonic vehicles with parametric uncertainties and control input constraints, Inf. Sci., 346, 29, 10.1016/j.ins.2016.01.093
Chowdhary, 2011, A singular value maximizing data recording algorithm for concurrent learning, 3547
Fu, 2016, Online solution of two-player zero-sum games for continuous-time nonlinear systems with completely unknown dynamics, IEEE Trans. Neural Netw. Learn. Syst., 27, 2577, 10.1109/TNNLS.2015.2496299
Gao, 2018, Stabilization of nonlinear systems using event-triggered controllers with dwell times, Inf. Sci., 457–458, 156, 10.1016/j.ins.2018.04.002
Hornik, 1990, Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks, Neural Netw., 3, 551, 10.1016/0893-6080(90)90005-6
Ioannou, 2012
Jiang, 2014, Robust adaptive dynamic programming and feedback stabilization of nonlinear systems, IEEE Trans. Neural Netw. Learn. Syst., 25, 882, 10.1109/TNNLS.2013.2294968
Kamalapurkar, 2017, Model-based reinforcement learning for infinite-horizon approximate optimal tracking, IEEE Trans. Neural Netw. Learn. Syst., 28, 753, 10.1109/TNNLS.2015.2511658
Khalil, 2002
Lewis, 1999
Li, 2018, Manifold regularized reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst., 29, 932, 10.1109/TNNLS.2017.2650943
Lin, 2007
Littman, 2015, Reinforcement learning improves behaviour from evaluative feedback, Nature, 521, 445, 10.1038/nature14540
Liu, 2014, Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems, IEEE Trans. Cybern., 44, 2834, 10.1109/TCYB.2014.2357896
Liu, 2017
Liu, 2018, Robust event-triggered control for networked control systems, Inf. Sci., 459, 186, 10.1016/j.ins.2018.02.057
Liu, 2015, Reinforcement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints, IEEE Trans. Cybern., 45, 1372, 10.1109/TCYB.2015.2417170
Luo, 2017, Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems, Inf. Sci., 411, 66, 10.1016/j.ins.2017.05.005
Mahadevan, 2007, Proto-value functions: a Laplacian framework for learning representation and control in Markov decision processes, J. Mach. Learn. Res., 8, 2169
Modares, 2015, H∞ tracking control of completely unknown continuous-time systems via off-policy reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst., 26, 2550, 10.1109/TNNLS.2015.2441749
Modares, 2014, Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems, Automatica, 50, 193, 10.1016/j.automatica.2013.09.043
Mu, 2017, Adaptive tracking control for a class of continuous-time uncertain nonlinear systems using the approximate solution of HJB equation, Neurocomputing, 260, 432, 10.1016/j.neucom.2017.04.043
Narayanan, 2017, Event-triggered distributed control of nonlinear interconnected systems using online reinforcement learning with exploration, IEEE Trans. Cybern.
Narendra, 1987, A new adaptive law for robust adaptation without persistent excitation, IEEE Trans. Automat. Control, 32, 134, 10.1109/TAC.1987.1104543
Song, 2017, Neural-network-based synchronous iteration learning method for multi-player zero-sum games, Neurocomputing, 242, 73, 10.1016/j.neucom.2017.02.051
Stevens, 2015
Vamvoudakis, 2010, Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem, Automatica, 46, 878, 10.1016/j.automatica.2010.02.018
Vamvoudakis, 2017, Game theory-based control system algorithms with real-time reinforcement learning: how to solve multiplayer games online, IEEE Control Syst., 37, 33, 10.1109/MCS.2016.2621461
Vrabie, 2013
Wang, 2016, Data-based robust optimal control of continuous-time affine nonlinear systems with matched uncertainties, Inf. Sci., 366, 121, 10.1016/j.ins.2016.05.034
Wang, 2014, Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming, Inf. Sci., 282, 167, 10.1016/j.ins.2014.05.050
Wang, 2016, Fault-tolerant controller design for a class of nonlinear MIMO discrete-time systems via online reinforcement learning algorithm, IEEE Trans. Syst. Man Cybern., 46, 611, 10.1109/TSMC.2015.2478885
Wei, 2017, Adaptive dynamic programming-based optimal control scheme for energy storage systems with solar renewable energy, IEEE Trans. Ind. Electron., 64, 5468, 10.1109/TIE.2017.2674581
Whiteson, 2010
Xu, 2014, A clustering-based graph Laplacian framework for value function approximation in reinforcement learning, IEEE Trans. Cybern., 44, 2613, 10.1109/TCYB.2014.2311578
Xu, 2017, Manifold-based reinforcement learning via locally linear reconstruction, IEEE Trans. Neural Netw. Learn. Syst., 28, 934, 10.1109/TNNLS.2015.2505084
Yang, 2018, Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances, Neural Netw., 99, 19, 10.1016/j.neunet.2017.11.022
Yang, 2017, Event-triggered optimal neuro-controller design with reinforcement learning for unknown nonlinear systems, IEEE Trans. Syst. Man Cybern.
Yang, 2017, Adaptive dynamic programming for robust neural control of unknown continuous-time nonlinear systems, IET Control Theory Appl., 11, 2307, 10.1049/iet-cta.2017.0154
Yang, 2016, Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning, Inf. Sci., 369, 731, 10.1016/j.ins.2016.07.051
Yang, 2015, Direct adaptive control for a class of discrete-time unknown nonaffine nonlinear systems using neural networks, Int. J. Robust Nonlinear Control, 25, 1844, 10.1002/rnc.3181
Zhang, 2017, Robust adaptive fault-tolerant control of nonlinear uncertain systems tracking uncertain target trajectory, Inf. Sci., 415, 446, 10.1016/j.ins.2017.06.023
Zhang, 2018, Optimal guaranteed cost sliding mode control for constrained-input nonlinear systems with matched and unmatched disturbances, IEEE Trans. Neural Netw. Learn. Syst., 29, 2112, 10.1109/TNNLS.2018.2791419
Zhang, 2017, Event-triggered H∞ control for continuous-time nonlinear system via concurrent learning, IEEE Trans. Syst. Man Cybern., 47, 1071, 10.1109/TSMC.2016.2531680
Zhao, 2017, Observer based adaptive dynamic programming for fault tolerant control of a class of nonlinear systems, Inf. Sci., 384, 21, 10.1016/j.ins.2016.12.016
Zhao, 2017, Decentralized control for large-scale nonlinear systems with unknown mismatched interconnections via policy iteration, IEEE Trans. Syst. Man Cybern.
Zhong, 2017, An event-triggered ADP control approach for continuous-time system with unknown internal states, IEEE Trans. Cybern., 47, 683, 10.1109/TCYB.2016.2523878