A Distributed Cooperative Dynamic Task Planning Algorithm for Multiple Satellites Based on Multi-agent Hybrid Learning
Tóm tắt
Từ khóa
Tài liệu tham khảo
Morris, 2006, 397
Liu Y. Research on imaging reconnaissance satellite dynamic replanning model, algorithm and its application, PhD thesis, National University of Defense Technology, 2004. [In Chinese].
Kramer L A, Smith S F. Maximizing flexibility: a retraction heuristic for oversubscribed scheduling problems. Proceedings of the 18th International Joint Conference on Artificial Intelligence. 2003; 1218-1223.
Kramer L A, Smith S F. Task swapping for schedule improvement: a broader analysis. Proceeding of the 14th International Conference on Automated Planning and Scheduling. 2004; 235-243.
Wang, 2008
Chen, 2009
Peter S. Learning and multiagent reasoning for autonomous agents. Proceeding of International Joint Conference on Artificial Intelligence. 2007; 13-30.
Liviu, 2005, Cooperative multi-agent learning: the state of the art, Autonomous agents and multi-agent systems, Springer, 11, 387
Lucian, 2008, A Comprehensive survey of multiagent reinforcement learning, IEEE Transactions on Systems, Man, and Cybernetics—Part C: Applications and Reviews, 38, 156, 10.1109/TSMCC.2007.913919
Matthew, 2009, Transfer learning for reinforcement learning domains: a survey, Journal of Machine Learning Research, 10, 1633
Matthew E T, Shimon W, Peter S. Transfer via inter task mappings in policy search reinforcement learning. Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems, 2007.
Matthew E T, Gregory K, Peter S. Autonomous transfer for reinforcement learning. Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems. 2008.
Daniel S B, Robert G, Neil I, et al. The complexity of decentralized control of markov decision processes. Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence. 2000; 32-37.
Christopher J C H W. Learning from delayed rewards. PhD thesis, King's College, 1989.
Michael W, Jeffrey S R, Best-response multiagent learning in non-stationary environments. Proceedings of 3rd International Joint Conference on Autonomous Agents Multiagent System. 2004; 506-513.
Richard, 1998
Kenneth, 2002, Evolving neural networks through augmenting topologies, Evolutionary Computation, 10, 99, 10.1162/106365602320169811
Richard S S, David M, Satinder S, et al. Policy gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems. 2000; 1057-1063.
Kohl N, Stone P. Policy gradient reinforcement learning for fast quadrupedal locomotion. Proceedings of the IEEE International Conference on Robotics and Automation. 2004; 2619-2624.
David, 1999, Evolutionary algorithms for reinforcement learning, Journal of Artificial Intelligence Research, 11, 199
Shimon W. Adaptive Representations for reinforcement learning. PhD thesis, University of Texas at Austin, 2007.
Mitsuo, 2000
Tan, 2006, A distributed cooperative coevolutionary algorithm for multiobjective optimization, IEEE Transactions on Evolutionary Computation, 10, 527, 10.1109/TEVC.2005.860762
Wang, 2001
Analytical Graphics Inc
