A Distributed Cooperative Dynamic Task Planning Algorithm for Multiple Satellites Based on Multi-agent Hybrid Learning

Chinese Journal of Aeronautics - Tập 24 Số 4 - Trang 493-505 - 2011
Chong Wang1, Li Jun1, Ning Jing1, Jun Wang1, Hao Chen1
1College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China

Tóm tắt

Từ khóa


Tài liệu tham khảo

Morris, 2006, 397

Liu Y. Research on imaging reconnaissance satellite dynamic replanning model, algorithm and its application, PhD thesis, National University of Defense Technology, 2004. [In Chinese].

Kramer L A, Smith S F. Maximizing flexibility: a retraction heuristic for oversubscribed scheduling problems. Proceedings of the 18th International Joint Conference on Artificial Intelligence. 2003; 1218-1223.

Kramer L A, Smith S F. Task swapping for schedule improvement: a broader analysis. Proceeding of the 14th International Conference on Automated Planning and Scheduling. 2004; 235-243.

Wang, 2008

Chen, 2009

Peter S. Learning and multiagent reasoning for autonomous agents. Proceeding of International Joint Conference on Artificial Intelligence. 2007; 13-30.

Liviu, 2005, Cooperative multi-agent learning: the state of the art, Autonomous agents and multi-agent systems, Springer, 11, 387

Lucian, 2008, A Comprehensive survey of multiagent reinforcement learning, IEEE Transactions on Systems, Man, and Cybernetics—Part C: Applications and Reviews, 38, 156, 10.1109/TSMCC.2007.913919

Matthew, 2009, Transfer learning for reinforcement learning domains: a survey, Journal of Machine Learning Research, 10, 1633

Matthew E T, Shimon W, Peter S. Transfer via inter task mappings in policy search reinforcement learning. Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems, 2007.

Matthew E T, Gregory K, Peter S. Autonomous transfer for reinforcement learning. Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems. 2008.

Daniel S B, Robert G, Neil I, et al. The complexity of decentralized control of markov decision processes. Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence. 2000; 32-37.

Christopher J C H W. Learning from delayed rewards. PhD thesis, King's College, 1989.

Michael W, Jeffrey S R, Best-response multiagent learning in non-stationary environments. Proceedings of 3rd International Joint Conference on Autonomous Agents Multiagent System. 2004; 506-513.

Richard, 1998

Xin, 1999, Evolving artificial neural networks, Proceedings of the IEEE, 87, 1423, 10.1109/5.784219

Kenneth, 2002, Evolving neural networks through augmenting topologies, Evolutionary Computation, 10, 99, 10.1162/106365602320169811

Richard S S, David M, Satinder S, et al. Policy gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems. 2000; 1057-1063.

Kohl N, Stone P. Policy gradient reinforcement learning for fast quadrupedal locomotion. Proceedings of the IEEE International Conference on Robotics and Automation. 2004; 2619-2624.

David, 1999, Evolutionary algorithms for reinforcement learning, Journal of Artificial Intelligence Research, 11, 199

Shimon W. Adaptive Representations for reinforcement learning. PhD thesis, University of Texas at Austin, 2007.

Mitsuo, 2000

Tan, 2006, A distributed cooperative coevolutionary algorithm for multiobjective optimization, IEEE Transactions on Evolutionary Computation, 10, 527, 10.1109/TEVC.2005.860762

Wang, 2001

Analytical Graphics Inc

Samir, 2004, An extended multi-agent negotiation protocol, Autonomous Agents and Multi-Agent Systems, 8, 5, 10.1023/B:AGNT.0000009409.19387.f8