Distributed policy evaluation via inexact ADMM in multi-agent reinforcement learning

Control Theory and Technology - Tập 18 Số 4 - Trang 362-378 - 2020

Xiaoxiao Zhao¹, Peng Yi², Li Li²

¹College of Electronic and Information Engineering, Tongji University, Shanghai, China

²Institute of Intelligent Science and Technology, Tongji University, Shanghai, China

Tóm tắt

Từ khóa

Tài liệu tham khảo

Sutton, R., & Barto, A. (2018). Reinforcement Learning: An Introduction. Cambridge: The MIT press.

Busoniu, L., BabuÅka, R., & Schutter, B. (2008). A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews, 38(2), 156–172.

Cui, J., Liu, Y., & Nallanathan, A. (2019). Multi-agent reinforcement learning-based resource allocation for UAV networks. IEEE Transactions on Wireless Communications, 19(2), 729–743.

Lin, K. K., Zhao, R., & Xu, Z., et al. (2018). Efficient large-scale fleet management via multi-agent deep reinforcement learning. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1774–1783). London: ACM.

Silva, M., Souza, D., Souza, M., et al. (2019). A reinforcement learning-based multi-agent framework applied for solving routing and scheduling problems. Expert Systems with Applications, 131, 148–171.

Li, X., Zhang, J., & Bian, J., et al. (2019). A Cooperative multi-Agent reinforcement learning framework for resource balancing in complex logistics network. In Proceedings of the 18th International Conference on Autonomous Agents and Multiagent Systems (pp. 980–988). Canada: International Foundation for Autonomous Agents and Multiagent Systems.

Hernandez, L. P., Kartal, B., & Taylor, M. E. (2019). A survey and critique of multiagent deep reinforcement learning. Autonomous Agents and Multi-Agent Systems, 33(6), 750–797.

Zhang, C., Ahmad, M., & Wang, Y. (2019). ADMM based privacy-preserving decentralized optimization. IEEE Transactions on Information Forensics and Security, 14(3), 565–580.

Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th International Conference on Machine Learning (pp. 157–163). New Brunswick: Morgan Kaufmann.

Foerster, J., Nardelli, N., & Farquhar, G., et al. (2017). Stabilising experience replay for deep multi-agent reinforcement learning. In Proceedings of the 34th International Conference on Machine Learning (pp. 1146–1155). Sydney: PMLR.

Foerster, J., Assael, I., & Freitas, N., et al. (2016). Learning to communicate with deep multi-agent reinforcement learning. In Proceedings of Advances in Neural Information Processing Systems. Barcelona: PMLR, 2137–2145.

Omidshafiei, S., Pazis, J., & Amato, C., et al. (2017). Deep decentralized multi-task multi-agent reinforcement learning under partial observability. In Proceedings of the 34th International Conference on Machine Learning (pp. 2681–2690). Sydney: PMLR.

Mathkar, A., & Borkar, V. S. (2017). Distributed reinforcement learning via gossip. IEEE Transactions on Automatic Control, 62(3), 1465–1470.

Kar, S., Moura, J. M., & Poor, H. V. (2013). QD-learning: A Collaborative distributed strategy for multi-agent reinforcement learning through consensus plus innovations. IEEE Transactions on Signal Processing, 61(7), 1848–1862.

Kar, S., Moura, J. M., & Poor, H. V. (2013). Distributed reinforcement learning in multi-agent networks. In Proceedings of the 5th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (pp. 296–299). St. Martin: IEEE.

Lee, D., Yoon, H., & Hovakimyan, N. (2018). Primal-dual algorithm for distributed reinforcement learning: distributed GTD. In Proceedings of the 57th IEEE Conference on Decision and Control (pp. 1967–1972). Miami: IEEE.

Wai, H., Yang, Z., & Wang, P., et al. (2018). Multi-agent reinforcement learning via double averaging primal-dual optimization. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems (pp. 9672–9683). Montreal: Curran Associates, Inc.

Zhang, K., Yang, Z., & Basar, T. (2018). Networked multi-agent reinforcement learning in continuous spaces. In Proceedings of the 57th IEEE Conference on Decision and Control (pp. 2771–2776). Miami: IEEE.

Zhang, K., Yang, Z., & Liu, H., et al. (2018). Fully decentralized multi-agent reinforcement learning with networked agents. In Proceedings of the 35th International Conference on Machine Learning (pp. 5867–5876). Stockholm: PMLR.

Yang, T., Yi, X., Wu, J., et al. (2019). A survey of distributed optimization. Annual Reviews in Control, 47, 278–305.

Boyd, S., Parikh, N., Chu, E., et al. (2011). Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine learning, 3(1), 1–122.

Shi, W., Ling, Q., Yuan, K., et al. (2014). On the linear convergence of the ADMM in decentralized consensus optimization. IEEE Transactions on Signal Processing, 62(7), 1750–1761.

Mokhtari, A., Shi, W., Ling, Q., et al. (2016). DQM: Decentralized quadratically approximated alternating direction method of multipliers. IEEE Transactions on Signal Processing, 64(19), 5158–5173.

Chang, T., Liao, W., Hong, M., et al. (2016). Asynchronous distributed ADMM for large-scale optimization–Part I: algorithm and convergence analysis. IEEE Transactions on Signal Processing, 64(12), 3118–3130.

Chang, T., Liao, W., Hong, M., et al. (2016). Asynchronous distributed ADMM for large-scale optimization–Part II: linear convergence analysis and numerical performance. IEEE Transactions on Signal Processing, 64(12), 3131–3144.

Sutton, R., Maei, H .R., Precup, D., et al. (2009). Fast gradient-descent methods for temporal-difference learning with linear function approximation. In Proceedings of the 26th Annual International Conference on Machine Learning (pp. 993–1000). Montreal: ACM.

Dann, C., Neumann, G., & Peters, J. (2014). Policy evaluation with temporal differences: A survey and comparison. Journal of Machine Learning Research, 15(1), 809–883.

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích ảnh hưởng của các bài báo, công bố khoa học Việt Nam và Quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Hệ thống CSDL Khoa học & Công nghệ SciBase

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Hệ thống hội thảo khoa học Việt Nam

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA

Thông tin liên hệ & hỗ trợ

Đơn vị chủ quản, phát triển và vận hành: Công ty Cổ phần Metis

Địa chỉ liên hệ: 26A Lê Đức Thọ, Phường Từ Liêm, Thành phố Hà Nội

Số giấy chứng nhận ĐKKD: 0109293202 cấp ngày 03/08/2020 tại Sở Kế hoạch và Đầu tư thành phố Hà Nội

Người quản lý và chịu trách nhiệm nội dung: Nguyễn Ngọc Sơn

Hotline: 0566.685.688

Email: [email protected]