Distributed policy evaluation via inexact ADMM in multi-agent reinforcement learning
Tóm tắt
Từ khóa
Tài liệu tham khảo
Sutton, R., & Barto, A. (2018). Reinforcement Learning: An Introduction. Cambridge: The MIT press.
Busoniu, L., BabuÅka, R., & Schutter, B. (2008). A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews, 38(2), 156–172.
Cui, J., Liu, Y., & Nallanathan, A. (2019). Multi-agent reinforcement learning-based resource allocation for UAV networks. IEEE Transactions on Wireless Communications, 19(2), 729–743.
Lin, K. K., Zhao, R., & Xu, Z., et al. (2018). Efficient large-scale fleet management via multi-agent deep reinforcement learning. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1774–1783). London: ACM.
Silva, M., Souza, D., Souza, M., et al. (2019). A reinforcement learning-based multi-agent framework applied for solving routing and scheduling problems. Expert Systems with Applications, 131, 148–171.
Li, X., Zhang, J., & Bian, J., et al. (2019). A Cooperative multi-Agent reinforcement learning framework for resource balancing in complex logistics network. In Proceedings of the 18th International Conference on Autonomous Agents and Multiagent Systems (pp. 980–988). Canada: International Foundation for Autonomous Agents and Multiagent Systems.
Hernandez, L. P., Kartal, B., & Taylor, M. E. (2019). A survey and critique of multiagent deep reinforcement learning. Autonomous Agents and Multi-Agent Systems, 33(6), 750–797.
Zhang, C., Ahmad, M., & Wang, Y. (2019). ADMM based privacy-preserving decentralized optimization. IEEE Transactions on Information Forensics and Security, 14(3), 565–580.
Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th International Conference on Machine Learning (pp. 157–163). New Brunswick: Morgan Kaufmann.
Foerster, J., Nardelli, N., & Farquhar, G., et al. (2017). Stabilising experience replay for deep multi-agent reinforcement learning. In Proceedings of the 34th International Conference on Machine Learning (pp. 1146–1155). Sydney: PMLR.
Foerster, J., Assael, I., & Freitas, N., et al. (2016). Learning to communicate with deep multi-agent reinforcement learning. In Proceedings of Advances in Neural Information Processing Systems. Barcelona: PMLR, 2137–2145.
Omidshafiei, S., Pazis, J., & Amato, C., et al. (2017). Deep decentralized multi-task multi-agent reinforcement learning under partial observability. In Proceedings of the 34th International Conference on Machine Learning (pp. 2681–2690). Sydney: PMLR.
Mathkar, A., & Borkar, V. S. (2017). Distributed reinforcement learning via gossip. IEEE Transactions on Automatic Control, 62(3), 1465–1470.
Kar, S., Moura, J. M., & Poor, H. V. (2013). QD-learning: A Collaborative distributed strategy for multi-agent reinforcement learning through consensus plus innovations. IEEE Transactions on Signal Processing, 61(7), 1848–1862.
Kar, S., Moura, J. M., & Poor, H. V. (2013). Distributed reinforcement learning in multi-agent networks. In Proceedings of the 5th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (pp. 296–299). St. Martin: IEEE.
Lee, D., Yoon, H., & Hovakimyan, N. (2018). Primal-dual algorithm for distributed reinforcement learning: distributed GTD. In Proceedings of the 57th IEEE Conference on Decision and Control (pp. 1967–1972). Miami: IEEE.
Wai, H., Yang, Z., & Wang, P., et al. (2018). Multi-agent reinforcement learning via double averaging primal-dual optimization. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems (pp. 9672–9683). Montreal: Curran Associates, Inc.
Zhang, K., Yang, Z., & Basar, T. (2018). Networked multi-agent reinforcement learning in continuous spaces. In Proceedings of the 57th IEEE Conference on Decision and Control (pp. 2771–2776). Miami: IEEE.
Zhang, K., Yang, Z., & Liu, H., et al. (2018). Fully decentralized multi-agent reinforcement learning with networked agents. In Proceedings of the 35th International Conference on Machine Learning (pp. 5867–5876). Stockholm: PMLR.
Yang, T., Yi, X., Wu, J., et al. (2019). A survey of distributed optimization. Annual Reviews in Control, 47, 278–305.
Boyd, S., Parikh, N., Chu, E., et al. (2011). Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine learning, 3(1), 1–122.
Shi, W., Ling, Q., Yuan, K., et al. (2014). On the linear convergence of the ADMM in decentralized consensus optimization. IEEE Transactions on Signal Processing, 62(7), 1750–1761.
Mokhtari, A., Shi, W., Ling, Q., et al. (2016). DQM: Decentralized quadratically approximated alternating direction method of multipliers. IEEE Transactions on Signal Processing, 64(19), 5158–5173.
Chang, T., Liao, W., Hong, M., et al. (2016). Asynchronous distributed ADMM for large-scale optimization–Part I: algorithm and convergence analysis. IEEE Transactions on Signal Processing, 64(12), 3118–3130.
Chang, T., Liao, W., Hong, M., et al. (2016). Asynchronous distributed ADMM for large-scale optimization–Part II: linear convergence analysis and numerical performance. IEEE Transactions on Signal Processing, 64(12), 3131–3144.
Sutton, R., Maei, H .R., Precup, D., et al. (2009). Fast gradient-descent methods for temporal-difference learning with linear function approximation. In Proceedings of the 26th Annual International Conference on Machine Learning (pp. 993–1000). Montreal: ACM.
Dann, C., Neumann, G., & Peters, J. (2014). Policy evaluation with temporal differences: A survey and comparison. Journal of Machine Learning Research, 15(1), 809–883.