A tutorial on reinforcement learning in selected aspects of communications and networking

Computer Communications - Tập 208 - Trang 89-110 - 2023
Piotr Boryło1, Edyta Biernacka1, Jerzy Domżał1, Bartosz Ka̧dziołka1, Mirosław Kantor1, Krzysztof Rusek1, Maciej Skała1, Krzysztof Wajda1, Robert Wójcik1, Wojciech Za̧bek1
1Institute of Telecommunications, AGH University of Science and Technology, Kraków, Poland

Tài liệu tham khảo

Yau, 2012, Reinforcement learning for context awareness and intelligence in wireless networks: Review, new features and open issues, J. Netw. Comput. Appl., 35, 253, 10.1016/j.jnca.2011.08.007 Wang, 2016, A survey on applications of model-free strategy learning in cognitive wireless networks, IEEE Commun. Surv. Tutor., 18, 1717, 10.1109/COMST.2016.2539923 Ashour, 2019, A survey of applying reinforcement learning techniques to multicast routing, 1145 Qian, 2019, Survey on reinforcement learning applications in communication networks, J. Commun. Inform. Netw., 4, 30, 10.23919/JCIN.2019.8917870 Gari, 2021, Reinforcement learning-based application autoscaling in the cloud: A survey, Eng. Appl. Artif. Intell., 102, 10.1016/j.engappai.2021.104288 Uprety, 2021, Reinforcement learning for IoT security: A comprehensive survey, IEEE Internet Things J., 8, 8693, 10.1109/JIOT.2020.3040957 Sharma, 2017, Literature survey of statistical, deep and reinforcement learning in natural language processing, 350 Bo, 2019, Design of no-go game algorithm based on reinforcement learning, 712 Hu, 2020, Application of deep reinforcement learning in the board game, 809 Skoulakis, 2012, Efficient reinforcement learning in adversarial games, 704 Thunyapoo, 2020, Self-parking car simulation using reinforcement learning approach for moderate complexity parking scenario, 576 Xiao, 2021, Research on driving decision of smart vehicles based on reinforcement learning, 1466 Liu, 2021, Reinforcement learning based decision making of operational indices in process industry under changing environment, IEEE Trans. Ind. Inform., 17, 2727, 10.1109/TII.2020.3005207 Xiao-hu, 2009, Research and application of reinforcement learning based on constraint MDP in coal mine, 687 Usha, 2019, Commodity and forex trade automation using deep reinforcement learning, 27 Le, 2021, A2GAN: A deep reinforcement-based learning algorithm for risk-aware in finance, IEEE Access, 9, 137165, 10.1109/ACCESS.2021.3117593 Hu, 2019, Deep reinforcement learning for optimizing finance portfolio management, 14 Blumrosen, 2019, Enhancing healthcare quality with reinforcement learning modeling, 1 Liu, 2020, A deep reinforcement learning approach for type 2 diabetes mellitus treatment, 1 Wang, 2020, Deep reinforcement learning of cooperative control with four robotic agents by MADDPG, 287 Bi, 2020, Real-time energy management of microgrid using reinforcement learning, 38 Bangari, 2021, A review on reinforcement learning based news recommendation systems and its challenges, 260 Liu, 2020, A dynamic bidding strategy based on model-free reinforcement learning in display advertising, IEEE Access, 8, 213587, 10.1109/ACCESS.2020.3037940 Yakovleva, 2019, Real-time bidding with soft actor-critic reinforcement learning in display advertising, 373 Yang, 2021, Hierarchical reinforcement learning with universal policies for multistep robotic manipulation, IEEE Trans. Neural Netw. Learn. Syst., 1, 10.1109/TNNLS.2021.3128269 Han, 2021, Synergy emergence in deep reinforcement learning for full-dimensional arm manipulation, IEEE Trans. Med. Robot. Bionics, 3, 498, 10.1109/TMRB.2021.3056924 Abadi, 2015 Paszke, 2019, PyTorch: An imperative style, high-performance deep learning library Babuschkin, 2020 Jay, 2019, A deep reinforcement learning perspective on internet congestion control, 3050 Li, 2019, QTCP: Adaptive congestion control with reinforcement learning, IEEE Trans. Netw. Sci. Eng., 6, 445, 10.1109/TNSE.2018.2835758 Bega, 2020, A machine learning approach to 5G infrastructure market optimization, IEEE Trans. Mob. Comput., 19, 498, 10.1109/TMC.2019.2896950 Zhong, 2019, A deep actor-critic reinforcement learning framework for dynamic multichannel access, IEEE Trans. Cogn. Commun. Netw., 5, 1125, 10.1109/TCCN.2019.2952909 Jacobson, 1988, Congestion avoidance and control, ACM Special Interest Group Data Commun., 25, 314 Dong, 2015, PCC: Re-architecting congestion control for consistent high performance, 395 Bega, 2017, Optimising 5G infrastructure markets: The business of network slicing, 1 Wang, 2018, Deep reinforcement learning for dynamic multichannel access in wireless networks, IEEE Trans. Cogn. Commun. Netw., 4, 257, 10.1109/TCCN.2018.2809722 Liu, 2010, Indexability of restless bandit problems and optimality of whittle index for dynamic multichannel access, IEEE Trans. Inform. Theory, 56, 5547, 10.1109/TIT.2010.2068950 Awduche, 2002 Awduche, 1999 Yuan, 2019, Research on routing optimization of SDN network using reinforcement learning method, 442 Lin, 2016, QoS-Aware adaptive routing in multi-layer hierarchical software defined networks: A reinforcement learning approach, 25 Qiu, 2019, A novel QoS-enabled load scheduling algorithm based on reinforcement learning in software-defined energy internet, Future Gener. Comput. Syst., 92, 43, 10.1016/j.future.2018.09.023 Wang, 2019, A reinforcement learning approach to energy efficiency and QoS in 5G wireless networks, IEEE J. Sel. Areas Commun., 37, 1413, 10.1109/JSAC.2019.2904365 N.T. Hai, D.-S. Kim, Efficient load balancing for multi-controller in SDN-based mission-critical networks, in: 2016 IEEE 14th International Conference on Industrial Informatics, INDIN, 2016, pp. 420–425. Hajjej, 2020, A distributed coverage hole recovery approach based on reinforcement learning for wireless sensor networks, Ad Hoc Netw., 101, 10.1016/j.adhoc.2020.102082 Simpson, 2020, Per-host DDoS mitigation by direct-control reinforcement learning, IEEE Trans. Netw. Serv. Manag., 17, 103, 10.1109/TNSM.2019.2960202 Sutton, 2018