Deep Reinforcement Learning for Cooperative Content Caching in Vehicular Edge Computing and Networks

IEEE Internet of Things Journal - Tập 7 Số 1 - Trang 247-257 - 2020
Guanhua Qiao1, Supeng Leng1, Sabita Maharjan2, Yan Zhang3, Nirwan Ansari4
1School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China
2Center for Resilient Networks and Applications, Simula Metropolitan Center for Digital Engineering, Oslo, Norway
3Department of Informatics, University of Oslo, Oslo, Norway
4Department of Electrical and Computer Engineering, Advanced Networking Laboratory, New Jersey Institute of Technology, Newark, USA

Tóm tắt

Từ khóa


Tài liệu tham khảo

lillicrap, 2015, Continuous control with deep reinforcement learning, arXiv preprint arXiv 1509 02971

csaba, 2010, Algorithms for reinforcement learning, Synthesis Lectures on Artificial Intelligence and Machine Learning, 1

10.1109/MNET.2018.1800097

silver, 2014, Determinstic policy gradient algorithms, Proc Int Conf Mach Learn (ICML), 387

10.1109/TSP.2011.2165211

2016, Reimplementation of DDPG(Continuous Control With Deep Reinforcement Learning) Based on OpenAI Gym + Tensorflow

2016, DC Data Sets

10.1109/TVT.2017.2784562

kleinberg, 2004, Algorithm Design

10.1007/978-1-4615-6331-0

10.1109/TMC.2016.2597851

10.1109/GLOCOM.2016.7841723

10.1109/GLOCOM.2016.7842121

10.1109/JIOT.2018.2866947

10.1109/ACCESS.2017.2714191

10.1109/TVT.2015.2480711

10.1109/ICC.2018.8422220

zhang, 2018, A deep reinforcement learning based approach for cost- and energy-aware multi-flow mobile data offloading, IEICE Trans Commun, e101 b, 1625, 10.1587/transcom.2017CQP0014

10.1109/COMST.2018.2841349

10.1109/TVT.2015.2479942

10.1109/MWC.2018.1700303

10.1109/JSAC.2016.2545413

konda, 1999, Actor–critic algorithm, Proc NIPS, 13, 1008

10.3390/s16070974

10.1109/MCOM.2014.6736746

10.1109/SPAWC.2016.7536874

10.1109/JIOT.2019.2903191

10.1109/ACCESS.2017.2693440

10.3390/jsan5040017

zhang, 2018, Cache-enabled dynamic rate allocation via deep self-transfer reinforcement learning, arXiv 1893 11334v1

10.1109/JSTSP.2017.2787979

10.1016/j.mechatronics.2017.10.010

2018, Geohash Algorithm

breslau, 2014, Web caching and Zipf-like distribution: Evidence and implications, Proc IEEE Comput Commun Soc (INFOCOM), 126

10.1109/TITS.2011.2171951

10.1109/TCOMM.2014.2337890