TY - JOUR
T1 - Scalable Multi-Agent Reinforcement Learning for Dynamic Coordinated Multipoint Clustering
AU - Hu, Fenghe
AU - Deng, Yansha
AU - Hamid Aghvami, A.
N1 - Publisher Copyright:
© 1972-2012 IEEE.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - Reinforcement learning (RL) is a widely investigated intelligent algorithm and proved to be useful in the wireless communication area. However, for optimization problems in large-scale multi-cell networks whose dimension increases exponentially, it is unrealistic to employ a conventional centralized RL algorithm and make decisions for the entire network. Multi-agent RL, which allows distribute decision-making, is expected to solve the scalability problem but with performance issues due to the unknown global information, i.e., non-stationary environment. In this paper, we propose a parameter-sharing multi-agent RL for grouping decisions of coordinated multi-point in a large-scale network, where agents jointly serve users to enhance the cell-edge service. By sharing information via parameters, our theoretical and simulation results show that parameter sharing can largely benefit the multi-agent algorithm with convergence proof and convergence speed analysis. To reduce the effect of biased local heterogeneous experience, we also propose a transfer learning method for the parameter sharing process, whose performance of transfer learning algorithms is verified by the simulation results.
AB - Reinforcement learning (RL) is a widely investigated intelligent algorithm and proved to be useful in the wireless communication area. However, for optimization problems in large-scale multi-cell networks whose dimension increases exponentially, it is unrealistic to employ a conventional centralized RL algorithm and make decisions for the entire network. Multi-agent RL, which allows distribute decision-making, is expected to solve the scalability problem but with performance issues due to the unknown global information, i.e., non-stationary environment. In this paper, we propose a parameter-sharing multi-agent RL for grouping decisions of coordinated multi-point in a large-scale network, where agents jointly serve users to enhance the cell-edge service. By sharing information via parameters, our theoretical and simulation results show that parameter sharing can largely benefit the multi-agent algorithm with convergence proof and convergence speed analysis. To reduce the effect of biased local heterogeneous experience, we also propose a transfer learning method for the parameter sharing process, whose performance of transfer learning algorithms is verified by the simulation results.
KW - multi-agent systems
KW - next generation networking
KW - Reinforcement learning
KW - telecommunication
UR - http://www.scopus.com/inward/record.url?scp=85141623316&partnerID=8YFLogxK
U2 - 10.1109/TCOMM.2022.3220870
DO - 10.1109/TCOMM.2022.3220870
M3 - Article
AN - SCOPUS:85141623316
SN - 0090-6778
VL - 71
SP - 101
EP - 114
JO - IEEE TRANSACTIONS ON COMMUNICATIONS
JF - IEEE TRANSACTIONS ON COMMUNICATIONS
IS - 1
ER -