King's College London

Research portal

Channel Selection and Power Control for D2D Communication via Online Reinforcement Learning

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Original languageEnglish
Title of host publicationICC 2021 - IEEE International Conference on Communications, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728171227
DOIs
PublishedJun 2021
Event2021 IEEE International Conference on Communications, ICC 2021 - Virtual, Online, Canada
Duration: 14 Jun 202123 Jun 2021

Publication series

NameIEEE International Conference on Communications
ISSN (Print)1550-3607

Conference

Conference2021 IEEE International Conference on Communications, ICC 2021
Country/TerritoryCanada
CityVirtual, Online
Period14/06/202123/06/2021

Bibliographical note

Publisher Copyright: © 2021 IEEE. Copyright: Copyright 2021 Elsevier B.V., All rights reserved.

King's Authors

Abstract

In this paper, we address the problem of Device-to-Device communication (D2D) in the next generations of cellular networks, where the number of D2D pairs can grow large and, hence, improving spectral efficiency becomes a crucial design factor. More specifically, decentralized channel selection and power control by D2D pairs for interference mitigation without inflicting a heavy controlling overhead on the network become significant challenges in allocating resources. To this end, we introduce an online distributed reinforcement learning algorithm at D2D pairs to maximize network throughput, while guaranteeing both D2D users' and cellular users' (CUs) Quality of Service (QoS) under the dynamic wireless channel environment. To track and evaluate the performance of the proposed online algorithm, we define three metrics, i.e., D2D collision probability, D2D access rate and time-average network throughput. The simulation results confirm the convergence property of the proposed algorithm and shows improved performance in terms of three defined metrics as compared to the celebrated Q-learning-based method.

View graph of relations

© 2020 King's College London | Strand | London WC2R 2LS | England | United Kingdom | Tel +44 (0)20 7836 5454