Dynamic Channel Access via Meta-Reinforcement Learning

Ziyang Lu, M. Cenk Gursoy

Research output: Chapter in Book/Entry/Poem › Conference contribution

2 Scopus citations

Abstract

In this paper, we address the channel access problem in a dynamic wireless environment via meta-reinforcement learning. Spectrum is a scarce resource in wireless communications, especially with the dramatic increase in the number of devices in networks. Recently, inspired by the success of deep reinforcement learning (DRL), extensive studies have been conducted on addressing wireless resource allocation problems via DRL. However, training DRL algorithms usually requires a massive amount of data collected from the environment for each specific task, and a well-trained model may fail if there is even a small variation in the environment. In this work, in order to address these challenges, we propose a meta-DRL framework that incorporates the method of Model-Agnostic Meta-Learning (MAML). In the proposed framework, we train a common initialization for similar channel selection tasks. We show that, starting from this initialization, only a few gradient descent steps are required to adapt to different tasks drawn from the same distribution. We demonstrate the performance improvements via simulation results.
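The idea of a shared initialization that adapts in a few gradient steps can be illustrated with a small toy sketch. This is not the authors' implementation: it assumes a stateless softmax policy over four hypothetical channels instead of a deep network, uses exact expected-reward gradients instead of sampled DRL updates, and applies the first-order approximation of MAML (the meta-gradient is taken at the adapted parameters and second-order terms are ignored). The task distribution, step sizes, and all function names are assumptions made for illustration only.

```python
import math
import random

N_CHANNELS = 4    # hypothetical number of channels
ALPHA = 1.0       # inner (adaptation) step size, assumed
BETA = 0.5        # outer (meta) step size, assumed
INNER_STEPS = 3   # "a few" gradient steps per task


def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def expected_reward(theta, p):
    """Expected success rate of a softmax channel-selection policy."""
    return sum(a * b for a, b in zip(softmax(theta), p))


def reward_grad(theta, p):
    """Exact gradient of the expected reward with respect to the logits."""
    pi = softmax(theta)
    j = sum(a * b for a, b in zip(pi, p))
    return [a * (b - j) for a, b in zip(pi, p)]


def adapt(theta, p, steps=INNER_STEPS):
    """Inner loop: a few gradient ascent steps on a single task."""
    theta = list(theta)
    for _ in range(steps):
        g = reward_grad(theta, p)
        theta = [t + ALPHA * gi for t, gi in zip(theta, g)]
    return theta


def sample_task(rng):
    """Toy task distribution (assumed): the good channel is channel 0 or 1."""
    good = rng.choice([0, 1])
    return [0.9 if c == good else 0.1 for c in range(N_CHANNELS)]


def meta_train(iterations=200, tasks_per_batch=8, seed=0):
    """Outer loop: first-order MAML over batches of sampled tasks."""
    rng = random.Random(seed)
    theta = [0.0] * N_CHANNELS
    for _ in range(iterations):
        meta_grad = [0.0] * N_CHANNELS
        for _ in range(tasks_per_batch):
            p = sample_task(rng)
            adapted = adapt(theta, p)
            # First-order approximation: gradient evaluated at adapted params.
            g = reward_grad(adapted, p)
            meta_grad = [m + gi / tasks_per_batch for m, gi in zip(meta_grad, g)]
        theta = [t + BETA * m for t, m in zip(theta, meta_grad)]
    return theta


if __name__ == "__main__":
    theta_meta = meta_train()
    new_task = [0.1, 0.9, 0.1, 0.1]  # an unseen draw from the same distribution
    r_meta = expected_reward(adapt(theta_meta, new_task), new_task)
    r_scratch = expected_reward(adapt([0.0] * N_CHANNELS, new_task), new_task)
    print("adapted from meta-initialization:", round(r_meta, 3))
    print("adapted from zero initialization:", round(r_scratch, 3))
```

In this toy setting the meta-learned initialization concentrates probability on the channels that are ever good, so the same three adaptation steps reach a higher expected success rate than they do from an uninformed start. The full MAML update would also differentiate through the inner steps; the first-order variant is used here only to keep the sketch short.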

Original language: English (US)
Title of host publication: 2021 IEEE Global Communications Conference, GLOBECOM 2021 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781728181042
State: Published - 2021
Event: 2021 IEEE Global Communications Conference, GLOBECOM 2021 - Madrid, Spain
Duration: Dec 7 2021 → Dec 11 2021

Publication series

Name: 2021 IEEE Global Communications Conference, GLOBECOM 2021 - Proceedings

Conference

Conference: 2021 IEEE Global Communications Conference, GLOBECOM 2021
Country/Territory: Spain
City: Madrid
Period: 12/7/21 → 12/11/21

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Health Informatics
