Skip to main navigation Skip to search Skip to main content

Adaptive proportional fair parameterization based LTE scheduling using continuous actor-critic reinforcement learning

  • Ioan-Sorin Comşa
  • , Sijing Zhang
  • , Mehmet Emin Aydin
  • , Jianping Chen
  • , Pierre Kuonen
  • , Jean–Frédéric Wagen
  • University of Applied Sciences Western Switzerland

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

19 Citations (Scopus)

Abstract

Maintaining a desired trade-off performance between system throughput maximization and user fairness satisfaction constitutes a problem that is still far from being solved. In LTE systems, different tradeoff levels can be obtained by using a proper parameterization of the Generalized Proportional Fair (GPF) scheduling rule. Our approach is able to find the best parameterization policy that maximizes the system throughput under different fairness constraints imposed by the scheduler state. The proposed method adapts and refines the policy at each Transmission Time Interval (TTI) by using the Multi-Layer Perceptron Neural Network (MLPNN) as a non-linear function approximation between the continuous scheduler state and the optimal GPF parameter(s). The MLPNN function generalization is trained based on Continuous Actor-Critic Learning Automata Reinforcement Learning (CACLA RL). The double GPF parameterization optimization problem is addressed by using CACLA RL with two continuous actions (CACLA-2). Five reinforcement learning algorithms as simple parameterization techniques are compared against the novel technology. Simulation results indicate that CACLA-2 performs much better than any of other candidates that adjust only one scheduling parameter such as CACLA-1. CACLA-2 outperforms CACLA-1 by reducing the percentage of TTIs when the system is considered unfair. Being able to attenuate the fluctuations of the obtained policy, CACLA-2 achieves enhanced throughput gain when severe changes in the scheduling environment occur, maintaining in the same time the fairness optimality condition.
Original languageEnglish
Title of host publicationnan
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages4387-4393
ISBN (Print)9781479935116
DOIs
Publication statusPublished - 12 Feb 2015
Event2014 IEEE Global Communications Conference - Austin
Duration: 8 Dec 201412 Dec 2014

Conference

Conference2014 IEEE Global Communications Conference
CityAustin
Period8/12/1412/12/14
Other2014 IEEE Global Communications Conference (08/12/2014-12/12/2014, Austin)

Keywords

  • CACLA-1
  • CACLA-2
  • CQI
  • Fairness
  • GPF
  • LTE-A
  • MLPNN
  • RL
  • Scheduling rule
  • TTI
  • Throughput
  • policy

Fingerprint

Dive into the research topics of 'Adaptive proportional fair parameterization based LTE scheduling using continuous actor-critic reinforcement learning'. Together they form a unique fingerprint.

Cite this