Name: Federated Deep Reinforcement Learning for ENDC Optimization
Keywords: Optimization, Handover, Learning agent, Optimal network, Deep reinforcement learning algorithm, Network capacity, Federated learning, Heterogeneous network, Deep neural network, Small step

descriptionPublicationkeyboard_double_arrow_right Article 01 Jun 2025Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Mobile Computing, volume 24, pages 5,525-5,535 (issn: 1536-1233, eissn: 2161-9875,

Authors: Adrian Martin; Isabel de-la-Bandera; Adriano Mendo; Jose Outes; Juan Ramiro; Raquel Barco;

doi: 10.1109/tmc.2025.3534661

handle: 10630/38565

Federated Deep Reinforcement Learning for ENDC Optimization

- Summary
- Subjects
- Metrics

Abstract

5G New Radio (NR) network deployment in Non-Stand Alone (NSA) mode means that 5G networks rely on the control plane of existing Long Term Evolution (LTE) modules for control functions, while 5G modules are only dedicated to the user plane tasks, which could also be carried out by LTE modules simultaneously. The first deployments of 5G networks are essentially using this technology. These deployments enable what is known as E-UTRAN NR Dual Connectivity (ENDC), where a user establish a 5G connection simultaneously with a pre-existing LTE connection to boost their data rate. In this paper, a single Federated Deep Reinforcement Learning (FDRL) agent for the optimization of the event that triggers the dual connectivity between LTE and 5G is proposed. First, single Deep Reinforcement Learning (DRL) agents are trained in isolated cells. Later, these agents are merged into a unique global agent capable of optimizing the whole network with Federated Learning (FL). This scheme of training single agents and merging them also makes feasible the use of dynamic simulators for this type of learning algorithm and parameters related to mobility, by drastically reducing the number of possible combinations resulting in fewer simulations. The simulation results show that the final agent is capable of achieving a tradeoff between dropped calls and the user throughput to achieve global optimum without the need for interacting with all the cells for training.

This work was supported in part by Ericsson under Grant MA-2020-003774, through Project 702C2000043 in part by R&D&I Support Program Line through the Junta de Andalucía (Andalusian Regional Government) in part by the Ministerio de Asuntos Económicos y Transformación Digital in part by European Union - NextGenerationEU, and in part by the Recuperación, Transformación y Resiliencia y elMecanismo de Recuperación y Resiliencia through Project MAORI.

Related Organizations

University of Malaga
Spain

Keywords

Optimization, Handover, Learning agent, Optimal network, Deep reinforcement learning algorithm, Network capacity, Federated learning, Heterogeneous network, Deep neural network, Small step, Deep reinforcement learning agent, Key performance indicators, Aprendizaje automático (Inteligencia artificial), Reinforcement learning algorithm, Training, Heuristic algorithms, Training phase, Training cell, Individual agency, Neighboring cells, Long term evolution, Deep reinforcement learning, 5 G NSA, Telecomunicaciones, Multi party computation, Hysteresis, Cell clusters, Rest of the cells, Reinforcement learning agent, Event B 1, Cellular networks, Deep learning, RAN Optimization, Rate of network, Throughput, 5 G Mobile communication, User equipment, Mobile edge computing

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average