
handle: 10630/38565
5G New Radio (NR) network deployment in Non-Stand Alone (NSA) mode means that 5G networks rely on the control plane of existing Long Term Evolution (LTE) modules for control functions, while 5G modules are only dedicated to the user plane tasks, which could also be carried out by LTE modules simultaneously. The first deployments of 5G networks are essentially using this technology. These deployments enable what is known as E-UTRAN NR Dual Connectivity (ENDC), where a user establish a 5G connection simultaneously with a pre-existing LTE connection to boost their data rate. In this paper, a single Federated Deep Reinforcement Learning (FDRL) agent for the optimization of the event that triggers the dual connectivity between LTE and 5G is proposed. First, single Deep Reinforcement Learning (DRL) agents are trained in isolated cells. Later, these agents are merged into a unique global agent capable of optimizing the whole network with Federated Learning (FL). This scheme of training single agents and merging them also makes feasible the use of dynamic simulators for this type of learning algorithm and parameters related to mobility, by drastically reducing the number of possible combinations resulting in fewer simulations. The simulation results show that the final agent is capable of achieving a tradeoff between dropped calls and the user throughput to achieve global optimum without the need for interacting with all the cells for training.
This work was supported in part by Ericsson under Grant MA-2020-003774, through Project 702C2000043 in part by R&D&I Support Program Line through the Junta de Andalucía (Andalusian Regional Government) in part by the Ministerio de Asuntos Económicos y Transformación Digital in part by European Union - NextGenerationEU, and in part by the Recuperación, Transformación y Resiliencia y elMecanismo de Recuperación y Resiliencia through Project MAORI.
Optimization, Handover, Learning agent, Optimal network, Deep reinforcement learning algorithm, Network capacity, Federated learning, Heterogeneous network, Deep neural network, Small step, Deep reinforcement learning agent, Key performance indicators, Aprendizaje automático (Inteligencia artificial), Reinforcement learning algorithm, Training, Heuristic algorithms, Training phase, Training cell, Individual agency, Neighboring cells, Long term evolution, Deep reinforcement learning, 5 G NSA, Telecomunicaciones, Multi party computation, Hysteresis, Cell clusters, Rest of the cells, Reinforcement learning agent, Event B 1, Cellular networks, Deep learning, RAN Optimization, Rate of network, Throughput, 5 G Mobile communication, User equipment, Mobile edge computing
Optimization, Handover, Learning agent, Optimal network, Deep reinforcement learning algorithm, Network capacity, Federated learning, Heterogeneous network, Deep neural network, Small step, Deep reinforcement learning agent, Key performance indicators, Aprendizaje automático (Inteligencia artificial), Reinforcement learning algorithm, Training, Heuristic algorithms, Training phase, Training cell, Individual agency, Neighboring cells, Long term evolution, Deep reinforcement learning, 5 G NSA, Telecomunicaciones, Multi party computation, Hysteresis, Cell clusters, Rest of the cells, Reinforcement learning agent, Event B 1, Cellular networks, Deep learning, RAN Optimization, Rate of network, Throughput, 5 G Mobile communication, User equipment, Mobile edge computing
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
