Toward Optimizing Reinforcement Learning Workload Placement at the Cloud-Edge Continuum in 6G Networks: A Scaled RL Framework

Ghafouri, Navideh; Vardakas, John; Ramantas, Kostas; Verikoukis, Christos

Conference object, 2026

License: CC BY

Data sources: ZENODO

Publication: Article, Conference object. 25 May 2026. English. Publisher: Zenodo

doi: 10.5281/zenodo.18875433, 10.5281/zenodo.18875434

Abstract

With the increasing deployment of Reinforcement Learning (RL) for network optimization at the edge of wireless networks, the RL workload emerges as a significant challenge. While the placement of general Machine Learning workloads across the cloud–edge continuum has been widely studied, existing solutions typically exclude RL techniques due to their distinct structure and operational requirements. In this work, we propose a framework for RL workload placement in the cloud–edge continuum, enabling the scaling of RL actor processes across both domains. In this framework, agents that interact with the environment through simple feedback loops are deployed at the edge, while training and model storage are performed in the cloud, where sufficient computational resources are available. We implement and simulate a prototype of one scaled RL actor that performs Quality-of-Service-aware resource block assignment with separate threads for environment interaction, inference, buffering/sampling, and the learning process. Finally, we outline the open challenges of the proposed framework.
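
As a rough illustration of the four-thread actor layout the abstract describes, the following minimal Python sketch connects environment interaction, inference, buffering/sampling, and learning through queues. It is not the authors' prototype: the toy environment, the reward rule, the tabular value update, and every name and constant (`run_actor`, `NUM_QOS_CLASSES`, `NUM_BLOCKS`, `BATCH_SIZE`) are assumptions made for this example, and all threads run in one process rather than being split between edge and cloud.

```python
import queue
import random
import threading

NUM_QOS_CLASSES = 3   # hypothetical state space: QoS class of the requesting user
NUM_BLOCKS = 4        # hypothetical action space: resource block index
BATCH_SIZE = 16       # hypothetical training batch size


def run_actor(num_steps=160, seed=0):
    """Run one toy actor with four cooperating threads and return (stats, q_table)."""
    obs_q, act_q = queue.Queue(), queue.Queue()      # environment <-> inference
    trans_q, batch_q = queue.Queue(), queue.Queue()  # environment -> buffer -> learner
    q_table = [[0.0] * NUM_BLOCKS for _ in range(NUM_QOS_CLASSES)]
    lock = threading.Lock()  # guards q_table, shared by inference and learner
    stats = {"transitions": 0, "batches": 0, "updates": 0}

    def environment():
        rng = random.Random(seed)
        for _ in range(num_steps):
            state = rng.randrange(NUM_QOS_CLASSES)
            obs_q.put(state)
            action = act_q.get()
            # Toy reward (assumption): a block index equal to the QoS class is a match.
            reward = 1.0 if action == state else 0.0
            trans_q.put((state, action, reward))
        obs_q.put(None)    # sentinel: stop inference
        trans_q.put(None)  # sentinel: stop buffering

    def inference():
        rng = random.Random(seed + 1)
        while (state := obs_q.get()) is not None:
            if rng.random() < 0.1:  # epsilon-greedy exploration
                action = rng.randrange(NUM_BLOCKS)
            else:
                with lock:
                    row = list(q_table[state])
                action = row.index(max(row))
            act_q.put(action)

    def buffering():
        rng = random.Random(seed + 2)
        buffer = []
        while (item := trans_q.get()) is not None:
            buffer.append(item)
            stats["transitions"] += 1
            if len(buffer) % BATCH_SIZE == 0:
                # Uniformly sample a batch from everything stored so far.
                batch_q.put(rng.sample(buffer, BATCH_SIZE))
                stats["batches"] += 1
        batch_q.put(None)  # sentinel: stop learner

    def learner():
        while (batch := batch_q.get()) is not None:
            with lock:
                for state, action, reward in batch:
                    # Bandit-style update toward the observed reward.
                    q_table[state][action] += 0.1 * (reward - q_table[state][action])
            stats["updates"] += 1

    threads = [threading.Thread(target=f)
               for f in (environment, inference, buffering, learner)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return stats, q_table


if __name__ == "__main__":
    print(run_actor()[0])
```

In a cloud–edge deployment along the lines of the abstract, the queues between buffering and learning would presumably be replaced by a network transport, since training is placed in the cloud; the sketch does not model that step.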

Keywords

6G Networks, Cloud-Edge Continuum, Reinforcement Learning, Workload Placement
