Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Preprint . 2026
Data sources: ZENODO
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Preprint . 2026
Data sources: ZENODO
ZENODO
Preprint . 2026
Data sources: Datacite
ZENODO
Preprint . 2026
Data sources: Datacite
ZENODO
Preprint . 2026
Data sources: Datacite
versions View all 3 versions
addClaim

Quantum-Enhanced Carbon-Aware Scheduling for Large Language Model Inference

Authors: KHELIFI, Assil;

Quantum-Enhanced Carbon-Aware Scheduling for Large Language Model Inference

Abstract

As the computational demand for Large Language Models (LLMs) surges, minimizing the carbon footprint of inference has become a critical challenge. While classical schedulers optimize for throughput, they often neglect the spatial and temporal variance of grid carbon intensity. This paper presents a Hybrid Quantum-Classical (HQC) framework utilizing the Quantum Approximate Optimization Algorithm (QAOA) to solve the layer-to-hardware mapping problem with the explicit objective of minimizing gCO$_2$e emissions. We benchmark our QAOA optimizer against classical Brute Force and Genetic Algorithms across static, dynamic, and noisy environments. Our results demonstrate that QAOA achieves near-perfect optimality (gap<10−5) and successfully adapts to 24-hour grid fluctuations, realizing a simulated carbon saving of 23.76 gCO$_2$e. However, the study also reveals a "Simulation Wall" at N=15 layers, where classical simulation of the quantum circuit becomes computationally prohibitive, whereas Genetic Algorithms maintain speed at the cost of theoretical guarantees. We conclude that QAOA represents a scalable, robust pathway for green AI, provided the optimizer is migrated from simulation to physical Quantum Processing Units (QPUs).

Keywords

Quantum Computing, QAOA, Green AI, Carbon Awareness, Large Language Models, Scheduling, Hybrid Quantum-Classical Computing, Energy Efficiency, Computer Science, Quantum Physics, Artificial Intelligence, Environmental Science, Optimization, Data Centers, Sustainable Computing

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average