CompilerDream: Learning a Compiler World Model for General Code Optimization

Name: CompilerDream: Learning a Compiler World Model for General Code Optimization
Keywords: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Programming Languages, Programming Languages (cs.PL), Machine Learning (cs.LG)

Chaoyi Deng; Jialong Wu; Ningya Feng; Jianmin Wang; Mingsheng Long

arXiv.org e-Print Archive

Preprint . 2024

Data sources: arXiv.org e-Print Archive

https://doi.org/10.1145/371189...

Article . 2025 . Peer-reviewed

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2024

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

Publication . Article, Preprint . 03 Aug 2025 . Embargo end date: 01 Jan 2024 . Publisher: ACM . Journal: Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2

doi: 10.1145/3711896.3736887 , 10.48550/arxiv.2404.16077

arXiv: 2404.16077

Abstract

Effective code optimization in compilers is crucial for computer and software engineering. The success of these optimizations depends primarily on the selection and ordering of the optimization passes applied to the code. While most compilers rely on a fixed sequence of optimization passes, current methods for finding the optimal sequence rely either on impractically slow search algorithms or on learning methods that struggle to generalize to code unseen during training. We introduce CompilerDream, a model-based reinforcement learning approach to general code optimization. CompilerDream comprises a compiler world model that accurately simulates the intrinsic properties of optimization passes and an agent trained on this model to produce effective optimization strategies. By training on a large-scale program dataset, CompilerDream is equipped to serve as a general code optimizer across various application scenarios and source-code languages. Our extensive experiments first highlight CompilerDream's strong optimization capabilities for autotuning, where it leads the CompilerGym leaderboard. More importantly, the zero-shot generalization ability of the large-scale trained compiler world model and agent excels across diverse datasets, surpassing LLVM's built-in optimizations and other state-of-the-art methods in both value prediction and end-to-end code optimization.

KDD 2025 camera-ready version with extended appendix. Code is available at https://github.com/thuml/CompilerDream
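
The idea in the abstract, that pass ordering matters and that a strategy can be found inside a learned model of the compiler rather than by querying the compiler itself, can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the pass names, the three-field program state, and the size-based reward are invented, and the tabular model with brute-force planning is a deliberate simplification standing in for the paper's neural world model and trained agent. It is not the CompilerDream implementation.

```python
import itertools
import random

PASSES = ("inline", "dce", "simplify")  # hypothetical pass names
START = (10, False, False)  # toy state: (code size, has dead code, inlined)


def real_step(state, action):
    """Toy stand-in for a real compiler; returns (next_state, size reduction)."""
    size, dead, inlined = state
    if action == "inline" and not inlined:
        nxt = (size + 2, True, True)      # inlining grows code, exposes dead code
    elif action == "dce" and dead:
        nxt = (size - 5, False, inlined)  # dead-code elimination pays off only now
    elif action == "simplify" and size > 8:
        nxt = (size - 1, dead, inlined)
    else:
        nxt = state                        # pass has no effect in this state
    return nxt, state[0] - nxt[0]


def learn_model(episodes=500, horizon=3, seed=0):
    """Record transitions from random rollouts into a tabular 'world model'."""
    rng = random.Random(seed)
    model = {}
    for _ in range(episodes):
        state = START
        for _ in range(horizon):
            action = rng.choice(PASSES)
            nxt, reward = real_step(state, action)
            model[(state, action)] = (nxt, reward)
            state = nxt
    return model


def plan_in_model(model, horizon=3):
    """Pick the best pass sequence using imagined rollouts only."""
    best_seq, best_ret = None, float("-inf")
    for seq in itertools.product(PASSES, repeat=horizon):
        state, ret = START, 0
        for action in seq:
            # Unseen transitions are assumed to be no-ops with zero reward.
            state, reward = model.get((state, action), (state, 0))
            ret += reward
        if ret > best_ret:
            best_seq, best_ret = seq, ret
    return best_seq, best_ret


def run_real(seq):
    """Apply a pass sequence to the toy compiler and sum the size reduction."""
    state, total = START, 0
    for action in seq:
        state, reward = real_step(state, action)
        total += reward
    return state, total


model = learn_model()
sequence, imagined = plan_in_model(model)
final_state, actual = run_real(sequence)
print(sequence, imagined, actual, final_state[0])
```

In this toy setting, running `dce` before `inline` accomplishes nothing, while running it afterwards removes the dead code that inlining exposed, which is the sense in which ordering determines the outcome. The planner never calls `real_step`; only the final evaluation does.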

Related Organizations

Tsinghua University
China (People's Republic of)

Related research

Dataset of "CompilerDream: Learning a Compiler World Model for General Code Optimization"
2025 . IsSupplementedBy

Impact by BIP!

selected citations: 0
popularity: Average
influence: Average
impulse: Average
