Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms

Name: Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms
Keywords: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Science - Neural and Evolutionary Computing, Neural and Evolutionary Computing (cs.NE)

Niki van Stein; Anna V. Kononova; Lars Kotthoff; Thomas Bäck

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2025

Data sources: arXiv.org e-Print Archive

https://doi.org/10.1145/371225...

Article . 2025 . Peer-reviewed

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2025

License: CC BY

Data sources: Datacite

Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 13 Jul 2025Embargo end date: 01 Jan 2025Publisher:ACMJournal:Proceedings of the Genetic and Evolutionary Computation Conference

Authors: Niki van Stein; Anna V. Kononova; Lars Kotthoff; Thomas Bäck;

doi: 10.1145/3712256.3726328 , 10.48550/arxiv.2503.16668

arXiv: 2503.16668

Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms

- Summary
- Subjects
- Metrics

Abstract

Large Language Models (LLMs) have demonstrated great promise in generating code, especially when used inside an evolutionary computation framework to iteratively optimize the generated algorithms. However, in some cases they fail to generate competitive algorithms or the code optimization stalls, and we are left with no recourse because of a lack of understanding of the generation process and generated codes. We present a novel approach to mitigate this problem by enabling users to analyze the generated codes inside the evolutionary process and how they evolve over repeated prompting of the LLM. We show results for three benchmark problem classes and demonstrate novel insights. In particular, LLMs tend to generate more complex code with repeated prompting, but additional complexity can hurt algorithmic performance in some cases. Different LLMs have different coding ``styles'' and generated code tends to be dissimilar to other LLMs. These two findings suggest that using different LLMs inside the code evolution frameworks might produce higher performing code than using only one LLM.

Accepted at GECCO 2025

Related Organizations

University of Wyoming
United States
Leiden University
Netherlands

Keywords

FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Science - Neural and Evolutionary Computing, Neural and Evolutionary Computing (cs.NE)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Green

Related to Research communities

Netherlands Research Portal