Coding Triangle: How Does Large Language Model Understand Code?

Name: Coding Triangle: How Does Large Language Model Understand Code?
Keywords: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Artificial Intelligence, Computation and Language, Computation and Language (cs.CL)

Taolin Zhang 0003; Zihan Ma 0010; Maosong Cao; Junnan Liu; Songyang Zhang 0001; Kai Chen 0026

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2025

Data sources: arXiv.org e-Print Archive

https://dx.doi.org/10.48550/ar...

Article . 2025

License: CC BY

Data sources: Datacite

DBLP

Article

Data sources: DBLP

Coding Triangle: How Does Large Language Model Understand Code?

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2025Embargo end date: 01 Jan 2025Publisher:arXivJournal:CoRR, volume abs/2507.06138

Authors: Taolin Zhang 0003; Zihan Ma 0010; Maosong Cao; Junnan Liu; Songyang Zhang 0001; Kai Chen 0026;

doi: 10.48550/arxiv.2507.06138

arXiv: 2507.06138

Coding Triangle: How Does Large Language Model Understand Code?

- Summary
- Subjects
- Metrics

Abstract

Large language models (LLMs) have achieved remarkable progress in code generation, yet their true programming competence remains underexplored. We introduce the Code Triangle framework, which systematically evaluates LLMs across three fundamental dimensions: editorial analysis, code implementation, and test case generation. Through extensive experiments on competitive programming benchmarks, we reveal that while LLMs can form a self-consistent system across these dimensions, their solutions often lack the diversity and robustness of human programmers. We identify a significant distribution shift between model cognition and human expertise, with model errors tending to cluster due to training data biases and limited reasoning transfer. Our study demonstrates that incorporating human-generated editorials, solutions, and diverse test cases, as well as leveraging model mixtures, can substantially enhance both the performance and robustness of LLMs. Furthermore, we reveal both the consistency and inconsistency in the cognition of LLMs that may facilitate self-reflection and self-improvement, providing a potential direction for developing more powerful coding models.

Keywords

FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Artificial Intelligence, Computation and Language, Computation and Language (cs.CL)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Related to Research communities

Knowmad Institut