Name: Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning
Keywords: FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computation and Language (cs.CL)

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 11 Apr 2025Embargo end date: 01 Jan 2024Publisher:Association for the Advancement of Artificial Intelligence (AAAI)Journal:Proceedings of the AAAI Conference on Artificial Intelligence, volume 39, pages 25,949-25,957 (issn: 2159-5399, eissn: 2374-3468,

Authors: Zhang, Xinlu; Chen, Zhiyu Zoey; Ye, Xi; Yang, Xianjun; Chen, Lichang; Wang, William Yang; Petzold, Linda Ruth;

doi: 10.1609/aaai.v39i24.34789 , 10.48550/arxiv.2405.20535

arXiv: 2405.20535

Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Instruction Fine-Tuning (IFT) significantly enhances the zero-shot capabilities of pretrained Large Language Models (LLMs). While coding data is known to boost LLM reasoning abilities during pretraining, its role in activating internal reasoning capacities during IFT remains understudied. This paper investigates a key question: How does coding data impact LLMs' reasoning capacities during IFT stage? To explore this, we thoroughly examine the impact of coding data across different coding data proportions, model families, sizes, and reasoning domains, from various perspectives. Specifically, we create three IFT datasets with increasing coding data proportions, fine-tune six LLM backbones across different families and scales on these datasets, evaluate the tuned models' performance across twelve tasks in three reasoning domains, and analyze the outcomes from three broad-to-granular perspectives: overall, domain-level, and task-specific. Our holistic analysis provides valuable insights into each perspective. First, coding data tuning enhances the overall reasoning capabilities of LLMs across different model families and scales. Moreover, while the impact of coding data varies by domain, it shows consistent trends within each domain across different model families and scales. Additionally, coding data generally provides comparable task-specific benefits across model families, with optimal proportions in IFT datasets being task-dependent.

Related Organizations

University of California System
United States
University of Maryland
United States
The University of Texas at Dallas
United States
The University of Texas at Austin
United States
The University of Texas System
United States

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computation and Language (cs.CL)

1 Research products, page 1 of 1

stanford_alpaca software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Funded by

NIH| Leveraging Artificial Intelligence Solutions to Develop Digital Biomarkers for Precision Trauma Resuscitation

Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning

Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning

1 Research products, page 1 of 1

stanford_alpaca software on GitHub