HPC-Coder: Modeling Parallel Programs using Large Language Models

Publication: Article, Preprint
Date: 01 May 2024
Embargo end date: 01 Jan 2023
Publisher: IEEE
Journal: ISC High Performance 2024 Research Paper Proceedings (39th International Conference)
Funded by: NSF | CAREER: Self-tuning Paral...

Authors: Nichols, Daniel; Marathe, Aniruddha; Menon, Harshitha; Gamblin, Todd; Bhatele, Abhinav

doi: 10.23919/isc.2024.10528929, 10.48550/arxiv.2306.17281

arXiv: 2306.17281

Abstract

Parallel programs in high performance computing (HPC) continue to grow in complexity and scale in the exascale era. The diversity in hardware and parallel programming models makes developing, optimizing, and maintaining parallel software even more burdensome for developers. One way to alleviate some of these burdens is with automated development and analysis tools. Such tools can perform complex and/or remedial tasks for developers, increasing their productivity and decreasing the chance of error. Until recently, such tools for code development and performance analysis have been limited in the complexity of tasks they can perform, especially for parallel programs. However, with recent advancements in language modeling and the availability of large amounts of open-source, code-related data, these tools have started to utilize predictive language models to automate more complex tasks. In this paper, we show how large language models (LLMs) can be applied to tasks specific to high performance and scientific codes. We introduce a new dataset of HPC and scientific codes and use it to fine-tune several pre-trained models. We compare several pre-trained LLMs on HPC-related tasks and introduce a new model, HPC-Coder, fine-tuned on parallel codes. In our experiments, we show that this model can auto-complete HPC functions where generic models cannot, decorate for loops with OpenMP pragmas, and model performance changes in scientific application repositories as well as programming competition solutions.
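
To make the "decorate for loops with OpenMP pragmas" task concrete, the following is a minimal illustrative sketch of what a decorated loop can look like. It is not taken from the paper or produced by HPC-Coder; the function name `dot_product` and its parameters are hypothetical and chosen only for illustration.

```c
/* Hypothetical example: dot product of two arrays of length n. */
double dot_product(const double *a, const double *b, int n)
{
    double sum = 0.0;

    /* The iterations are independent apart from the accumulation into sum,
       so a reduction clause lets OpenMP combine per-thread partial sums.
       Without OpenMP support enabled, the pragma is ignored and the loop
       runs serially with the same result. */
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < n; i++) {
        sum += a[i] * b[i];
    }
    return sum;
}
```

With GCC or Clang, building with `-fopenmp` enables the pragma. The task described in the abstract amounts to choosing a suitable directive and clauses (here `parallel for` with `reduction(+:sum)`) for a given undecorated loop.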

Related Organizations

Lawrence Berkeley National Laboratory, United States
Department of Computer Science, University of Maryland, United States
Lawrence Livermore National Laboratory, United States
University of Maryland, College Park, United States
University of Maryland, United States

Keywords

FOS: Computer and information sciences; Artificial Intelligence (cs.AI); Computer Science - Distributed, Parallel, and Cluster Computing; Computer Science - Artificial Intelligence; Distributed, Parallel, and Cluster Computing (cs.DC)

Impact by BIP!

- selected citations: 14. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Open access route: Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Funded by

NSF | CAREER: Self-tuning Parallel Software and Systems