Automated Unit Test Generation for the Google Test Framework Using Large Language Models : An Industrial Case Study

Name: Automated Unit Test Generation for the Google Test Framework Using Large Language Models : An Industrial Case Study
Creator: Lundberg, Albert
Keywords: Machine Learning, Unit Tests, LLM, Telekommunikation, Large Language Models, Test generation, Telecommunications, C++, Similarity

Lundberg, Albert

Found an issue? Give us feedback

Publikationer från L...arrow_drop_down

Publikationer från Linköpings universitet

Bachelor thesis . 2024

Data sources: Publikationer från Linköpings universitet

Automated Unit Test Generation for the Google Test Framework Using Large Language Models : An Industrial Case Study

descriptionPublicationkeyboard_double_arrow_right Bachelor thesis 01 Jan 2024 English Publisher:Linköpings universitet, Institutionen för datavetenskap

Authors: Lundberg, Albert;

Automated Unit Test Generation for the Google Test Framework Using Large Language Models : An Industrial Case Study

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Unit tests serve a critical role in software development in ensuring quality and pre-dictability for components of code. While unit tests are important, they are a resourceheavy part of the development life cycle. Hence, the automation of unit tests is an inten-sive research area. This paper looks at the use of Large Language Models (LLMs) to achievethis task by looking at both fine-tuning and in-context learning of various sizes in order toautomatically generate unit tests given a function, and seeing how they perform in terms ofa similarity score. In order to achieve this, this thesis proposes an implementation pipelinewhich consist of creating a dataset that maps functions to unit tests, training an LLM, Wiz-ardCoder, of various sizes on a GPU-cluster, running inference, and evaluating the qualityof the generated tests. These results that the fine-tuned models consistently outperformthe models using in-context learning, independent of size. Moreover, within the scope offine-tuned models, the models using more parameters performed slightly better than theirsmaller counterparts.

Related Organizations

Linköping University
Sweden

Keywords

Machine Learning, Unit Tests, LLM, Telekommunikation, Large Language Models, Test generation, Telecommunications, C++, Similarity

1 Research products, page 1 of 1

googletest software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Automated Unit Test Generation for the Google Test Framework Using Large Language Models : An Industrial Case Study

Automated Unit Test Generation for the Google Test Framework Using Large Language Models : An Industrial Case Study

1 Research products, page 1 of 1

googletest software on GitHub