Generating Privacy Stories From Software Documentation

Keywords: Software Engineering (cs.SE), Artificial Intelligence (cs.AI), FOS: Computer and information sciences

Publication type: Article, Preprint
Date: 01 Jan 2025
Embargo end date: 01 Jan 2025
Publisher: arXiv

Authors: Baldwin, Wilder; Chintakuntla, Shashank; Parajuli, Shreyah; Pourghasemi, Ali; Shanz, Ryan; Ghanavati, Sepideh

doi: 10.48550/arxiv.2506.23014

arXiv: http://arxiv.org/abs/2506.23014

Abstract

Research shows that analysts and developers treat privacy as a security concept or as an afterthought, which may lead to non-compliance and violations of users' privacy. Most current approaches, however, focus on extracting legal requirements from regulations and evaluating whether software and processes comply with them. In this paper, we develop a novel approach based on chain-of-thought (CoT) prompting, in-context learning (ICL), and Large Language Models (LLMs) to extract privacy behaviors from various software documents prior to and during software development, and then generate privacy requirements in the format of user stories. Our results show that the most commonly used LLMs, such as GPT-4o and Llama 3, can identify privacy behaviors and generate privacy user stories with F1 scores exceeding 0.8. We also show that the performance of these models could be improved through parameter tuning. Our findings provide insight into using and optimizing LLMs for generating privacy requirements from software documents created prior to or throughout the software development lifecycle.

Accepted to RENext!'25 at the 33rd IEEE International Requirements Engineering Conference (RE 2025)

Impact by BIP!

citations: 0 (an alternative to the "Influence" indicator, which also reflects the overall impact of an article in the research community at large, based on the underlying citation network, diachronically)
popularity: Average (the "current" impact or attention of an article in the research community at large, based on the underlying citation network)
influence: Average (the overall impact of an article in the research community at large, based on the underlying citation network, diachronically)
impulse: Average (the initial momentum of an article directly after its publication, based on the underlying citation network)

Related to Research communities

Knowmad Institut