Practical guide to Artificial Intelligence risks and mitigations for Trusted Research Environments and the RELEASE-AI framework

Crespi-Boixader, Alba; Simon, Li; Liley, James; Ward, Laura; Cole, Christian; Smith, Jim

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Other literature type . 2025

License: CC BY

Data sources: ZENODO

ZENODO

Conference object . 2025

License: CC BY

Data sources: Datacite

ZENODO

Conference object . 2025

License: CC BY

Data sources: Datacite

Practical guide to Artificial Intelligence risks and mitigations for Trusted Research Environments and the RELEASE-AI framework

descriptionPublicationkeyboard_double_arrow_right Conference object , Other literature type 30 Oct 2025Publisher:Zenodo

Authors: Crespi-Boixader, Alba; Simon, Li; Liley, James; Ward, Laura; Cole, Christian; Smith, Jim;

doi: 10.5281/zenodo.17483593 , 10.5281/zenodo.17483594

Practical guide to Artificial Intelligence risks and mitigations for Trusted Research Environments and the RELEASE-AI framework

- Summary
- Metrics

Abstract

Lay summary Trusted Research Environments (TREs) are secure computing environments providing access to sensitive or personal data, such as electronic healthcare records (EHRs), for approved research purposes. With the increasing application of Artificial Intelligence (AI) to patient data, new challenges arise on how to protect individuals’ data from disclosure risks. Here, we present a comprehensive guide to the types of risks different AI methods pose in terms of disclosure and privacy. In many cases, risks can be minimised or mitigated following specific strategies. Preventing release of personal and sensitive data must include everyone involved throughout the project lifecycle from the data controller, the model development team, the model owner, to the product users. Abstract There is substantial demand to use real-world data to inform improvements in the whole patient care cycle. EHRs and other types of personal data, are crucial to these developments and due to the size and complexity of the data AI techniques are increasingly being investigated. An early review and experience on working with TREs during the GRAIMatter and SACRO projects and within the SDC Reboot Community Interest Group shows an urge from TREs to better understand where the AI risks are and the corresponding mitigation strategies in terms of disclosure control. We present a taxonomy of AI risks and associated mitigations to help TREs, and project leads, to support their responsibilities in ensuring data privacy. The main objective is to provide an efficient comprehensive, agnostic, and scalable guide to assess the privacy risk of AI projects using sensitive data in TREs and apply mitigations. This TRE-relevant approach groups AI models according to the type of output they produce, rather than the algorithm used to train the model. This is a counterpart to the Statbarn taxonomy for ‘traditional outputs’, following an equivalent process. Thus, when faced with a project proposal with a release request for new type of AI model, the TRE staff need only to agree with the project team which of a small number of groups the model in question falls into. The rest of the disclosure control process or risk assessment and mitigation flows from there. For example, instance-based models store data. There are two possible mitigations for this risk: 1) anonymise the dataset used to train the model, and remove vectors where possible; 2) release such risky models only via a Model Query Control (MQC) system with controlled access for trusted users, mitigating the likelihood of uncontrolled data leakage. Other groups of models, however, have associated risks that can only be estimated by simulating attacks. SACRO-ML and similar tools can help to demonstrate that no individuals’ data can be identified either fully or partially.

Related Organizations

Durham University
United Kingdom
University of Dundee
United Kingdom
University of the West of England
United Kingdom

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green