Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Conference object . 2025
License: CC BY
Data sources: Datacite
ZENODO
Conference object . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Practical guide to Artificial Intelligence risks and mitigations for Trusted Research Environments and the RELEASE-AI framework

Authors: Crespi-Boixader, Alba; Simon, Li; Liley, James; Ward, Laura; Cole, Christian; Smith, Jim;

Practical guide to Artificial Intelligence risks and mitigations for Trusted Research Environments and the RELEASE-AI framework

Abstract

Lay summary Trusted Research Environments (TREs) are secure computing environments providing access to sensitive or personal data, such as electronic healthcare records (EHRs), for approved research purposes. With the increasing application of Artificial Intelligence (AI) to patient data, new challenges arise on how to protect individuals’ data from disclosure risks. Here, we present a comprehensive guide to the types of risks different AI methods pose in terms of disclosure and privacy. In many cases, risks can be minimised or mitigated following specific strategies. Preventing release of personal and sensitive data must include everyone involved throughout the project lifecycle from the data controller, the model development team, the model owner, to the product users. Abstract There is substantial demand to use real-world data to inform improvements in the whole patient care cycle. EHRs and other types of personal data, are crucial to these developments and due to the size and complexity of the data AI techniques are increasingly being investigated. An early review and experience on working with TREs during the GRAIMatter and SACRO projects and within the SDC Reboot Community Interest Group shows an urge from TREs to better understand where the AI risks are and the corresponding mitigation strategies in terms of disclosure control. We present a taxonomy of AI risks and associated mitigations to help TREs, and project leads, to support their responsibilities in ensuring data privacy. The main objective is to provide an efficient comprehensive, agnostic, and scalable guide to assess the privacy risk of AI projects using sensitive data in TREs and apply mitigations. This TRE-relevant approach groups AI models according to the type of output they produce, rather than the algorithm used to train the model. This is a counterpart to the Statbarn taxonomy for ‘traditional outputs’, following an equivalent process. Thus, when faced with a project proposal with a release request for new type of AI model, the TRE staff need only to agree with the project team which of a small number of groups the model in question falls into. The rest of the disclosure control process or risk assessment and mitigation flows from there. For example, instance-based models store data. There are two possible mitigations for this risk: 1) anonymise the dataset used to train the model, and remove vectors where possible; 2) release such risky models only via a Model Query Control (MQC) system with controlled access for trusted users, mitigating the likelihood of uncontrolled data leakage. Other groups of models, however, have associated risks that can only be estimated by simulating attacks. SACRO-ML and similar tools can help to demonstrate that no individuals’ data can be identified either fully or partially.

Related Organizations
  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Green