<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

Non-imaging Medical Data Synthesis for Trustworthy AI: A Comprehensive Survey

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 09 Apr 2024Embargo end date: 01 Jan 2022Publisher:Association for Computing Machinery (ACM)Journal:ACM Computing Surveys, volume 56, pages 1-35 (issn: 0360-0300, eissn: 1557-7341,

Authors: Xiaodan Xing; Huanjun Wu; Lichao Wang; Iain Stenson; May Yong; Javier Del Ser; Simon Walsh; +1 Authors

doi: 10.1145/3614425 , 10.48550/arxiv.2209.09239

arXiv: http://arxiv.org/abs/2209.09239

Non-imaging Medical Data Synthesis for Trustworthy AI: A Comprehensive Survey

- Summary
- Subjects
- Metrics

Abstract

Data quality is a key factor in the development of trustworthy AI in healthcare. A large volume of curated datasets with controlled confounding factors can improve the accuracy, robustness, and privacy of downstream AI algorithms. However, access to high-quality datasets is limited by the technical difficulties of data acquisition, and large-scale sharing of healthcare data is hindered by strict ethical restrictions. Data synthesis algorithms, which generate data with distributions similar to real clinical data, can serve as a potential solution to address the scarcity of good quality data during the development of trustworthy AI. However, state-of-the-art data synthesis algorithms, especially deep learning algorithms, focus more on imaging data while neglecting the synthesis of non-imaging healthcare data, including clinical measurements, medical signals and waveforms, and electronic healthcare records (EHRs). Therefore, in this article, we will review synthesis algorithms, particularly for non-imaging medical data, with the aim of providing trustworthy AI in this domain. This tutorial-style review article will provide comprehensive descriptions of non-imaging medical data synthesis, covering aspects such as algorithms, evaluations, limitations, and future research directions.

Related Organizations

Imperial College London
United Kingdom
The Alan Turing Institute
United Kingdom

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	6
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%