
In recent years, partially due to the rise of AI, the trend in cancer research is to combine data from different sources and different modalities. For combining data from different data types within one modality, the main challenges include a lack of standardisation in the data format, data model, metadata model, and data access procedures. Hence, there is a clear need for harmonisation and interoperability. When combining and integrating data from different modalities, these challenges become even more pressing. In EOSC4Cancer, we focus on all these aspects in the context of the use cases that cover the patient journey from cancer prevention to diagnosis to treatment, laying the foundation of data trajectories and workflows for future cancer mission projects. To address the above challenges, in this deliverable, we formulate standard operating procedures (SOPs) per data type for data access, data models, and data interoperability. The results are SOPs for seven data types widely used in cancer research: exposome, cancer registry, screening, clinical, genomic, radiology, and pathology data. Instead of niche SOPs for the EOSC4Cancer datasets, we provide more general considerations and guidelines, so they can be used by the broader community. Lastly, we provide some insights in next steps to improve these SOPs.
EOSC4Cancer
EOSC4Cancer
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
