ZENODO
Other literature type . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Project deliverable . 2025
License: CC BY
Data sources: Datacite

Deliverable 3.2: Federated workflow execution methods. First release

Authors: Barouh, Maria; Boytcheva, Svetla; Cazzaro, Mirco; Dell'Aglio, Daniele; Fabbian, Luca; Gyurov, Pavlin; Rodríguez, Juan Manuel; +1 Authors

Abstract

This deliverable introduces the Hereditary Data Network (HDN), a privacy-by-design federation architecture for medical data analytics within the HEREDITARY project. HDN addresses the need to perform cross-institutional analyses and model development over sensitive clinical, genomic, and imaging data without centralizing patient-level information, in compliance with regulations such as the GDPR, the Data Governance Act, and the emerging AI Act. HDN provides a unified semantic view of consortium data through the Hereditary Ontology (HERO) and an ontology-mediated query interface. Researchers express information needs as SPARQL queries over a stable conceptual schema, while participating institutions retain full control over storage technologies, local schemas, and disclosure policies. Architecturally, HDN follows a hub-and-spoke model: a central orchestrator manages a vetted catalog of query templates, validates requests, enforces disclosure-level constraints at the semantic boundary, and dispatches instantiated queries to institutional endpoints. Each endpoint operates an Ontology-Based Data Access (OBDA) stack that maps ontology-level queries to its local schema, executes them under local privacy rules, and returns only admissible aggregated or record-level results.

The deliverable makes four main contributions. First, it formalizes the evolution from the initial Ontology-Based Data Federation (OBDF) architecture to a native federation design that embeds privacy enforcement into the core protocol rather than adding it as an external layer. Second, it details the logical and reference implementations of HDN Central and HDN Endpoints, including interaction protocols, query lifecycle, and privacy controls. Third, it presents a benchmark comparing HDN against the legacy OBDF approach, showing improved scalability, more robust behavior as the number of endpoints grows, and better alignment with institutional privacy constraints. Finally, it demonstrates the applicability of HDN through three use cases: (1) federated queries on ALS clinical data at different disclosure levels, (2) a distributed SQL-based implementation of a machine learning algorithm, the Cox survival model, and (3) integration with Ontotext’s LinkedLifeData Inventory for FAIR-compliant external datasets. Together, these results show that HDN provides a practical and extensible foundation for federated analytics in HEREDITARY and prepares the ground for tighter integration with federated learning workflows in future project phases.
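
The hub-and-spoke query lifecycle described in the abstract (template catalog, request validation, disclosure-level check, dispatch to endpoints) can be illustrated with a minimal Python sketch. This is not the HDN reference implementation: every class, function, template identifier, disclosure-level name, and the `hero:` terms in the SPARQL strings are assumptions made for illustration, and the endpoint's OBDA stack is replaced by a plain callable.

```python
from dataclasses import dataclass
from string import Template
from typing import Callable, Dict, List

# Hypothetical disclosure levels, ordered from least to most revealing.
DISCLOSURE_ORDER = {"aggregate": 0, "record": 1}


@dataclass(frozen=True)
class QueryTemplate:
    template_id: str
    sparql: Template   # SPARQL text with $placeholders
    disclosure: str    # level of detail the results expose


@dataclass
class Endpoint:
    name: str
    max_disclosure: str                   # most revealing level the site admits
    execute: Callable[[str], List[dict]]  # stand-in for the local OBDA stack


class Orchestrator:
    """Central hub: only vetted templates are instantiated and dispatched."""

    def __init__(self, catalog: List[QueryTemplate], endpoints: List[Endpoint]):
        self.catalog = {t.template_id: t for t in catalog}
        self.endpoints = endpoints

    def run(self, template_id: str, params: Dict[str, str]) -> Dict[str, List[dict]]:
        # 1. Validate the request against the vetted catalog.
        if template_id not in self.catalog:
            raise ValueError(f"unknown template: {template_id}")
        # Simplistic parameter check (assumption): alphanumeric values only.
        if not all(v.isalnum() for v in params.values()):
            raise ValueError("parameters must be alphanumeric")
        template = self.catalog[template_id]
        # 2. Instantiate the query; a missing parameter raises KeyError.
        query = template.sparql.substitute(params)
        # 3. Dispatch only to endpoints whose policy admits this disclosure level.
        results = {}
        for endpoint in self.endpoints:
            required = DISCLOSURE_ORDER[template.disclosure]
            allowed = DISCLOSURE_ORDER[endpoint.max_disclosure]
            if required <= allowed:
                results[endpoint.name] = endpoint.execute(query)
        return results


# Hypothetical catalog entries; the hero: vocabulary is illustrative only.
COUNT_BY_DIAGNOSIS = QueryTemplate(
    "count_by_diagnosis",
    Template('SELECT (COUNT(?p) AS ?n) WHERE { ?p hero:diagnosis "$diagnosis" . }'),
    "aggregate",
)
LIST_PATIENTS = QueryTemplate(
    "list_patients",
    Template('SELECT ?p WHERE { ?p hero:diagnosis "$diagnosis" . }'),
    "record",
)
```

In this sketch an endpoint configured with `max_disclosure="aggregate"` is simply skipped for the record-level template, which mirrors the idea of enforcing disclosure constraints before a query reaches an institution. A real deployment would also need authentication, auditing, and result aggregation, none of which are modeled here.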
