Using Schema-based Metadata for Image Labels accessed with FAIR Digital Objects

descriptionPublicationkeyboard_double_arrow_right Conference object , Other literature type , Article 01 Jan 2022 Germany Publisher:ZenodoFunded by:EC | NFFA-Europe

Authors: Blumenröhr, Nicolas; Aversa, Rossella;

doi: 10.5281/zenodo.7243870 , 10.5445/ir/1000155009 , 10.5281/zenodo.7243871

Using Schema-based Metadata for Image Labels accessed with FAIR Digital Objects

- Summary
- Subjects
- Metrics

Abstract

Scientific image data sets can be continuously enriched by labels describing new features which are relevant for some specific task. This process can be automated by means of Machine Learning (ML) techniques. Although such an approach shows clear advantages, especially when it is applied to large datasets, it also poses an important challenge: Relabeling image data sets curated by different scientists, in order to collectively use them for ML, requires a common agreement on the labels which can be used. This can be achieved thanks to the use of a standardized way to describe the label information: a metadata schema including vocabularies. Furthermore, machine-actionable decisions on the label information for relabeling can be enabled by the representation of images and schema-based metadata as FAIR Digital Objects (DOs). We introduce a metadata schema including vocabularies to describe ML image data represented as FAIR DOs that can be accessed for relabeling. The specifications of the metadata schema are presented. The relevance of a standardized metadata description including vocabularies for relabeling ML image data is emphasized. It is shown how the metadata is accessed with FAIR DOs and how vocabularies support automated relabeling. This contribution supplements the content of “FAIR DO Application Case for Composing Machine Learning Training Data” with a focus on the semantic aspects for relabeling. This work has been supported by the research program ‘Engineering Digital Futures’ of the Helmholtz Association of German Research Centers and the Helmholtz Metadata Collaboration Platform. This project has received funding from the ‘European Union’s Horizon 2020‘ research and innovation program under grant agreement No. 101007417 within the framework of the ‘NFFA-Europe Pilot‘ (NEP) Joint Activities.

Country

Germany

Related Organizations

Karlsruhe Institute of Technology
Germany

Keywords

ddc:004, Machine Learning, 020, Metadata, DATA processing & computer science, Schemas, info:eu-repo/classification/ddc/004, FAIR Digital Objects, 004, Vocabularies

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average