Downloads provided by UsageCounts
Scientific image data sets can be continuously enriched by labels describing new features which are relevant for some specific task. This process can be automated by means of Machine Learning (ML) techniques. Although such an approach shows clear advantages, especially when it is applied to large datasets, it also poses an important challenge: Relabeling image data sets curated by different scientists, in order to collectively use them for ML, requires a common agreement on the labels which can be used. This can be achieved thanks to the use of a standardized way to describe the label information: a metadata schema including vocabularies. Furthermore, machine-actionable decisions on the label information for relabeling can be enabled by the representation of images and schema-based metadata as FAIR Digital Objects (DOs). We introduce a metadata schema including vocabularies to describe ML image data represented as FAIR DOs that can be accessed for relabeling. The specifications of the metadata schema are presented. The relevance of a standardized metadata description including vocabularies for relabeling ML image data is emphasized. It is shown how the metadata is accessed with FAIR DOs and how vocabularies support automated relabeling. This contribution supplements the content of “FAIR DO Application Case for Composing Machine Learning Training Data” with a focus on the semantic aspects for relabeling. This work has been supported by the research program ‘Engineering Digital Futures’ of the Helmholtz Association of German Research Centers and the Helmholtz Metadata Collaboration Platform. This project has received funding from the ‘European Union’s Horizon 2020‘ research and innovation program under grant agreement No. 101007417 within the framework of the ‘NFFA-Europe Pilot‘ (NEP) Joint Activities.
ddc:004, Machine Learning, 020, Metadata, DATA processing & computer science, Schemas, info:eu-repo/classification/ddc/004, FAIR Digital Objects, 004, Vocabularies
ddc:004, Machine Learning, 020, Metadata, DATA processing & computer science, Schemas, info:eu-repo/classification/ddc/004, FAIR Digital Objects, 004, Vocabularies
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 10 | |
| downloads | 8 |

Views provided by UsageCounts
Downloads provided by UsageCounts