
The Croissant metadata format simplifies how data is used by ML models and provides a vocabulary for dataset attributes, streamlining how data is loaded across ML frameworks, including specification on Responsible AI and reproducibility. Croissant format export is going to be added in the Dataverse data repository and available for the whole Dataverse network. In this presentation, we will explore the innovation path for the rapid evolution of the Croissant standard. We will demonstrate how to implement all changes without modifying the source code, utilizing a novel approach known as FAIR semantic mappings. There is a significant intersection between Croissant and the DDI and DDI-CDI standards, which are commonly used in the Social Sciences and Humanities, and already supported by Dataverse and other data repositories. In this presentation, we'll explore how these standards can assist the Machine Learning community in mitigating bias in data."
machine learning, croissant standard, ddi, data repositories, croissant ml, cdi, dataverse
machine learning, croissant standard, ddi, data repositories, croissant ml, cdi, dataverse
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
