
Most scientific disciplines use research software to address relevant research questions, and in many cases this research software is tailored to the specific research questions. Thus, research software is defined as software created during the research process or for a research purpose [1]. Within the German National Research Infrastructure (NFDI), research software plays two important roles. First, the services and tools developed by the consortia can be considered research software. These are mostly related to infrastructure software as defined by Hasselbring et al., but not exclusively [2]. Second, research software that supports a research project is considered an important research artifact itself. Several consortia of the NFDI are consequently working to provide infrastructure for these artifacts. Their significance is evident in their crucial function of research software, for example, in natural sciences for climate and meteorological modeling, in social sciences for network analysis and sentiment analysis, in digital humanities for text mining and computational linguistics, and in engineering sciences for simulating complex systems. Whether being a service/tool or a research artifact, provenance and rich descriptions of research software are essential to improve scientific transparency, findability, citability, reproducibility, software security, and reusability. Therefore, to improve the FAIRness of research software [3], software metadata plays an important role in the implementation of the FAIR4RS principles [4], [5]. To enable good software metadata, common metadata standards play an important role. Consequently, the working group "Research Software Metadata" of the Section Metadata, Terminology, and Provenance of the NFDI focuses on metadata for research software. The overall goal of this working group is to provide a common core metadata vocabulary for research software that can be used to describe research software created in the various research domains of the NFDI consortia. The work will create a unified framework that can be used across the different NFDI consortia generating research software artifacts. This effort also addresses key interoperability challenges, such as aligning different metadata schemas, ensuring consistency across disciplines, and integrating with existing standards such as CodeMeta [6], Citation File Format (CFF) [7] and schema.org. To gain an overview of research software metadata schemas already in use in the different consortia, we have conducted a survey on the implementation of research software metadata in the NFDI consortia [8]. We found that only a few domain-specific metadata schemas for research software were in use by different consortia. In contrast, generic metadata schemas such as CodeMeta, Bioschemas, schema.org, and the CFF were most frequently used to describe research software. Relevant metadata elements for research software include algorithms, methods, software/hardware requirements, and tasks performed by the software. We subsequently performed an in depth analysis of the alignment of the generic metadata schemas by creating crosswalks, i.e. mapping the schemas onto each other. This crosswalk analysis provides the basis for further discussions with the wider RDM community regarding strategies for provenance and rich descriptions of research software. [1] https://doi.org/10.5281/zenodo.5504016 [2] https://doi.org/10.1109/MCSE.2025.3555023 [3] https://doi.org/10.5281/zenodo.10816032 [4] https://doi.org/10.15497/RDA00068 [5] https://doi.org/10.5281/zenodo.10246218 [6] https://codemeta.github.io/terms/ [7] https://doi.org/10.5281/zenodo.5171937 [8] https://doi.org/10.5281/zenodo.12704414
ddc:004, Software Metadata, DATA processing & computer science, Research Software, info:eu-repo/classification/ddc/004, Research Software Metadata
ddc:004, Software Metadata, DATA processing & computer science, Research Software, info:eu-repo/classification/ddc/004, Research Software Metadata
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
