ZENODO
Other literature type . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Presentation . 2025
License: CC BY
Data sources: Datacite

From Questions to Data: Incorporating LLMs into RDM Software

Authors: Schlemmer, Alexander

Abstract

LinkAhead is an open source research data management platform whose core functionality rests on a powerful semantic search engine. The engine exposes a purpose-built query language called CQL that resembles English, enabling both interactive exploration through a graphical user interface and programmatic access in automated workflows. Despite its intuitive design, mastering the query syntax, particularly for complex, multi-hop traversals of LinkAhead’s graph-structured metadata (e.g., reference and link queries), remains a barrier for non-technical users and hampers broader adoption.

To lower this barrier, we have developed a dedicated query interface powered by a custom-trained Large Language Model (LLM). The model is fine-tuned exclusively on a corpus of LinkAhead query pairs, each consisting of a natural language description and the corresponding formal query. The system translates questions that users enter in natural language into LinkAhead queries, handling both simple lookups and complex multi-hop connections, so that users do not need to learn the query language.

In this presentation, we will describe the prototype’s design and present preliminary validation results that assess the model’s quality.

This event was part of the Data Days Lower Saxony 2025 - Virtual Theme Day. The event is organized by the Lower Saxony Research Data Management Initiative (FDM-NDS). FDM-NDS is a joint project under the umbrella of Hochschule.digital Niedersachsen and is funded by zukunft.niedersachsen, a funding program of the Lower Saxony Ministry of Science and Culture (MWK) and the Volkswagen Foundation.
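
The abstract describes a fine-tuning corpus of query pairs, each made of a natural language description and a formal query. As a rough illustration only, such pairs could be stored one per line as JSON. The sketch below is not the author's implementation: the `QueryPair` type, the `to_jsonl` helper, and the example query strings are all hypothetical, and the queries have not been checked against LinkAhead's actual syntax.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class QueryPair:
    """One training example (hypothetical layout, not taken from the talk)."""

    description: str  # the question in natural language
    query: str        # the formal query the model should produce


def to_jsonl(pairs):
    """Serialize pairs as JSON Lines, one pair per line.

    Raises ValueError if either field of a pair is empty or whitespace.
    """
    lines = []
    for pair in pairs:
        if not pair.description.strip() or not pair.query.strip():
            raise ValueError("description and query must both be non-empty")
        lines.append(json.dumps(asdict(pair), ensure_ascii=False))
    return "\n".join(lines)


# Illustrative pairs only; the query strings are assumptions, not verified
# LinkAhead queries.
EXAMPLE_PAIRS = [
    QueryPair(
        description="Show me all experiments.",
        query="FIND RECORD Experiment",
    ),
    QueryPair(
        description="Which samples are referenced by an experiment?",
        query="FIND RECORD Sample WHICH IS REFERENCED BY Experiment",
    ),
]

if __name__ == "__main__":
    print(to_jsonl(EXAMPLE_PAIRS))
```

A line-per-example layout like this is a common input format for fine-tuning tools; whether the prototype uses it is not stated in the abstract.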

Keywords

Data Days Niedersachsen, research data management
