
LinkAhead is an open source research data management platform whose core functionality rests on a powerful semantic search engine. The engine exposes a purpose-built query language called CQL that resembles English, enabling both interactive exploration through a graphical user interface and programmatic access in automated workflows. Despite its intuitive design, mastering the query syntax, particularly for complex, multi-hop traversals of LinkAhead’s graph-structured metadata (e.g., reference and link queries), remains a barrier for non-technical users and hampers broader adoption.To lower this barrier, we have developed a dedicated query interface powered by a custom-trained Large Language Model (LLM). The model is fine-tuned exclusively on a corpus of LinkAhead query pairs, consisting of a natural language description and a formal query. The system turns questions that users enter into LinkAhead into searches, handling both easy lookups and complex connections, without users needing to learn the query language.In this presentation, we will describe the prototype’s design and present preliminary validation results that assess the model’s quality. This event was part of the Data Days Lower Saxony 2025 - Virtual Theme Day. The event is organized by the Lower Saxony Research Data Management Initiative (FDM-NDS). FDM-NDS is a joint project under the umbrella of Hochschule.digital Niedersachsen and is funded by zukunft.niedersachsen, a funding program of the Lower Saxony Ministry of Science and Culture (MWK) and the Volkswagen Foundation.
Data Days Niedersachsen, Forschungsdatenmanagement
Data Days Niedersachsen, Forschungsdatenmanagement
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
