Taming the AI Curator: A Content Focused Data Description Diagnostic and Assistive Writing Tool

Nowacek, Zachary; Esteva, Maria; Chang, Bernard; Jaffe, Gabriel; Prodanovic, Masa

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Presentation . 2026

License: CC BY

Data sources: Datacite

ZENODO

Presentation . 2026

License: CC BY

Data sources: Datacite

Taming the AI Curator: A Content Focused Data Description Diagnostic and Assistive Writing Tool

descriptionPublicationkeyboard_double_arrow_right Presentation 17 Feb 2026Embargo end date: 17 Feb 2026 English Publisher:Zenodo

Authors: Nowacek, Zachary; Esteva, Maria; Chang, Bernard; Jaffe, Gabriel; Prodanovic, Masa;

doi: 10.5281/zenodo.18803941 , 10.5281/zenodo.18803940

Taming the AI Curator: A Content Focused Data Description Diagnostic and Assistive Writing Tool

- Summary
- Subjects
- Metrics

Abstract

We designed an AI based tool to diagnose and help users write clear, accurate, and complete data descriptions. The tool's components include best practices data description guidelines, data descriptions reviewed by experts as few-shot prompts, and chain of thought reasoning to explain the diagnostic outputs. We engineered our prompts and Large Language Model choice so that a score of 8 reflects an acceptable data description. Users can double check the evaluations and the assisted descriptions to minimize scores inconsistent with expert reviewers and hallucinated outputs. The application is crafted to match the standards of our field and to be used with guided intention.

Keywords

Paper, Few shot prompting, AI, Developing new curation tools and services, Curation challenges and opportunities from Artificial Intelligence and Machine Learning, Dataset descriptions, Innovation in curation methods, Chain of thought reasoning

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now