
This repository contains benchmark datasets (images and text), prompts, ground truths, and evaluation scripts for assessing the performance of large language models (LLMs) on humanities-related tasks. The suite is designed as a resource for researchers and practitioners interested in systematically evaluating how well various LLMs perform on digital humanities (DH) tasks involving visual and textual materials. For detailed test results and model comparisons, visit our results dashboard at https://rise-services.rise.unibas.ch/benchmarks/.
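To illustrate the general shape of such an evaluation, here is a minimal, hypothetical sketch: it scores model transcriptions against ground truths using character error rate (CER). The file name `predictions.json`, its field names, and the choice of metric are illustrative assumptions, not this repository's actual scripts or data layout.

```python
# Hypothetical evaluation sketch; file names, record layout, and the
# CER metric are assumptions for illustration, not this repo's API.
import json


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insert/delete/substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # deletion
                curr[j - 1] + 1,           # insertion
                prev[j - 1] + (ca != cb),  # substitution
            ))
        prev = curr
    return prev[-1]


def character_error_rate(prediction: str, truth: str) -> float:
    """CER = edit distance normalized by ground-truth length."""
    return levenshtein(prediction, truth) / max(len(truth), 1)


if __name__ == "__main__":
    # Assumed layout: a JSON list of objects, one per benchmark item,
    # each with "prediction" and "ground_truth" string fields.
    with open("predictions.json", encoding="utf-8") as f:
        records = json.load(f)
    scores = [character_error_rate(r["prediction"], r["ground_truth"])
              for r in records]
    print(f"mean CER over {len(scores)} items: {sum(scores) / len(scores):.3f}")
```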
If you use this software, please cite it using the metadata from this file.
Keywords: LLM, benchmark, digital humanities
