Name: Studying Model Design Biases in LLMs for Multilingual Historical Newspaper Extraction; The Messina Earthquake Case Study
Keywords: bias evaluation, [SHS.HIST] Humanities and Social Sciences/History, [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing, historical newspapers, article extraction

descriptionPublicationkeyboard_double_arrow_right Part of book or chapter of book , Conference object 15 Sep 2025 English Publisher:Springer Nature Switzerland

Authors: Sarah Oberbichler; Johanna Mauermann; The Trung Tran; Carlos-Emiliano González-Gallardo;

doi: 10.1007/978-3-032-05409-8_16

Studying Model Design Biases in LLMs for Multilingual Historical Newspaper Extraction; The Messina Earthquake Case Study

- Summary
- Subjects
- Metrics

Abstract

Large language models offer new opportunities for processing historical documents, yet their application raises questions of reliability. We present the first comprehensive and explainability-driven framework for evaluating model design bias in multilingual historical news article extraction, using newspaper coverage of the 1908 Messina earthquake as our test case across German, English, and French sources. Through systematic analysis of six state-of-the-art models, we uncover three critical bias patterns that, in addition to data quality, compromise extraction quality: contextual integration bias, overconfidence bias, and preference bias. Our evaluation reveals that these biases stem from alignment procedures rather than training data limitations, findings that establish methodological foundations for responsible AI deployment in digital humanities.

Related Organizations

Laboratory of Fundamental and Applied Computer Science of Tours
France
Leibniz Association
Germany
National Research Institute for Agriculture, Food and Environment
France
Leibniz Institute of European History
Germany
Huazhong University of Science and Technology
China (People's Republic of)

View all View all

Keywords

bias evaluation, [SHS.HIST] Humanities and Social Sciences/History, [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing, historical newspapers, article extraction

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Related to Research communities

INRAE