Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Presentation . 2025
License: CC BY
Data sources: Datacite
ZENODO
Presentation . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Introducing Corpora of Judgments for the German Federal Court of Justice (Bundesgerichtshof) in Criminal Matters (1950-1999, 2020–2024) and Civil Matters (2000-2024) [ESELS Conference Toulouse 2025]

Authors: Fobbe, Seán; Swalve, Tilko;

Introducing Corpora of Judgments for the German Federal Court of Justice (Bundesgerichtshof) in Criminal Matters (1950-1999, 2020–2024) and Civil Matters (2000-2024) [ESELS Conference Toulouse 2025]

Abstract

Abstract The Bundesgerichtshof (BGH), the German Federal Court of Justice, is the highest court of appeal in civil and criminal matters in Germany. BGH judgments receive regular and intense scrutiny, but German doctrinal scholars routinely focus only on a single decision or a small set of decisions. Quantitative approaches are often infeasible because of lack of data. We present two original datasets on the German Federal Court of Justice: 1) a corpus of all 77,892 judgments in civil and criminal matters for the period 2000-2024 with 36 variables plus the internal citation network and 2) a corpus of 36,316 judgments in criminal matters for the period 1950-1999 with 31 variables. We use state-of-the-art data engineering techniques to construct reproducible extract-transform-load (ETL) pipelines that automatically acquire, clean, test, analyze and document German Federal Court of Justice data from the official BGH database and an older criminal justice dataset provided to the authors. Citations are extracted from judgment full texts, analyzed and provided in GraphML format. Despite centuries of intense academic engagement with the law, quantitative and empirical approaches remain exceedingly rare in Germany. Lack of data, lack of training, lack of awareness and many other reasons have caused this situation. Solving the data problem is a first step to empowering German lawyers in the use of quantitative methods and may help with an empirical turn. Context The presentation was delivered by Seán Fobbe at the ESELS Conference Toulouse 2025 in Panel 2.2 on 19 June 2025. About the Speaker Seán Fobbe is a researcher at Ludwig-Maximilian-University of Munich (Germany). He specializes in international human rights law, humanitarian law and international criminal law, with a particular emphasis on the protection of cultural heritage in Iraq and the prosecution of atrocity crimes committed by ISIS/ISIL. As a data scientist, his interests lie in natural language processing (NLP), quantitative peace research, data engineering and statistical computing with the R programming language. Personal Website: www.seanfobbe.com

Keywords

Data Engineering, Criminal Law, Civil Law, Bundesgerichtshof, Network Analysis

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Green