
This article is a survey of methods for measuring agreement among corpus annotators. It exposes the mathematics and underlying assumptions of agreement coefficients, covering Krippendorff's alpha as well as Scott's pi and Cohen's kappa; discusses the use of coefficients in several annotation tasks; and argues that weighted, alpha-like coefficients, traditionally less used than kappa-like measures in computational linguistics, may be more appropriate for many corpus annotation tasks—but that their use makes the interpretation of the value of the coefficient even harder.
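As a rough illustration of the chance-corrected coefficients the survey covers (not code from the article itself), the sketch below computes observed agreement, Cohen's kappa, and Scott's pi for two annotators over nominal labels. The function names and toy labels are hypothetical; Krippendorff's alpha and the weighted coefficients discussed in the article additionally scale disagreements by a distance between categories, which is not shown here.

```python
"""Minimal sketch of two chance-corrected agreement coefficients
for two annotators and nominal categories (illustrative only)."""

from collections import Counter


def observed_agreement(a, b):
    """Proportion of items on which the two annotators assign the same label."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohen_kappa(a, b):
    """Cohen's kappa: expected agreement from each annotator's own label distribution."""
    n = len(a)
    pa, pb = Counter(a), Counter(b)
    a_e = sum((pa[k] / n) * (pb[k] / n) for k in set(a) | set(b))
    a_o = observed_agreement(a, b)
    return (a_o - a_e) / (1 - a_e)


def scott_pi(a, b):
    """Scott's pi: expected agreement from the pooled label distribution."""
    n = len(a)
    pooled = Counter(a) + Counter(b)
    a_e = sum((c / (2 * n)) ** 2 for c in pooled.values())
    a_o = observed_agreement(a, b)
    return (a_o - a_e) / (1 - a_e)


# Hypothetical toy annotations, for illustration only.
ann1 = ["stat", "stat", "ireq", "chck", "stat", "ireq"]
ann2 = ["stat", "ireq", "ireq", "chck", "stat", "stat"]
print(f"observed: {observed_agreement(ann1, ann2):.3f}")
print(f"kappa:    {cohen_kappa(ann1, ann2):.3f}")
print(f"pi:       {scott_pi(ann1, ann2):.3f}")
```

The only difference between the two coefficients is how expected agreement is estimated: kappa uses each annotator's individual label distribution, while pi (like alpha) pools the annotators into a single distribution.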
Subjects: QA75 Electronic computers. Computer science; P Philology. Linguistics; 410; 400
| Indicator | Description | Value |
|---|---|---|
| Selected citations | Citations derived from selected sources; an alternative to the "Influence" indicator, which reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 656 |
| Popularity | Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 0.1% |
| Influence | Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 0.1% |
| Impulse | Reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
