research data . Dataset . 2020

LatinISE subcorpora for SemEval 2020 task 1

McGillivray, Barbara; Schlechtweg, Dominik; Dubossarsky, Haim; Tahmasebi, Nina; Hengchen, Simon;
Open Access Latin
  • Published: 18 Feb 2020
  • Publisher: Zenodo
Abstract
<p>This data collection contains the Latin test data for <a href="https://competitions.codalab.org/competitions/20948">SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection</a>]:&nbsp;</p> <ul> <li>a Latin text corpus pair (`corpus1/lemma`, `corpus2/lemma`)</li> <li>40 lemmas which have been annotated for their lexical semantic change between the two corpora (`targets.txt`)</li> <li>the annotated binary change scores of the targets for subtask 1, and their annotated graded change scores for subtask 2 (`truth/`)</li> </ul> <p>The corpus data have been automatically lemmatized and part-of-speech tagged, and have been partially corrected by hand. For ...
Subjects
free text keywords: Latin, corpus
Download fromView all 6 versions
Zenodo
Dataset . 2020
Provider: Datacite
Zenodo
Dataset . 2020
Provider: Zenodo
Zenodo
Dataset . 2020
Provider: Datacite
Zenodo
Dataset . 2020
Provider: Zenodo
Any information missing or wrong?Report an Issue