research data . Dataset . 2020

Temporally-Informed Analysis of Named Entity Recognition

Rijhwani, Shruti; Preoțiuc-Pietro, Daniel;
Open Access English
  • Published: 17 Jun 2020
  • Publisher: Zenodo
Abstract
<p>This repository contains the data set developed for the paper:</p> <p>&ldquo;Shruti Rijhwani and Daniel Preoțiuc-Pietro. <em>Temporally-Informed Analysis of Named Entity Recognition.</em> In Proceedings of the Association for Computational Linguistics (ACL). 2020.&rdquo;</p> <p>It includes 12,000 tweets annotated for the named entity recognition task. The tweets are uniformly distributed over the years 2014-2019, with 2,000 tweets from each year. The goal is to have a temporally diverse corpus to account for data drift over time when building NER models.</p> <p>The entity types annotated are locations (LOC), persons (PER) and organizations (ORG). The tweets a...
Subjects
free text keywords: named entity recognition, twitter, ner, twitter ner, tweets, temporal analysis, information extraction
Download fromView all 3 versions
Zenodo
Dataset . 2020
Provider: Datacite
Zenodo
Dataset . 2020
Provider: Zenodo
Zenodo
Dataset . 2020
Provider: Datacite
Zenodo
Dataset . 2020
Provider: Datacite
Any information missing or wrong?Report an Issue