Webis Text Reuse Corpus 2012

Dataset en OPEN
Potthast, Martin; Hagen, Matthias; Völske, Michael; Gomoll, Jakob; Stein, Benno;
(2012)
  • Publisher: Zenodo
  • Related identifiers: doi: 10.5281/zenodo.1341602
  • Subject: Inorganic Chemistry | representative environment | 150 topics | Sociology | document | Microbiology | corpus | Medicine | TREC | source | interaction logs | Space Science | Ecology | Genetics | Evolutionary Biology | crowdsourcing platform oDesk | Cancer | evaluation efforts | Webis Text Reuse Corpus 2012
    • FOR: 80699 Information Systems not elsewhere classified
    acm: ComputingMethodologies_DOCUMENTANDTEXTPROCESSING

<p>The Webis Text Reuse Corpus 2012 (Webis-TRC-12) compiles manually written documents obtained from a completely controlled, yet representative environment that emulates the web. Each document in the corpus is about one of the 150 topics used at the TREC Web Tracks 200... View more
Share - Bookmark