research data . Dataset . 2020

args.me corpus

Ajjour, Yamen; Wachsmuth, Henning; Kiesel, Johannes; Potthast, Martin; Hagen, Matthias; Stein, Benno;
Open Access English
  • Published: 27 Oct 2020
  • Publisher: Zenodo
Abstract
The args.me corpus comprises 387 740 arguments. They are crawled from the debate portals Debatewise (14 353 arguments), IDebate.org (13 522 arguments), Debatepedia (21 197 arguments), and Debate.org (338 620 arguments). Moreover, the corpus contains 48 arguments from Canadian Parliament discussions. The arguments are extracted using heuristics that are designed for each debate portal. These arguments are the ones currently provided through the args.me search engine. Note that the args API does not return the sourceText (which is indexed by args.me an included in this dataset) due to its size. Cite args.me as Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, ...
Subjects
free text keywords: Computational Argumentation, Argument Search, Information Retrieval, Natural Language Processing
Any information missing or wrong?Report an Issue