research data . Dataset . 2020 corpus

Ajjour, Yamen; Wachsmuth, Henning; Kiesel, Johannes; Potthast, Martin; Hagen, Matthias; Stein, Benno;
Open Access English
  • Published: 27 Oct 2020
  • Publisher: Zenodo
The corpus comprises 387 740 arguments. They are crawled from the debate portals Debatewise (14 353 arguments), (13 522 arguments), Debatepedia (21 197 arguments), and (338 620 arguments). Moreover, the corpus contains 48 arguments from Canadian Parliament discussions. The arguments are extracted using heuristics that are designed for each debate portal. These arguments are the ones currently provided through the search engine. Note that the args API does not return the sourceText (which is indexed by an included in this dataset) due to its size. Cite as Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, ...
free text keywords: Computational Argumentation, Argument Search, Information Retrieval, Natural Language Processing
Any information missing or wrong?Report an Issue