Dataset for generating TL;DR

Dataset en OPEN
Syed, Shahbaz; Voelske, Michael; Potthast, Martin; Stein, Benno;
  • Publisher: Zenodo
  • Related identifiers: doi: 10.5281/zenodo.1168855
  • Subject: tl;dr challenge | abstractive summarization | social media | user-generated content

<p>This is the dataset for the TL;DR challenge containing posts&nbsp;from the Reddit corpus, suitable for abstractive summarization using deep learning. The format is a json file where each line is a JSON object representing a post. The schema of each post is shown belo... View more
Share - Bookmark

  • Download from
    Zenodo via Zenodo (Dataset, 2018)
  • Cite this research data