research data . Dataset . 2020

Dataset: Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board

Papasavva, Antonis; Zannettou, Savvas; Cristofaro, Emiliano De; Stringhini, Gianluca; Blackburn, Jeremy;
Open Access
  • Published: 13 Jan 2020
  • Publisher: Zenodo
Abstract
<p>This is the dataset released with the <a href="https://arxiv.org/abs/2001.07487">paper</a> titled:&nbsp;&quot;Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board&quot;.</p> <p>The dataset is a single&nbsp;<a href="http://ndjson.org/">Newline delimited JSON file</a>. Each line in the file consists of a JSON object which is a full 4chan /pol/ thread.&nbsp;The JSON objects contain&nbsp;all the&nbsp;key/values returned by the <a href="https://github.com/4chan/4chan-API/blob/master/pages/Threads.md">4chan API</a>, along with three additional keys&nbsp;(<em>entities,&nbsp;perspectives</em>, and <em>extracted_poster_id</e...
Subjects
free text keywords: Biochemistry, Microbiology, Genetics, Neuroscience, Evolutionary Biology, 111714 Mental Health, 110309 Infectious Diseases, Plant Biology, 60506 Virology, 80699 Information Systems not elsewhere classified, entity, nbsp, score, spaCy Python library, 3.5 Years, post, 4 chan API, dataset, JSON, Incorrect, Lost Kek, Emiliano De Cristofaro, object, Augmented 4 chan, Mental Health, Infectious Diseases, Virology, Information Systems not elsewhere classified
Download fromView all 6 versions
Zenodo
Dataset . 2020
Provider: Datacite
figshare
Dataset . 2020
Provider: figshare
Zenodo
Dataset . 2020
Provider: Zenodo
Zenodo
Dataset . 2020
Provider: Datacite
Any information missing or wrong?Report an Issue