research data . Dataset . 2016

Palmetto position storing Lucene index of Dutch Wikipedia

van der Zwaan, Janneke M.; Marx, Maarten; Kamps, Jaap;
Open Access
  • Published: 22 Feb 2016
  • Publisher: Zenodo
Abstract
<p>Dutch language resource for calculating topic coherence with Palmetto [1, 2]. The dataset is a position storing Lucene index of the Dutch Wikipedia [3]. It was created in the context of the Netherlands eScience Center Dilipad project [4]. The pdf file contains the results of a case study that shows best topic coherence measure for topics consisting of Dutch nouns is NPMI.</p> <p>More details can be found in the README.</p> <p>[1] M. Roeder, A. Both, and A. Hinneburg. Exploring the space of topic coherence measures. In <em>Proceedings of the Eighth ACM International Conference on Web Search and Data Mining</em>, pages 399&ndash;408, 2015.</p> <p>[2] http://aks...
Subjects
free text keywords: topic modeling, topic coherence, Palmetto, Dutch, Wikipedia
Download fromView all 2 versions
Zenodo
Dataset . 2016
Provider: Zenodo
Zenodo
Dataset . 2016
Provider: Datacite
Powered by OpenAIRE Research Graph
Any information missing or wrong?Report an Issue