Powered by OpenAIRE graph
ZENODO

PrevDistro - Preverb Distributions in Hungarian

Authors: Kalivoda, Ágnes

Abstract

PrevDistro (Preverb Distributions) is an open-source dataset containing 41.5 million corpus occurrences of 49 preverb-verb construction types. It consists of the following columns:

1 sid: ID
2 constype: construction type
3 subtype: construction subtype
4 prevpos: preverb position
5 prev: preverb
6 verb: verb lemma
7 intervening: intervening words (as lemmas)
8 actform: actual form (same content as column 10, kwic, but lowercased)
9 left: left context
10 kwic: keyword in context
11 right: right context
12 docid: document ID from the Hungarian Gigaword Corpus
13 title: document title
14 style: document style (e.g. official, press, ...)
15 region: document region (e.g. Transylvania, Subcarpathia, ...)
16 year: year of publication (a single document may list several years)

The first row is the header. If a cell's value is unspecified, it is marked with an underscore (_).
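
The column layout above can be read with a few lines of Python. This is a minimal sketch, not the author's tooling: it assumes the files are tab-separated (the description does not name the delimiter), and the sample row is invented purely for illustration. The helper name `read_rows` is hypothetical.

```python
import csv
import io

# Column names as listed in the dataset description.
COLUMNS = [
    "sid", "constype", "subtype", "prevpos", "prev", "verb",
    "intervening", "actform", "left", "kwic", "right",
    "docid", "title", "style", "region", "year",
]


def read_rows(handle, delimiter="\t"):
    """Yield one dict per data row, mapping "_" (unspecified) to None.

    The tab delimiter is an assumption; pass a different one if the
    distributed files use another separator.
    """
    # The first row is the header, so DictReader picks up the column names.
    # QUOTE_NONE keeps quotation marks in corpus text as ordinary characters.
    reader = csv.DictReader(handle, delimiter=delimiter, quoting=csv.QUOTE_NONE)
    for row in reader:
        yield {key: (None if value == "_" else value) for key, value in row.items()}


# Invented sample: the header plus one made-up data row (not real corpus data).
sample_row = [
    "1", "_", "_", "_", "meg", "csinál", "_", "megcsinálta",
    "Péter", "Megcsinálta", "a feladatot", "doc-0", "_", "press", "_", "2010",
]
sample = "\t".join(COLUMNS) + "\n" + "\t".join(sample_row) + "\n"

rows = list(read_rows(io.StringIO(sample)))
```

For a real file, the same generator can be fed an open file handle (for example `open(path, encoding="utf-8", newline="")`, where `path` is wherever the download was saved), which keeps memory use flat even with tens of millions of rows.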

PrevDistro 1.0.0 (deprecated) can be found at https://science-data.hu/dataset.xhtml?persistentId=doi:10.5072/FK2/TRSD50
In PrevDistro 2.0.0, several new columns were added and the existing data received some fixes.

Keywords

construction, preverb, linguistics, verbal prefix, verbal particle, Hungarian, preverb constructions

  • OpenAIRE UsageCounts: 14 views, 8 downloads