2 Research products, page 1 of 1
Loading
- Publication . Article . 2021Open Access EstonianAuthors:Ene Vainik; Geda Paulsen; Ahti Lohk;Ene Vainik; Geda Paulsen; Ahti Lohk;
doi: 10.5128/erya17.16
Publisher: Uppsala universitet, Finsk-ugriska språkCountry: SwedenThis study is motivated by the need for a statistical benchmark that would help the lexicographer to judge a morphological form for its grammaticalization stage to the degree of an independent lexeme. The focus of this article is on Estonian substantives and in particular their forms in the 11 semantic cases. The main research question of this study is: is there a statistical sign indicating that a case form of a noun is emerging as a potentially independent lexeme? Based on the normal distribution of nominal case form frequencies, we established a statistic that determines a case form’s elicitation in a corpus – the distribution index (D-index). The D-index can be used as an indicator of the correspondence of a particular form’s actual frequency with the predicted elicitation degree. Artikkel tegeleb nimisõnavormide iseseisvumise küsimusega leksikograafia vajadustest lähtudes. Eeldusel, et abstraktseid käändevorme iseloomustab korpuses üldine püsiv esiletuleku proportsioon, pakume välja statistilise mõõdiku – distributsiooniindeksi, mille abil otsustada, kas sõnavormi kasutussagedus on piisav selleks, et lugeda teda paradigmast emantsipeerunuks ning seega iseseisva märksõna kandidaadiks. Indeks arvestab vormi suhtelist sagedust korpuses, võrreldes tegelikku ja normi põhjal oodatavat kasutussagedust, ning laseb samale skaalale paigutada väga erineva absoluutsagedusega juhtumeid. Artiklis illustreerime distributsiooniindeksi toimivust tavaliste rikkaliku vormistikuga nimisõnade ning ambivorme andvate sõnade vormistike võrdlusena. Seame provisoorse indeksi lävendväärtuse, millest suurema väärtusega vormi võib pidada iseseisvaks lekseemiks. Indeksit ning lävendväärtust testitakse erinevate korpuste (EtTenTen13, ÜK 2019) andmete peal.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Publication . Article . 2020Open Access EstonianAuthors:Lindström, Kati;Lindström, Kati;Publisher: KTH, Historiska studier av teknik, vetenskap och miljöCountry: Sweden
QC 20200415
2 Research products, page 1 of 1
Loading
- Publication . Article . 2021Open Access EstonianAuthors:Ene Vainik; Geda Paulsen; Ahti Lohk;Ene Vainik; Geda Paulsen; Ahti Lohk;
doi: 10.5128/erya17.16
Publisher: Uppsala universitet, Finsk-ugriska språkCountry: SwedenThis study is motivated by the need for a statistical benchmark that would help the lexicographer to judge a morphological form for its grammaticalization stage to the degree of an independent lexeme. The focus of this article is on Estonian substantives and in particular their forms in the 11 semantic cases. The main research question of this study is: is there a statistical sign indicating that a case form of a noun is emerging as a potentially independent lexeme? Based on the normal distribution of nominal case form frequencies, we established a statistic that determines a case form’s elicitation in a corpus – the distribution index (D-index). The D-index can be used as an indicator of the correspondence of a particular form’s actual frequency with the predicted elicitation degree. Artikkel tegeleb nimisõnavormide iseseisvumise küsimusega leksikograafia vajadustest lähtudes. Eeldusel, et abstraktseid käändevorme iseloomustab korpuses üldine püsiv esiletuleku proportsioon, pakume välja statistilise mõõdiku – distributsiooniindeksi, mille abil otsustada, kas sõnavormi kasutussagedus on piisav selleks, et lugeda teda paradigmast emantsipeerunuks ning seega iseseisva märksõna kandidaadiks. Indeks arvestab vormi suhtelist sagedust korpuses, võrreldes tegelikku ja normi põhjal oodatavat kasutussagedust, ning laseb samale skaalale paigutada väga erineva absoluutsagedusega juhtumeid. Artiklis illustreerime distributsiooniindeksi toimivust tavaliste rikkaliku vormistikuga nimisõnade ning ambivorme andvate sõnade vormistike võrdlusena. Seame provisoorse indeksi lävendväärtuse, millest suurema väärtusega vormi võib pidada iseseisvaks lekseemiks. Indeksit ning lävendväärtust testitakse erinevate korpuste (EtTenTen13, ÜK 2019) andmete peal.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Publication . Article . 2020Open Access EstonianAuthors:Lindström, Kati;Lindström, Kati;Publisher: KTH, Historiska studier av teknik, vetenskap och miljöCountry: Sweden
QC 20200415