publication . Conference object . 2018

Guessing lexicon entries using finite-state methods

Kimmo Koskenniemi;
Open Access English
  • Published: 01 Jan 2018
  • Publisher: The Association for Computational Linguistics
  • Country: Finland
Abstract
A practical method for interactive guessing of LEXC lexicon entries is presented. The method is based on describing groups of similarly inflected words using regular expressions. The patterns are compiled into a finite-state transducer (FST) which maps any word form into the possible LEXC lexicon entries which could generate it. The same FST can be used (1) for converting conventional headword lists into LEXC entries, (2) for interactive guessing of entries, (3) for corpus-assisted interactive guessing and (4) guessing entries from corpora. A method of representing affixes as a table is presented as well how the tables can be converted into LEXC format for sever...
Subjects
free text keywords: 6121 Languages, computational linguistics, language technology, finite-state methods, lexicon, 113 Computer and information sciences, natural language processing
Related Organizations
Communities
Digital Humanities and Cultural Heritage
Powered by OpenAIRE Research Graph
Any information missing or wrong?Report an Issue