publication . Article . 2015

Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora

Karlheinz Mörth; Laurent Romary; Gerhard Budin; Daniel Schopper;
Open Access English
  • Published: 01 Dec 2015
  • Publisher: HAL CCSD
  • Country: France
Abstract
International audience; Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language resources that are needed by dictionaries and dictionary tools. In particular this holds true for the crucial role that statistical data obtained from language resources play in lexicographic workflow—a role that also has to be reflected in the model of the data produced in these workflows. Presenting a range of current projects, the authors addre...
Persistent Identifiers
Subjects
free text keywords: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], lexicography, language resources, digital corpora, statistics, Interoperability, Lexicography, Natural language processing, computer.software_genre, computer, Frequency data, Computer science, Personalization, Workflow, Data science, Writing system, Lexicographical order, Artificial intelligence, business.industry, business
Funded by
FWF| Arabic in the Middle Atlas Mountains (Morocco)
Project
  • Funder: Austrian Science Fund (FWF) (FWF)
  • Project Code: P 21722
  • Funding stream: Einzelprojekte
Communities
DARIAH EU
Digital Humanities and Cultural Heritage
Any information missing or wrong?Report an Issue