Powered by OpenAIRE graph
Found an issue? Give us feedback

Создание Национального корпуса чувашского языка: проблемы и перспективы

Создание Национального корпуса чувашского языка: проблемы и перспективы

Abstract

In the paper is analyzed the problem of creating the National corpora of Chuvash language and the problems and perspectives linked with it. The national linguistic corporas include large arrays of electronic text of different genres and styles, which gives the possibility to investigate comprehensively and fully different language phenomenas. While lacking necessary financement is proposed not to seek the creation of a full database of Chuvash texts but to make a representative selection. Was composed a shortlist of computer software, necessary for the work with this textual database, were considered questions of elaboration of a tagging system of the corpora, as well as the provision of multiuser access through the Internet. Were also considered question of security. Was noted that the best strategy would be the use of separate server.

В статье рассматривается задача создания Национального корпуса чувашского языка и связанные с ней проблемы и перспективы. Национальные языковые корпуса включают в себя большие массивы электронных текстов разных жанров и стилей, что дает возможность всесторонне и полно исследовать различные языковые явления. В отсутствии необходимого финансирования предлагается не добиваться создания полной текстовой базы чувашских текстов, а сделать репрезентативную выборку. Составлен минимальный список компьютерных программ, необходимых для работы с этой текстовой базой данных, рассмотрены вопросы разработки разметки для корпуса, а также обеспечения многопользовательского доступа через Интернет. Также рассмотрены вопросы безопасности. Отмечено, что наиболее безопасным будет использование отдельного сервера.

Keywords

МНОГОПОЛЬЗОВАТЕЛЬСКИЙ ДОСТУП., ЭКСТРАЛИНГВИСТИЧЕСКАЯ И ЛИНГВИСТИЧЕСКАЯ РАЗМЕТКА, МАШИННЫЙ ФОНД, ЛИНГВИСТИЧЕСКИЙ КОРПУС, MULTIUSER ACCESS.

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
gold