<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

Neural Named Entity Recognition for Kazakh

descriptionPublicationkeyboard_double_arrow_right Part of book or chapter of book , Article , Preprint 01 Jan 2023Embargo end date: 01 Jan 2020Publisher:Springer Nature Switzerland

Authors: Tolegen, Gulmira; Toleu, Alymzhan; Mamyrbayev, Orken; Mussabayev, Rustam;

doi: 10.1007/978-3-031-24340-0_1 , 10.48550/arxiv.2007.13626

arXiv: http://arxiv.org/abs/2007.13626

Neural Named Entity Recognition for Kazakh

- Summary
- Subjects
- Metrics

Abstract

We present several neural networks to address the task of named entity recognition for morphologically complex languages (MCL). Kazakh is a morphologically complex language in which each root/stem can produce hundreds or thousands of variant word forms. This nature of the language could lead to a serious data sparsity problem, which may prevent the deep learning models from being well trained for under-resourced MCLs. In order to model the MCLs' words effectively, we introduce root and entity tag embedding plus tensor layer to the neural networks. The effects of those are significant for improving NER model performance of MCLs. The proposed models outperform state-of-the-art including character-based approaches, and can be potentially applied to other morphologically complex languages.

Related Organizations

Institute of Information and Computational Technologies
Kazakhstan

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL), Information Retrieval (cs.IR), Computer Science - Information Retrieval

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Top 10%

Average

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering