Application of deep neural networks for automatic irony detection in Russian texts

descriptionPublicationkeyboard_double_arrow_right Article 28 Mar 2024Publisher:P.G. Demidov Yaroslavl State UniversityJournal:Modeling and Analysis of Information Systems, volume 31, pages 90-101 (issn: 1818-1015, eissn: 2313-5417,

Copyright policy )

Authors: Maksim A. Kosterin; Ilya V. Paramonov;

doi: 10.18255/1818-1015-2024-1-90-101

Application of deep neural networks for automatic irony detection in Russian texts

- Summary
- Subjects
- Metrics

Abstract

The paper examines automatic methods for classifying Russian-language sentences into two classes: ironic and non-ironic. The discussed methods can be divided into three categories: classifiers based on language model embeddings, classifiers using sentiment information, and classifiers with embeddings trained to detect irony. The components of classifiers are neural networks such as BERT, RoBERTa, BiLSTM, CNN, as well as an attention mechanism and fully connected layers. The irony detection experiments were carried out using two corpora of Russian sentences: the first corpus is composed of journalistic texts from the OpenCorpora open corpus, the second corpus is an extension of the first one and is supplemented with ironic sentences from the Wiktionary resource. The best results were demonstrated by a group of classifiers based on embeddings of language models with the maximum F-measure of 0.84, achieved by a combination of RoBERTa, BiLSTM, an attention mechanism and a pair of fully connected layers in experiments on the extended corpus. In general, using the extended corpus produced results that were 2–5% higher than those of the basic corpus. The achieved results are the best for the problem under consideration in the case of the Russian language and are comparable to the best one for English.

Related Organizations

Yaroslavl State University
Russian Federation

Keywords

neural network-based classifier, irony detection, deep learning, Information technology, natural language processing, T58.5-58.64, sarcasm detection, bert

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

gold

Fields of Science (4) View all

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

View all

Related to Research communities

Digital Humanities and Cultural Heritage