Arabic Offensive Language Classification: Leveraging Transformer, LSTM, and SVM

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 14 Dec 2023Publisher:IEEEJournal:2023 IEEE International Conference on Machine Learning and Applied Network Technologies (ICMLANT)

Authors: Rasheed, Areeg Fahad; Zarkoosh, M.; Abbas, Safa; Sabah Al-Azzawi, Sana;

doi: 10.1109/icmlant59547.2023.10372866

Arabic Offensive Language Classification: Leveraging Transformer, LSTM, and SVM

- Summary
- Subjects
- Metrics

Abstract

Social media platforms have become indispensable parts of our lives, offering avenues to share news, thoughts, and updates, as well as connect with new friends and explore various fields of knowledge. However, these platforms also harbor challenges, as they can inadvertently propagate hate speech and offensive content. Arabic, being the sixth most spoken language globally and widely used in over 22 countries, requires special attention to control and prevent the spread of hate speech. The core objective of this study is to develop an improved Arabic model for classifying offensive content, achieved by merging multiple Arabic hate and offensive datasets, including Iraqi offensive samples. Three distinct strategies were used: support vector machine (SVM), long short-term memory (LSTM), and the AraBERT transformer model. The used models were evaluated using recall, precision, F1-score, and accuracy metrics. Notably, the transformer model consistently outperformed the others across all metrics, showcasing its superior performance. Moreover, each dataset underwent assessment using the three models, consistently revealing the transformer's heightened efficiency.

Related Organizations

Keywords

Transformer, LSTM and SVM NLP Machine learning Transformer SVM offensive language, SVM, Machine learning, offensive language, and SVM NLP, [INFO] Computer Science [cs], LSTM

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Beta

SDGs Suggest

10. No inequality

Beta

SDGs:

10. No inequality,

Related to Research communities

Knowmad Institut

UArctic