Weakly Supervised Deep Learning for Arabic Tweet Sentiment Analysis on Education Reforms: Leveraging Pre-Trained Models and LLMs With Snorkel

Name: Weakly Supervised Deep Learning for Arabic Tweet Sentiment Analysis on Education Reforms: Leveraging Pre-Trained Models and LLMs With Snorkel
Keywords: AraBERT, large language models, Electrical engineering. Electronics. Nuclear engineering, natural language processing, weakly training data, social media data, weak supervision, TK1-9971

Alanoud Alotaibi; Farrukh Nadeem; Mohamed Hamdy

Found an issue? Give us feedback

IEEE Accessarrow_drop_down

IEEE Access

Article . 2025 . Peer-reviewed

License: CC BY

Data sources: Crossref

IEEE Access

Article . 2025

Data sources: DOAJ

Weakly Supervised Deep Learning for Arabic Tweet Sentiment Analysis on Education Reforms: Leveraging Pre-Trained Models and LLMs With Snorkel

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2025Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Access, volume 13, pages 30,523-30,542 (eissn: 2169-3536,

Copyright policy )

Authors: Alanoud Alotaibi; Farrukh Nadeem; Mohamed Hamdy;

doi: 10.1109/access.2025.3541154

Weakly Supervised Deep Learning for Arabic Tweet Sentiment Analysis on Education Reforms: Leveraging Pre-Trained Models and LLMs With Snorkel

- Summary
- Subjects
- Metrics

Abstract

This study introduces a novel approach to sentiment classification of Arabic tweets regarding educational reforms in Saudi Arabia. The complexity of the Arabic language, with its numerous dialects, poses challenges for natural language processing tasks, particularly when large volumes of data require manual annotation. To overcome the limitations of traditional labeling methods, we developed a weakly supervised learning framework that combines LLMs (GPT-3.5) and pre-trained language models (MarBERT and XLM-RoBERTa) to generate high-quality weakly labeled training data using the Snorkel framework. We fine-tuned the AraBERT model with this weakly labeled data for sentiment classification. Our experimental results demonstrated the effectiveness of the proposed approach, achieving 83% precision, 76% recall, and an 85% F1 score in classifying tweets as positive, negative, or neutral. Comparative analysis showed that GPT-3.5 outperformed Llama 2 in prompting tasks, and our weakly supervised model surpassed baseline machine learning methods. These findings highlight the potential of weakly supervised learning in analyzing public opinion on Arabic social media platforms without relying on large, labeled datasets.

Related Organizations

Imam Muhammad ibn Saud Islamic University
Saudi Arabia
King Abdulaziz University
Saudi Arabia

Keywords

AraBERT, large language models, Electrical engineering. Electronics. Nuclear engineering, natural language processing, weakly training data, social media data, weak supervision, TK1-9971

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

gold

Related to Research communities

Digital Humanities and Cultural Heritage