Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2026
License: CC BY
Data sources: ZENODO
ZENODO
Other literature type . 2026
License: CC BY
Data sources: Datacite
ZENODO
Other literature type . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Spamshield Sentiment Analysis on Youtube Comments

Authors: Prof.Supriya G Purohit; Ms.G Keerthi; Ms.Hafsa Zareen; Ms. Ayesha Hasan Osmani;

Spamshield Sentiment Analysis on Youtube Comments

Abstract

The exponential growth of social media platforms has resulted in an overwhelming volume of user-generated textual content, making effective content moderation and opinion analysis increasingly challenging. YouTube, as one of the most popular video-sharing platforms, receives millions of comments daily, which include genuine feedback as well as spam, promotional messages, and emotionally charged content. Manual analysis of such large-scale data is inefficient, time-consuming, and prone to inconsistencies. Therefore, there is a growing need for automated systems capable of analyzing user comments and extracting meaningful insights in real time. This paper presents SpamShield, a web-based automated system designed to analyze YouTube comments using Natural Language Processing (NLP) techniques. The proposed system retrieves real-time comments from YouTube videos using the YouTube Data API and performs comprehensive text preprocessing to remove noise and normalize the data. Preprocessing steps include text normalization, removal of special characters and URLs, tokenization, and stop-word elimination, ensuring that the comments are suitable for reliable analysis. The sentiment analysis process is implemented using the Natural Language Toolkit (NLTK) along with the VADER (Valence Aware Dictionary and sEntiment Reasoner) lexicon, hich is specifically optimized for analyzing social media text. Each comment is evaluated based on lexical and contextual features, and the sentiment polarity is classified into positive, negative, or neutral categories. The system further aggregates sentiment results to generate comprehensive sentiment reports that provide a clear overview of audience opinions and engagement patterns. Experimental evaluation demonstrates that the proposed system effectively analyzes informal and unstructured social media text, offering reliable sentiment classification and intuitive visualization of results. The modular and scalable architecture of SpamShield enables efficient processing of large volumes of comments and supports real-time analysis. The proposed approach provides a practical solution for content creators, marketers, and researchers to understand audience sentiment, enhance user engagement, and support data-driven decision- making in social media environments.

Keywords

YouTube Comment Analysis, Spam Detection, Sentiment Analysis, Natural Language Processing (NLP), Social Media Analytics.

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Green