ZENODO
Journal . 2026
License: CC BY
Data sources: ZENODO
Data sources: Datacite

CONTEXT-AWARE NOISE SUPPRESSION USING MULTIMODAL AI

Authors: Khushi Ajay Vishwakarma and Khushi Kamlesh Soni

Abstract

Noise suppression and enhancement technologies play a vital role in modern communication systems, especially in video conferencing platforms such as Google Meet, online collaboration tools, and virtual learning environments. Traditional adaptive noise cancellation methods rely mainly on unimodal audio input and low-level acoustic processing. This is often insufficient in complex real-world environments and can lead to the loss of meaningful auditory information. This paper proposes a context-aware noise suppression framework based on multimodal artificial intelligence to overcome these limitations. The framework integrates audio, visual, and motion-based contextual information to enable semantic-level understanding of sound sources. Audio signals are analyzed using speech and noise classification models, while visual and motion inputs help determine spatial orientation and contextual relevance. At the conceptual level, a unified decision mechanism determines whether each sound should be preserved or suppressed based on the surrounding context. The proposed approach is expected to improve speech clarity, enhance user focus, and maintain environmental awareness. It is particularly relevant for applications such as video conferencing, wireless headphones, smart earbuds, assistive hearing devices, gaming headsets, and safety-critical communication systems, which highlights the importance of multimodal intelligence in next-generation noise suppression technologies.
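The abstract describes the unified decision mechanism only at a conceptual level, so the following is a minimal illustrative sketch and not the authors' method. It assumes each sound source has already been reduced to three scores in [0, 1] (a speech probability from an audio classifier, a visual relevance score, and a motion relevance score) and combines them with a weighted sum. The names `SoundContext` and `decide`, the weights, and the threshold are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SoundContext:
    """Per-source scores in [0, 1]. All fields are hypothetical stand-ins
    for the outputs of the audio, visual, and motion branches."""
    speech_prob: float       # audio branch: probability the source is speech
    visual_relevance: float  # visual branch: e.g. source is where the user is facing
    motion_relevance: float  # motion branch: e.g. user movement suggests attention


def decide(ctx: SoundContext,
           weights: tuple = (0.5, 0.3, 0.2),
           threshold: float = 0.5) -> str:
    """Return "preserve" or "suppress" from a weighted sum of the scores.

    The weights and threshold are illustrative assumptions, not values
    taken from the paper.
    """
    w_audio, w_visual, w_motion = weights
    score = (w_audio * ctx.speech_prob
             + w_visual * ctx.visual_relevance
             + w_motion * ctx.motion_relevance)
    return "preserve" if score >= threshold else "suppress"


if __name__ == "__main__":
    # A likely speaker in front of the user versus background noise to the side.
    print(decide(SoundContext(0.9, 0.8, 0.5)))
    print(decide(SoundContext(0.1, 0.2, 0.1)))
```

A weighted sum is only one possible fusion rule. A learned classifier over the same three inputs would fit the description in the abstract equally well.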
