Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Article . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Article . 2025
License: CC BY
Data sources: Datacite
ZENODO
Article . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

OPTIMIZED FILTER BANKS TO IMPROVE PUNJABI SPEECH RECOGNITION IN NOISY ENVIRONMENT

Authors: Pooja Rani; Ashish Saini; Vikas Mittal;

OPTIMIZED FILTER BANKS TO IMPROVE PUNJABI SPEECH RECOGNITION IN NOISY ENVIRONMENT

Abstract

The performance of Automatic Speech Recognition (ASR) depends on its capability to identify the test patterns best-matched with training patterns in various classes. This matching is highly dependent upon the individual feature extraction technique or combination thereof. Certain advanced feature extraction techniques such as GFCC, BFCC have been reported in the literature (with associated additional problems of accepted bandwidth and optimal number of features) in addition to the commonly used ones such as Mel Frequency Cepstral Coefficient (MFCC) and Perceptual Linear Prediction (PLP) coefficient. MFCC is more suitable for clean environments while PLP performs better when there lies a significant mismatch between training and testing phase. Therefore, this paper proposes a minimalistic approach involving hybrid features (i.e., MFCC+PLP) to overcome shortcomings of each constituent, such as sensitivity to background noise on one hand, and avoid complexity in extracting advanced features, such as GFCC and BFCC etc. on the other hand. These hybrid features can provide favourable or comparable results as compared to those obtained using advanced features in both clean and noisy environments. The other problem of optimizing the number of filter banks for a specified bandwidth is proposed to be accomplished using an evolutionary technique like DE (Differential Evolution) to enable suitable comparisons with the existing literature. Additionally, an advanced classifier viz. Deep Neural Networks (DNN) is used as compared to ones that are more conventional such as Hidden Markov Model (HMM) used in the literature for further improvisation.

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Green