ZENODO
Article . 2014
License: CC BY
Data sources: Datacite
Data sources: ZENODO

Bidirectional Dynamic Time Warping Algorithm for the Recognition of Isolated Words Impacted by Transient Noise Pulses

Authors: G. Tamulevičius; A. Serackis; T. Sledevič; D. Navakauskas

Abstract

We consider the biggest challenge in speech recognition: noise reduction. Traditionally, transient noise pulses are detected and then removed together with the corrupted speech using pulse models. In this paper we propose to cope with the problem directly in the Dynamic Time Warping domain. A Bidirectional Dynamic Time Warping algorithm for the recognition of isolated words impacted by transient noise pulses is proposed. It uses a simple transient noise pulse detector, employs bidirectional computation of dynamic time warping, and directly manipulates the warping results. An experimental investigation with several alternative solutions confirms the effectiveness of the proposed algorithm in reducing the impact of noise on the recognition process: a 3.9% improvement in noisy speech recognition is achieved.
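The abstract does not give the algorithm's details, so the following Python sketch is only one plausible reading of "bidirectional computation of dynamic time warping", not the authors' implementation. It assumes scalar per-frame features, an absolute-difference local distance, and the standard three-step DTW recursion. The function names (`detect_pulse`, `cumulative_cost`, `bidirectional_dtw`), the median-based threshold, and the rule for joining the forward and backward passes are all hypothetical choices made for illustration.

```python
import math


def detect_pulse(frames, factor=3.0):
    """Toy detector (assumption): flag the span of frames whose magnitude
    exceeds `factor` times the median magnitude. Returns (start, end) or None."""
    mags = sorted(abs(v) for v in frames)
    median = mags[len(mags) // 2]
    threshold = factor * max(median, 1e-9)
    flagged = [i for i, v in enumerate(frames) if abs(v) > threshold]
    if not flagged:
        return None
    return flagged[0], flagged[-1] + 1


def cumulative_cost(x, y):
    """Standard DTW table: D[i][j] is the cheapest warping path from (0, 0)
    to (i, j) using steps (1, 0), (0, 1) and (1, 1)."""
    n, m = len(x), len(y)
    D = [[math.inf] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            if i == 0 and j == 0:
                best = 0.0
            else:
                best = min(
                    D[i - 1][j] if i > 0 else math.inf,
                    D[i][j - 1] if j > 0 else math.inf,
                    D[i - 1][j - 1] if i > 0 and j > 0 else math.inf,
                )
            D[i][j] = abs(x[i] - y[j]) + best
    return D


def dtw_distance(x, y):
    """Plain (unidirectional) DTW distance between two sequences."""
    return cumulative_cost(x, y)[-1][-1]


def bidirectional_dtw(x, y, start, end):
    """Score x against template y while skipping frames x[start:end].

    The forward pass covers x[:start]; the backward pass (DTW on the
    reversed sequences) covers x[end:]. Assumes 0 < start < end < len(x).
    """
    n, m = len(x), len(y)
    fwd = cumulative_cost(x, y)
    rev = cumulative_cost(x[::-1], y[::-1])
    best = math.inf
    for j1 in range(m):          # template frame where the forward path stops
        for j2 in range(j1, m):  # template frame where the backward path starts
            backward = rev[n - 1 - end][m - 1 - j2]
            best = min(best, fwd[start - 1][j1] + backward)
    return best


template = [0, 1, 2, 3, 2, 1, 0]
noisy = [0, 1, 2, 9, 2, 1, 0]        # frame 3 is hit by a pulse
span = detect_pulse(noisy)           # span of flagged frames
plain = dtw_distance(noisy, template)
robust = bidirectional_dtw(noisy, template, *span)
```

In this toy example the plain DTW distance is dominated by the single corrupted frame, while the joined forward and backward scores ignore it. Whether the published algorithm joins the two passes this way is not stated in the abstract.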

Keywords

noise reduction, dynamic time warping, transient noise pulses, speech recognition
