Downloads provided by UsageCounts
{"references": ["L. Deng, X. Huang, \"Challenges in adopting speech recognition\", Communications of ACM, vol. 47(1), ACM, New York, pp. 69\u201375, 2004.", "G. \u010ceidait\u0117, L. Telksnys, \"Analysis of factors influencing accuracy of speech recognition\", Electronics and Electrical Engineering, no. 9(105), pp. 69\u201372, 2010.", "Ch.-P. Chen, Noise robustness in automatic speech recognition, Ph. D. thesis, University of Washington, 2004.", "J. Benesty, S. Makino, J. Chen, Speech enhancement, Berlin: Springer-Verlag, 2005.", "M. Seltzer, M. Microphone array processing for robust speech recognition, Ph. D. thesis, Carnegie Mellon University, Pittsburgh, 2003.", "R. Talmon, I. Cohen, and Sh. Gannot, \"Transient noise reduction using nonlocal diffusion filters\", IEEE Trans. Audio, Speech, and Language Processing, vol. 19(6), pp. 1584 \u2013 1599, 2011.", "R. Gomez and T. Kawahara, \"Optimizing spectral subtraction and Wiener filtering for robust speech recognition in reverberant and noisy conditions\", in Proc. of ICASSP, pp. 4566\u20134569, 2010.", "H. Hermansky, \"Perceptual linear predictive (PLP) analysis of speech\", Journal of Acoustical Society of America, vol. 87(4), pp. 1738\u20131752, 1990.", "D.-S. Kim, S.-Y. Lee, and R. M. Kil, \"Auditory processing of speech signals for robust speech recognition in real-world noisy environments\", IEEE Trans.Speech and Audio Processing, vol. 7(1), pp. 55\u201369, 1999.\n[10]\tY. Shao, Zh. Jin, D. Wang, and S. Srinivasan, \"An auditory-based feature for robust speech recognition\", in Proc. of ICASSP, pp. 4625\u20134628, 2009.\n[11]\tCh. Kim and R. M. Stern, \"Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction\", in INTERSPEECH 2010, pp. 2058\u20132061, 2010.\n[12]\tD. Yu, L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero, \"A Minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition\", in Proc. of ICASSP, pp. 4041\u20134044, 2008. \n[13]\tM. Fujimoto, S. Watanabe, and T. Nakatani, \"Non-stationary noise estimation method on bias-residual component decomposition for robust speech recognition\", in Proc. of ICASSP, pp. 4816\u20134819, 2011.\n[14]\tS.V. Vaseghi, Advanced digital signal processing and noise reduction, New York: Wiley, 2006.\n[15]\tH. Sakoe and S. Chiba, \"Dynamic programming algorithm optimization for spoken word recognition\", IEEE Trans.Speech and Audio Processing, vol. 26(1), pp. 43\u201349, 1978.\n[16]\tL. Rabiner and B.-H. Juang Fundamentals of speech recognition, New Jersey: Prentice-Hall, 1993.\n[17]\tT. Sledevi\u010d, D. Navakauskas, \"FPGA based fast Lithuanian isolated word recognition system\", in Proc. of EUROCON 2012, pp. 1630\u20131636, 2013.\n[18]\tT. Sledevi\u010d, G. Tamulevi\u010dius, D. Navakauskas, \"Upgrading FPGA implementation of isolated word recognition system for a real-time operation\", Electronics and Electrical Engineering, no. 10(19), pp. 123\u2013128, 2013."]}
We consider the biggest challenge in speech recognition – noise reduction. Traditionally detected transient noise pulses are removed with the corrupted speech using pulse models. In this paper we propose to cope with the problem directly in Dynamic Time Warping domain. Bidirectional Dynamic Time Warping algorithm for the recognition of isolated words impacted by transient noise pulses is proposed. It uses simple transient noise pulse detector, employs bidirectional computation of dynamic time warping and directly manipulates with warping results. Experimental investigation with several alternative solutions confirms effectiveness of the proposed algorithm in the reduction of impact of noise on recognition process – 3.9% increase of the noisy speech recognition is achieved.
noise reduction, dynamic time warping, Transient noise pulses, speech recognition.
noise reduction, dynamic time warping, Transient noise pulses, speech recognition.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 4 | |
| downloads | 2 |

Views provided by UsageCounts
Downloads provided by UsageCounts