Multi-band automatic speech recognition

Publication: Article, 01 Apr 2001, France, English

Publisher: Elsevier BV

Journal: Computer Speech & Language, volume 15, pages 151-174 (ISSN: 0885-2308)

Authors: Cerisara, Christophe; Fohr, Dominique;

doi: 10.1006/csla.2001.0163

Abstract

This paper presents a new architecture for automatic speech recognition (ASR) systems, characterized by the division of the spectral domain of the speech signal into several independent frequency bands. The model is based on the psycho-acoustic work of Fletcher (1953), who proposed a similar principle for the human auditory system. Jont B. Allen summarized Fletcher's work in 1994 and also proposed adapting the multi-band paradigm to ASR (Allen, 1994). Many researchers have since studied this principle and built such ASR systems. The goal of this paper is to analyse some of the most important issues in the design of a multi-band ASR system in order to determine which architecture suits which environment. Two other major problems are then considered: how to train multi-band systems and how to use them for continuous ASR.
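As a rough illustration of the band-splitting idea described in the abstract, the following minimal sketch divides the magnitude spectrum of one frame into contiguous sub-bands, each of which could then be passed to its own recognizer. The function name `split_into_bands`, the equal-width band edges, and the toy spectrum are assumptions made for illustration only; they are not taken from the paper.

```python
from typing import List


def split_into_bands(spectrum: List[float], n_bands: int) -> List[List[float]]:
    """Split one frame's magnitude spectrum into contiguous sub-bands.

    Hypothetical helper: equal-width bands are an assumption made for
    illustration, not the band layout used in the paper.
    """
    n_bins = len(spectrum)
    if n_bands < 1 or n_bands > n_bins:
        raise ValueError("n_bands must be between 1 and len(spectrum)")
    # Integer band edges chosen so that every bin falls in exactly one band.
    edges = [i * n_bins // n_bands for i in range(n_bands + 1)]
    return [spectrum[edges[i]:edges[i + 1]] for i in range(n_bands)]


# Toy 8-bin spectrum standing in for one analysis frame (made-up values).
frame = [0.1, 0.4, 0.9, 0.7, 0.3, 0.2, 0.05, 0.01]
bands = split_into_bands(frame, 4)
```

Each element of `bands` holds the bins of one frequency band. How the per-band outputs are recombined, and how many bands to use, are among the design issues the paper analyses and are deliberately left out of this sketch.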

Country

France

Keywords

[INFO.INFO-OH] Computer Science [cs]/Other [cs.OH], speech recognition, multi-band

Impact by BIP!

	selected citations	6
	popularity	Average
	influence	Top 10%
	impulse	Average