Revisiting semi-continuous hidden Markov models

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Mar 2012Publisher:IEEEJournal:2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Authors: Korbinian Riedhammer; Tobias Bocklet; Arnab Ghoshal; Daniel Povey;

doi: 10.1109/icassp.2012.6288973

Revisiting semi-continuous hidden Markov models

- Summary
- Metrics

Abstract

In the past decade, semi-continuous hidden Markov models (SCHMMs) have not attracted much attention in the speech recognition community. Growing amounts of training data and increasing sophistication of model estimation led to the impression that continuous HMMs are the best choice of acoustic model. However, recent work on recognition of under-resourced languages faces the same old problem of estimating a large number of parameters from limited amounts of transcribed speech. This has led to a renewed interest in methods of reducing the number of parameters while maintaining or extending the modeling capabilities of continuous models. In this work, we compare classic and multiple-codebook semi-continuous models using diagonal and full covariance matrices with continuous HMMs and subspace Gaussian mixture models. Experiments on the RM and WSJ corpora show that while a classical semicontinuous system does not perform as well as a continuous one, multiple-codebook semi-continuous systems can perform better, particular when using full-covariance Gaussians.

Related Organizations

University of Erlangen-Nuremberg
Germany
University of Edinburgh
United Kingdom
Microsoft (United States)
United States
Saarland University
Germany

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	10
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%