Inter-phone and inter-word distances for confusability prediction in speech recognition

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2004 English Publisher:Sociedad Española para el Procesamiento del Lenguaje NaturalJournal:Proces. del Leng. Natural, volume 33

Authors: Jan Anguita; Javier Hernando;

handle: 10045/1452

Inter-phone and inter-word distances for confusability prediction in speech recognition

- Summary
- Subjects
- Metrics

Abstract

In this work we investigate new inter-phone and inter-word distances and we apply them to predict if two words of the lexicon of an Automatic Speech Recognition (ASR) system are likely to be confused. The inter-word distance is calculated from an alignment between the phonetic transcriptions of the words by adding the distances between the aligned phones. We bring a new solution in which the inter-phone distance used for computing the inter-word distance is not the same used to compute the phonetic alignment. The first one is calculated between the acoustic models of the phones with a new formula that we propose. The second one is based on phonetic knowledge. We also use two different kinds of alignments: either with or without insertions and deletions. In order to evaluate the performances, we introduce a classical false acceptance/false rejection framework and the prediction Equal Error Rate (EER) was measured to be less than 2%.

En este trabajo se investigan nuevas distancias entre fonemas y entre palabras que se han usado para predecir si dos palabras del vocabulario de un sistema de reconocimiento del habla se van a confundir o no. La distancia entre palabras se calcula a partir de un alineamiento entre las transcripciones fonéticas de las palabras sumando las distancias entre los fonemas alineados. Se propone una nueva solución donde la distancia entre fonemas usada para alinear no es la misma que la que se usa para calcular la distancia entre palabras. La primera está basada en conocimiento fonético. La segunda se obtiene a partir de los modelos acústicos de los fonemas con una nueva fórmula que proponemos. También se han usado dos tipos de alineamientos: con o sin inserciones y omisiones. Para evaluar la predicción se han calculado las tasas de falso rechazo y falsa aceptación y se ha obtenido un Equal Error Rate de menos del 2%.

Keywords

Inter-word distance, Inter-phone distance, Distancia entre fonemas, Confusión, Confusability, Predicción, Prediction, Distancia entre palabras

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green