Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Recolector de Cienci...arrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
DBLP
Article . 2004
Data sources: DBLP
versions View all 2 versions
addClaim

Inter-phone and inter-word distances for confusability prediction in speech recognition

Authors: Jan Anguita; Javier Hernando;

Inter-phone and inter-word distances for confusability prediction in speech recognition

Abstract

In this work we investigate new inter-phone and inter-word distances and we apply them to predict if two words of the lexicon of an Automatic Speech Recognition (ASR) system are likely to be confused. The inter-word distance is calculated from an alignment between the phonetic transcriptions of the words by adding the distances between the aligned phones. We bring a new solution in which the inter-phone distance used for computing the inter-word distance is not the same used to compute the phonetic alignment. The first one is calculated between the acoustic models of the phones with a new formula that we propose. The second one is based on phonetic knowledge. We also use two different kinds of alignments: either with or without insertions and deletions. In order to evaluate the performances, we introduce a classical false acceptance/false rejection framework and the prediction Equal Error Rate (EER) was measured to be less than 2%.

En este trabajo se investigan nuevas distancias entre fonemas y entre palabras que se han usado para predecir si dos palabras del vocabulario de un sistema de reconocimiento del habla se van a confundir o no. La distancia entre palabras se calcula a partir de un alineamiento entre las transcripciones fonéticas de las palabras sumando las distancias entre los fonemas alineados. Se propone una nueva solución donde la distancia entre fonemas usada para alinear no es la misma que la que se usa para calcular la distancia entre palabras. La primera está basada en conocimiento fonético. La segunda se obtiene a partir de los modelos acústicos de los fonemas con una nueva fórmula que proponemos. También se han usado dos tipos de alineamientos: con o sin inserciones y omisiones. Para evaluar la predicción se han calculado las tasas de falso rechazo y falsa aceptación y se ha obtenido un Equal Error Rate de menos del 2%.

Keywords

Inter-word distance, Inter-phone distance, Distancia entre fonemas, Confusión, Confusability, Predicción, Prediction, Distancia entre palabras

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Green