Unsupervised Automatic Speech Recognition: A review

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Apr 2022Embargo end date: 01 Jan 2021 English Publisher:Elsevier BVJournal:Speech Communication, volume 139, pages 76-91 (issn: 0167-6393,

Copyright policy )

Authors: Hanan Aldarmaki; Asad Ullah; Sreepratha Ram; Nazar Zaki;

doi: 10.1016/j.specom.2022.02.005 , 10.48550/arxiv.2106.04897

arXiv: 2106.04897

Unsupervised Automatic Speech Recognition: A review

- Summary
- Subjects
- Metrics

Abstract

Automatic Speech Recognition (ASR) systems can be trained to achieve remarkable performance given large amounts of manually transcribed speech, but large labeled data sets can be difficult or expensive to acquire for all languages of interest. In this paper, we review the research literature to identify models and ideas that could lead to fully unsupervised ASR, including unsupervised segmentation of the speech signal, unsupervised mapping from speech segments to text, and semi-supervised models with nominal amounts of labeled examples. The objective of the study is to identify the limitations of what can be learned from speech data alone and to understand the minimum requirements for speech recognition. Identifying these limitations would help optimize the resources and efforts in ASR development for low-resource languages.

26 pages + 10 pages of references, 3 figures. Speech Communication (2022)

Related Organizations

National University of Sciences and Technology
Pakistan
Al Ain University of Science and Technology
United Arab Emirates
Islamic Azad University, UAE Branch
United Arab Emirates

Keywords

FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	56
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%

Found an issue? Give us feedback

56

Top 1%

Top 10%

Top 1%

Green

hybrid

Fields of Science (4) View all

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

View all