Downloads provided by UsageCounts
This contains phoneme alignments and word alignments (= labels for each timestep) for all 980 hours of LibriSpeech. We obtained these alignments using the Montreal Forced Aligner, using their pre-trained LibriSpeech acoustic model. To make it easy to replicate the experiments in our paper, we provide these alignments, so you don't need to run the aligner yourself. Note that for a small number of audio files, the aligner could not compute an alignment; we did not use these audios during training. If you find these alignments or other parts of our experiment useful, please cite our paper: Loren Lugosch, Mirco Ravanelli, Patrick Ignoto, Vikrant Singh Tomar, and Yoshua Bengio, "Speech Model Pre-training for End-to-End Spoken Language Understanding", Interspeech 2019. as well as the Montreal Forced Aligner paper: Michael McAuliffe, Michaela Socolof, Sarah Mihuc, Michael Wagner, and Morgan Sonderegger. "Montreal Forced Aligner: trainable text-speech alignment using Kaldi", Interspeech 2017.
phonemes, speech, speech recognition, alignments
phonemes, speech, speech recognition, alignments
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 900 | |
| downloads | 359 |

Views provided by UsageCounts
Downloads provided by UsageCounts