<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

Data for the VoiceMOS Challenge 2022

Research datakeyboard_double_arrow_right Dataset 23 May 2022Publisher:Zenodo

Authors: Erica Cooper; Wen-Chin Huang; Tomoki Toda; Junichi Yamagishi;

doi: 10.5281/zenodo.6572572 , 10.5281/zenodo.6572573 , 10.5281/zenodo.10691660

Data for the VoiceMOS Challenge 2022

- Summary
- Metrics

Abstract

This is the public release of the data for the first VoiceMOS Challenge. The challenge had two tracks: a main track and an out-of-domain (OOD) track. The data for the main track is known as the BVCC dataset, and contains samples from past Blizzard Challenges, Voice Conversion Challenges, and public samples from ESPnet-TTS, along with their mean opinion score (MOS) ratings collected in one unified listening test. Standard training/development/testing splits from the challenge are also provided. The OOD track contains samples from the Blizzard Challenge 2019 along with their ratings from the original, separate listening test. We also include the scoring scripts that were used for the challenge. Samples from Blizzard Challenges may NOT be redistributed. Blizzard samples are not included in this dataset, but the scripts to download and preprocess them are included. Please run all of the included scripts to obtain the full dataset. BVCC reference: Erica Cooper and Junichi Yamagishi, "How do Voices from Past Speech Synthesis Challenges Compare Today?" SSW 2021. https://arxiv.org/abs/2105.02373 The VoiceMOS Challenge: Wen-Chin Huang, Erica Cooper, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi, "The VoiceMOS Challenge 2022," submitted to Interspeech 2022. https://arxiv.org/abs/2203.11389 https://voicemos-challenge-2022.github.io The Blizzard Challenges 2008, 2009, 2010, 2011, 2013, 2016, 2019: V. Karaiskos, S. King, R. A. Clark, and C. Mayo, "The Blizzard Challenge 2008," in Proc. Blizzard Challenge Workshop, 2008. A. W. Black, S. King, and K. Tokuda, "The Blizzard Challenge 2009," in Proc. Blizzard Challenge, 2009. S. King and V. Karaiskos, "The Blizzard Challenge 2010," 2010. S. King and V. Karaiskos, "The Blizzard Challenge 2011," 2011. S. King and V. Karaiskos, "The Blizzard Challenge 2013," 2013. S. King and V. Karaiskos, "The Blizzard Challenge 2016," 2016. Z. Wu, Z. Xie, and S. King, "The Blizzard Challenge 2019," 2019. The Voice Conversion Challenges 2016, 2018, and 2020: T. Toda, L.-H. Chen, D. Saito, F. Villavicencio, M. Wester, Z. Wu, and J. Yamagishi, "The Voice Conversion Challenge 2016," Interspeech, 2016. J. Lorenzo-Trueba, J. Yamagishi, T. Toda, D. Saito, F. Villavicencio, T. Kinnunen, and Z. Ling, "The Voice Conversion Challenge 2018: Promoting development of parallel and nonparallel methods." Z. Yi, W.-C. Huang, X. Tian, J. Yamagishi, R. K. Das, T. Kinnunen, Z. Ling, and T. Toda, "Voice Conversion Challenge 2020 — intra-lingual semi-parallel and cross-lingual voice conversion —," in Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020, pp. 80–98. ESPnet-TTS: S. Watanabe, T. Hori, S. Karita, T. Hayashi, J. Nishitoba, Y. Unno, N. Enrique Yalta Soplin, J. Heymann, M. Wiesner, N. Chen, A. Renduchintala, and T. Ochiai, "ESPnet: End-to-end speech processing toolkit," in Proceedings of Interspeech, 2018, pp. 2207–2211. [Online]. Available: http://dx.doi.org/10.21437/ Interspeech.2018- 1456

{"references": ["Cooper, Erica and Yamagishi, Junichi. \"How do Voices from Past Speech Synthesis Challenges Compare Today?\" SSW 2021. https://arxiv.org/abs/2105.02373", "Huang, Wen-Chin et al. \"\"The VoiceMOS Challenge 2022,\" https://arxiv.org/abs/2203.11389", "V. Karaiskos et al. \"The Blizzard Challenge 2008,\" in Proc. Blizzard Challenge Workshop, 2008.", "A. W. Black et al. \"The Blizzard Challenge 2009,\" in Proc. Blizzard Challenge, 2009.", "S. King and V. Karaiskos, \"The Blizzard Challenge 2010,\" 2010.", "S. King and V. Karaiskos, \"The Blizzard Challenge 2011,\" 2011.", "S. King and V. Karaiskos, \"The Blizzard Challenge 2013,\" 2013.", "S. King and V. Karaiskos, \"The Blizzard Challenge 2016,\" 2016.", "Z. Wu et al. \"The Blizzard Challenge 2019,\" 2019.", "T. Toda et al. \"The Voice Conversion Challenge 2016,\" Interspeech, 2016.", "J. Lorenzo-Trueba et al. \"The Voice Conversion Challenge 2018: Promoting development of parallel and nonparallel methods.\"", "Z. Yi et al. \"Voice Conversion Challenge 2020 \u2014 intra-lingual semi-parallel and cross-lingual voice conversion \u2014,\" in Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020, pp. 80\u201398.", "S. Watanabe et al. \"ESPnet: End-to-end speech processing toolkit,\" in Proceedings of Interspeech, 2018, pp. 2207\u20132211. [Online]. Available: http://dx.doi.org/10.21437/ Interspeech.2018- 1456"]}

Related Organizations

Nagoya University
Japan
National Institute of Informatics (NII)
Japan

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Usage byUsageCounts

visibility	views	638
download	downloads	373

638
views
373
downloads
Powered by

Found an issue? Give us feedback

visibility

download

Average

638

373