An Open Dataset of Synthetic Speech

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 04 Dec 2023Publisher:IEEEJournal:2023 IEEE International Workshop on Information Forensics and Security (WIFS)Funded by:EC | vera.ai, EC | AI4Media

Authors: Yaroshchuk, Artem; Papastergiopoulos, Christoforos; Cuccovillo, Luca; Aichroth, Patrick; Konstantinos, Konstantinos; Tzovaras, Dimitrios;

doi: 10.1109/wifs58808.2023.10374863 , 10.5281/zenodo.10124946 , 10.5281/zenodo.10124945

An Open Dataset of Synthetic Speech

- Summary
- Metrics

Abstract

This paper introduces a multilingual, multispeaker dataset composed of synthetic and natural speech, designed to foster research and benchmarking in synthetic speech detection. The dataset encompasses 18,993 audio utterances synthesized from text, alongside with their corresponding natural equivalents, representing approximately 17 hours of synthetic audio data. The dataset features synthetic speech generated by 156 voices spanning three languages, namely, English, German, and Spanish, with a balanced gender representation. It targets state-of-the-art synthesis methods, and has been released with a license allowing seamless extension and redistribution by the research community.

The final version of the paper published by IEEE is available online at https://doi.org/10.1109/WIFS58808.2023.10374863.

Related Organizations

Fraunhofer Society
Germany
Centre for Research and Technology Hellas
Greece
Fraunhofer Institute for Digital Media Technology
Germany

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

4

Top 10%

Average

Top 10%

Green

Funded by

EC| vera.ai, EC| AI4Media