MIVIA Speech Command (FELICE Project)

Name: MIVIA Speech Command (FELICE Project)
Keywords: Speech command recognition

Type: Research data, Dataset
Date: 30 Jan 2025
Language: Italian
Publisher: Zenodo
Funded by: EC | FELICE

Authors: Department of Information Engineering, Electrical Engineering, and Applied Mathematics (DIEM); Vento, Mario; Saggese, Alessia; Carletti, Vincenzo; Greco, Antonio; Ritrovato, Pierluigi; Rosa, Francesco; +1 Authors

doi: 10.5281/zenodo.14771083, 10.5281/zenodo.14539278

Abstract

The speech command dataset supports human-robot vocal communication. It consists of speech commands recorded through crowdsourcing with a Telegram bot and with the microphones mounted on the robot and the adaptive workstation. The dataset also includes synthetic samples produced with text-to-speech services and negative samples that reproduce the "normal" speech of workers during their assembly operations. To reproduce the typical noisy environment of the assembly line, an augmentation procedure adds random noise, collected at real industrial sites, to the voice samples at different SNRs.

Deployment environment: The dataset includes voice samples recorded by real people with the microphone installed on board the robot, with the microphone of the adaptive workstation, and with the Telegram bot. In addition, synthetic samples are produced with text-to-speech algorithms. Finally, an automatic augmentation procedure adds random noise, with variable SNRs, to the voice samples in order to reproduce different types of industrial noise.

Data acquisition: The samples are collected with the Telegram bot available at this link: https://t.me/speechcommand_bot. Using a widespread open-source tool like Telegram makes it possible to collect a large amount of data from a large number of people in a short time. In addition, speech commands have been collected with the microphones installed on board the robot and the adaptive workstation in the CRF use case. Ground truths are double-checked by experts.

MIVIA Speech Command: The dataset can be split into two parts.

Training and Validation Sets: The subsets used for training and validation are available in two versions:
- With synthetic samples: speech_command_dataset_with_synth.zip
- Without synthetic samples: speech_command_dataset_without_synth.zip

Test Set: This subset contains only real samples collected in real-world scenarios, specifically within CRF.
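The abstract does not describe how the augmentation procedure is implemented, so the following is only a minimal sketch of one common way to add noise to a voice sample at a chosen SNR. The function name `mix_at_snr`, the 16 kHz sample rate, the stand-in signals, and the 0 to 20 dB SNR range are all assumptions made for illustration; they are not taken from the dataset.

```python
import math
import random


def mix_at_snr(voice, noise, snr_db):
    """Add `noise` to `voice`, scaled so the mix has the requested SNR in dB."""
    voice = list(voice)
    noise = list(noise)
    if not voice or not noise:
        return voice

    # Loop the noise clip if it is shorter than the voice clip, then trim it.
    repeats = len(voice) // len(noise) + 1
    noise = (noise * repeats)[:len(voice)]

    voice_power = sum(v * v for v in voice) / len(voice)
    noise_power = sum(n * n for n in noise) / len(noise)
    if voice_power == 0.0 or noise_power == 0.0:
        return voice  # nothing to scale against

    # SNR_dB = 10 * log10(P_voice / (gain**2 * P_noise)), solved for gain.
    gain = math.sqrt(voice_power / (noise_power * 10 ** (snr_db / 10)))
    return [v + gain * n for v, n in zip(voice, noise)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Stand-in signals (assumed): a 440 Hz tone as "voice", uniform noise as "industrial noise".
    voice = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(16000)]
    noise = [rng.uniform(-1.0, 1.0) for _ in range(4000)]
    snr_db = rng.uniform(0.0, 20.0)  # hypothetical SNR range
    noisy = mix_at_snr(voice, noise, snr_db)
    print(f"mixed {len(noisy)} samples at {snr_db:.1f} dB SNR")
```

Drawing `snr_db` at random for each sample, as in the demo, is one way to obtain the variable SNRs mentioned above; with real recordings the two lists would be replaced by decoded audio samples.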

Related Organizations

Università degli studi di Salerno
Italy

Impact by BIP!

- Citations: 0. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
