Powered by OpenAIRE graph
https://doi.org/10.5281/zenodo...
Dataset . 2025
License: CC BY
Data sources: Sygma
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite

This Research product is the result of merged Research products in OpenAIRE.


MIVIA Speech Command (FELICE Project)

Authors: Department of Information Engineering, Electrical Engineering, and Applied Mathematics (DIEM); Vento, Mario; Saggese, Alessia; Carletti, Vincenzo; Greco, Antonio; Ritrovato, Pierluigi; Rosa, Francesco; +1 Authors


Abstract

The speech command dataset supports human-robot vocal communication. It consists of speech commands recorded through crowdsourcing with a Telegram bot and with the microphones mounted on the robot and the adaptive workstation. The dataset also includes synthetic samples produced with text-to-speech services and negative samples that reproduce the “normal” speech of workers during their assembly operations. To reproduce the typical noisy environment of the assembly line, an augmentation procedure adds random noise, collected at real industrial sites, to the voice samples at different signal-to-noise ratios (SNRs).

Deployment environment: The dataset includes voice samples recorded by real people using the microphone installed on board the robot, the one on the adaptive workstation, the Telegram bot, or any combination of these. In addition, synthetic samples are produced with text-to-speech algorithms. Finally, an automatic augmentation procedure adds random noise, with variable SNRs, to the voice samples in order to reproduce different types of industrial noise.

Data acquisition: The samples are collected with the Telegram bot available at this link: https://t.me/speechcommand_bot. Using a widespread open-source tool like Telegram makes it possible to collect a large amount of data from a large number of people in a short time. In addition, speech commands were collected with the microphones installed on board the robot and the adaptive workstation in the CRF use case. Ground truths are double-checked by experts.

MIVIA Speech Command: The dataset is split into two parts:

  • Training and Validation Sets: These subsets, used for training and validation, are available in two versions:
    With synthetic samples: speech_command_dataset_with_synth.zip
    Without synthetic samples: speech_command_dataset_without_synth.zip
  • Test Set: This subset contains only real samples collected in real-world scenarios, specifically within CRF.
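The noise augmentation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure, which the record does not detail; it only shows one common way to mix a noise clip into a voice clip at a target SNR. The function names `mean_power` and `mix_at_snr`, and the choice of a random noise segment, are assumptions made for illustration.

```python
import math
import random


def mean_power(samples):
    """Mean squared amplitude of a sequence of float samples."""
    return sum(s * s for s in samples) / len(samples)


def mix_at_snr(voice, noise, snr_db):
    """Add a random segment of `noise` to `voice` at the requested SNR (dB).

    Hypothetical helper for illustration; not taken from the dataset tooling.
    """
    if len(noise) < len(voice):
        raise ValueError("noise clip must be at least as long as the voice clip")

    # Pick a random noise segment with the same length as the voice clip.
    start = random.randint(0, len(noise) - len(voice))
    segment = noise[start:start + len(voice)]

    p_voice = mean_power(voice)
    p_noise = mean_power(segment)
    if p_noise == 0.0:
        # Silent noise segment: nothing to add.
        return list(voice)

    # Solve SNR_dB = 10 * log10(p_voice / (gain**2 * p_noise)) for gain.
    gain = math.sqrt(p_voice / (p_noise * 10 ** (snr_db / 10)))
    return [v + gain * n for v, n in zip(voice, segment)]
```

Calling `mix_at_snr(voice, noise, snr_db)` with several `snr_db` values drawn at random would yield multiple noisy variants of the same command, which is the general idea behind augmenting with "different SNRs".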

Keywords

Speech command recognition

  • Impact by BIP!
    citations: 0 (an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network, diachronically)
    popularity: Average (reflects the "current" impact/attention, the "hype", of an article in the research community at large, based on the underlying citation network)
    influence: Average (reflects the overall/total impact of an article in the research community at large, based on the underlying citation network, diachronically)
    impulse: Average (reflects the initial momentum of an article directly after its publication, based on the underlying citation network)