<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
The speech command dataset facilitates human-robot vocal communication. It consists of speech commands recorded with a Telegram bot through crowdsourcing and with the microphones equipped by the robot and the adaptive workstation. The dataset also includes synthetic samples produced with text-to-speech services and negative samples that reproduce “normal” speech of workers during their assembly operations. To reproduce the typical noisy environment of the assembly line, an augmentation procedure allows the addition of random noise, collected in real industrial sites, with different SNRs on the voice samples. Deployment environment: The dataset includes voice samples recorded by real people with the microphone installed on board the robot and/or the adaptive workstation and/or with the Telegram bot. In addition, synthetic samples are produced with text to speech algorithms. Finally, an automatic augmentation procedure allows the addition of random noise, with variable SNRs, to the voice samples, in order to reproduce different types of industrial noise. Data acquisition: The samples are collected with the Telegram bot available at this link: https://t.me/speechcommand_bot. The use of a widespread open-source tool like Telegram allows to collect a large amount of data, from a conspicuous number of people, in a short time. In addition, speech commands have been collected with the microphones installed on board the robot and the adaptive workstation in the CRF use case. Ground truths are double-checked by experts. MIVIA Speech Command: The dataset can be split into two parts: Training and Validation Sets: These subsets used for training and validation are available in two versions: With synthetic samples: speech_command_dataset_with_synth.zip Without synthetic samples: speech_command_dataset_without_synth.zip Test Set: This subset contains only real samples collected from real-world scenarios, specifically within CRF.
Speech command recognition
Speech command recognition
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |