descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Apr 2018Embargo end date: 01 Jan 2017Publisher:IEEEJournal:2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Authors: Rethage, Dario; Pons Puig, Jordi; Serra, Xavier;

doi: 10.1109/icassp.2018.8462417 , 10.48550/arxiv.1706.07162

arXiv: http://arxiv.org/abs/1706.07162

handle: 10230/35669

A Wavenet for Speech Denoising

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Currently, most speech processing techniques use magnitude spectrograms as front-end and are therefore by default discarding part of the signal: the phase. In order to overcome this limitation, we propose an end-to-end learning method for speech denoising based on Wavenet. The proposed model adaptation retains Wavenet's powerful acoustic modeling capabilities, while significantly reducing its time-complexity by eliminating its autoregressive nature. Specifically, the model makes use of non-causal, dilated convolutions and predicts target fields instead of a single target sample. The discriminative adaptation of the model we propose, learns in a supervised fashion via minimizing a regression loss. These modifications make the model highly parallelizable during both training and inference. Both computational and perceptual evaluations indicate that the proposed method is preferred to Wiener filtering, a common method based on processing the magnitude spectrogram.

In proceedings of the 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018). Code: https://github.com/drethage/speech-denoising-wavenet - Audio examples: http://jordipons.me/apps/speech-denoising-wavenet/

Related Organizations

View all View all

Keywords

FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Sound

1 Research products, page 1 of 1

speech-denoising-wavenet software on GitHub
IsRelatedTo

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	271
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 0.1%