Subject: Computer Science - Sound | Electrical Engineering and Systems Science - Audio and Speech Processing
Mean square error (MSE) has been the preferred choice as loss function in the current deep neural network (DNN) based speech separation techniques. In this paper, we propose a new cost function with the aim of optimizing the extended short time objective intelligibility... View more
 S. T. Roweis, “One microphone source separation,” in Advances in Neural Information Processing Systems, 2001, pp. 793-799.
 T. Virtanen, “Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 3, pp. 1066-1074, 2007.
 M. N. Schmidt and R. K. Olsson, “Single-channel speech separation using sparse non-negative matrix factorization,” in Ninth International Conference on Spoken Language Processing, 2006.
 P. Huang, M. Kim, M. Hasegawa-Johnson, and P. Smaragdis, “Deep learning for monaural speech separation,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2014, pp. 1562-1566.
 H. Erdogan, J. R. Hershey, S. Watanabe, and J. Le Roux, “Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks,” in IEEE International Conference onAcoustics, Speech and Signal Processing, 2015, pp. 708-712.
 Y. Luo and N. Mesgarani, “TasNet: time-domain audio separation network for real-time single-channel speech separation,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2018.
 G. Naithani, T. Barker, G. Parascandolo, L. Bramsløw, N. H. Pontoppidan, and T. Virtanen, “Low latency sound source separation using convolutional recurrent neural networks,” in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2017, pp. 71-75.
 G. Naithani, G. Parascandolo, T. Barker, N. H. Pontoppidan, and T. Virtanen, “Low-latency sound source separation using deep neural networks,” in IEEE Global Conference on Signal and Information Processing, 2016, pp. 272-276.
 L. Bramsløw, “Preferred signal path delay and high-pass cut-off in open fittings,” International Journal of Audiology, vol. 49, no. 9, pp. 634-644, 2010.
 J. Hidalgo, “Low latency audio source separation for speech enhancement in cochlear implants,” Master's thesis, Universitat Pompeu Fabra, 2012.