publication . Preprint . 2016

Learning Features of Music from Scratch

Thickstun, John; Harchaoui, Zaid; Kakade, Sham;
Open Access English
  • Published: 29 Nov 2016
This paper introduces a new large-scale music dataset, MusicNet, to serve as a source of supervision and evaluation of machine learning methods for music research. MusicNet consists of hundreds of freely-licensed classical music recordings by 10 composers, written for 11 instruments, together with instrument/note annotations resulting in over 1 million temporal labels on 34 hours of chamber music performances under various studio and microphone conditions. The paper defines a multi-label classification task to predict notes in musical recordings, along with an evaluation protocol, and benchmarks several machine learning architectures for this task: i) learning f...
free text keywords: Statistics - Machine Learning, Computer Science - Learning, Computer Science - Sound
Download from
31 references, page 1 of 3

E. Benetos and S. Dixon. Joint multi-pitch detection using harmonic envelope estimation for polyphonic music transcription. IEEE Selected Topics in Signal Processing, 2011. [OpenAIRE]

T. Berg-Kirkpatrick, J. Andreas, and D. Klein. Unsupervised transcription of piano music. NIPS, 2014.

K. Choi, G. Fazes, and M. Sandler. Automatic tagging using deep convolutional neural networks. ISMIR, 2016.

S. Dieleman and B. Schrauwen. End-to-end learning for music audio. ICASSP, 2014. [OpenAIRE]

J. Driedger, T. Pra¨tzlich, and M. Mu¨ller. Let It Bee - Towards NMF-inspired audio mosaicing. ISMIR, 2015. [OpenAIRE]

Z. Duan, B. Pardo, and C. Zhang. Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions. TASLP, 2011.

V. Emiya, R. Badeau, and B. David. Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle. TASLP, 2010. [OpenAIRE]

D. Garreau, R. Lajugie, S. Arlot, and F. Bach. Metric learning for temporal sequence alignment. NIPS, 2014. [OpenAIRE]

M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka. RWC music database: Music genre database and musical instrument sound database. ISMIR, 2003.

C. Harte. Towards Automatic Extraction of Harmony Information from Music Signals. PhD thesis, Department of Electrical Engineering, Queen Mary, University of London, 2010.

N. Hu, R. B. Dannenberg, and G. Tzanetakis. Polyphonic audio matching and alignment for music retrieval. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2003.

E. J. Humphrey, J. P. Bello, and Y. LeCun. Moving beyond feature design: Deep architectures and automatic feature learning in music informatics. ISMIR, 2012.

O. Izmirli and R. B. Dannenberg. Understanding features and distance functions for music sequence alignment. ISMIR, 2010. [OpenAIRE]

C. Joder, S. Essid, and G. Richard. Learning optimal features for polyphonic audio-to-score alignment. TASLP, 2013. [OpenAIRE]

A. Khlif and V. Sethu. An iterative multi range non-negative matrix factorization algorithm for polyphonic music transcription. ISMIR, 2015. [OpenAIRE]

31 references, page 1 of 3
Powered by OpenAIRE Open Research Graph
Any information missing or wrong?Report an Issue