Musical Audio Synthesis Using Autoencoding Neural Nets

Conference object, Article English OPEN
Andy M. Sarroff; Michael Casey;
(2014)

With an optimal network topology and tuning of hyperpa-\ud rameters, artificial neural networks (ANNs) may be trained\ud to learn a mapping from low level audio features to one\ud or more higher-level representations. Such artificial neu-\ud ral networks are commonly us... View more
  • References (12)
    12 references, page 1 of 2

    [1] P. Hamel, S. Wood, and D. Eck, “Automatic identification of instrument classes in polyphonic and polyinstrument audio.” in Proceedings of the 10th International Society for Music Information Retrieval Conference (ISMIR-09), Kobe, Japan, October 2009, pp. 399- 404.

    [2] H. Lee, P. T. Pham, Y. Largman, and A. Y. Ng, “Unsupervised feature learning for audio classification using convolutional deep belief networks.” in Proceedings of the Neural Information Processing Systems Foundation (NIPS-14), vol. 9, Vancouver, Canada, December 2009, pp. 1096-1104.

    [3] P. Hamel and D. Eck, “Learning features from music audio with deep belief networks.” in Proceedings of the 11th International Society for Music Information Retrieval Conference (ISMIR-10), Utrecht, Netherlands, September 2010, pp. 339-344.

    [4] S. Dieleman, P. Brakel, and B. Schrauwen, “Audiobased music classification with a pretrained convolutional network,” in Proceedings of the 12th international society for music information retrieval conference (ISMIR-11), Miami, USA, October 2011, pp. 669-674.

    [5] E. Humphrey, A. Glennon, and J. Bello, “Non-linear semantic embedding for organizing large instrument sample libraries,” in Proceedings of the IEEE International Conference on Machine Learning and Applications (ICMLA-11), Honolulu, HI., December 2011.

    [6] E. M. Schmidt, J. J. Scott, and Y. E. Kim, “Feature learning in dynamic environments: Modeling the acoustic structure of musical emotion.” in Proceedings of the 13th international society for music information retrieval conference (ISMIR-12), Porto, Portugal, October 2012, pp. 325-330.

    [7] E. Humphrey and J. Bello, “Rethinking automatic chord recognition with convolutional neural networks,” in Proceedings of the 11th International Conference on Machine Learning and Applications (ICMLA-12), Boca Raton, USA, December 2012.

    [8] E. Humphrey, T. Cho, and J. Bello, “Learning a robust tonnetz-space transform for automatic chord recognition,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-12), Kyoto, Japan, May 2012, pp. 453-456.

    [9] E. M. Schmidt and Y. E. Kim, “Learning rhythm and melody features with deep belief networks,” in Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR-13), Curitiba, Brazil, November 2013.

    [10] I. J. Goodfellow, D. Warde-Farley, P. Lamblin, V. Dumoulin, M. Mirza, R. Pascanu, J. Bergstra, F. Bastien, and Y. Bengio, “Pylearn2: a machine learning research library,” arXiv preprint arXiv:1308.4214, 2013. [Online]. Available: http://arxiv.org/abs/1308.4214

  • Related Organizations (1)
  • Metrics
Share - Bookmark