Name: COMPRESS TO CREATE
Creator: Briot, Jean-Pierre
Keywords: [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], [SHS.MUSIQ] Humanities and Social Sciences/Musicology and performing arts, Music generation, Control, Deep learning, Autoencoder, [INFO] Computer Science [cs], Latent variables

Publication: Part of book or chapter of book, Article. Date: 01 Jan 2023. Publisher: Editora Científica Digital

Authors: Briot, Jean-Pierre

doi: 10.37885/230814176


Abstract

The current tsunami of deep learning has already conquered new areas, such as the generation of creative content (images, music, text). The motivation is to use the capacity of modern deep learning architectures, together with their associated training and generation techniques, to automatically learn styles from arbitrary corpora and then generate samples from the estimated distribution, with some degree of control over the generation. In this article, we analyze the use of autoencoder architectures and how their ability to compress information turns out to be an interesting basis for generating music. Autoencoders are good at representation learning, that is, at extracting a compressed and abstract representation (a set of latent variables) common to the set of training examples. By choosing various instances of this abstract representation (i.e., by sampling the latent variables), we may efficiently generate various instances within the learned style. Furthermore, we may use various approaches to control the generation, such as interpolation, attribute vector arithmetic, recursion, and objective optimization, as will be illustrated by various examples. Before concluding the article, we discuss some limitations of autoencoders, introduce the concept of variational autoencoders, and briefly compare their respective merits and limitations for generating music.
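
Two of the control approaches named in the abstract, interpolation and attribute vector arithmetic, can be illustrated directly on latent vectors. The following is a minimal sketch and not the chapter's own implementation: the function names are hypothetical, the latent vectors are made-up two-dimensional values, and the attribute vector is assumed to be the difference between the mean latent codes of examples with and without a given characteristic. A trained decoder, not shown here, would be needed to turn each resulting vector back into music.

```python
from typing import List, Sequence

Vector = List[float]


def interpolate(z_start: Sequence[float], z_end: Sequence[float], steps: int) -> List[Vector]:
    """Return `steps` latent vectors evenly spaced on the line from z_start to z_end."""
    if steps < 2:
        raise ValueError("steps must be at least 2")
    path = []
    for i in range(steps):
        t = i / (steps - 1)  # goes from 0.0 (start) to 1.0 (end)
        path.append([(1 - t) * a + t * b for a, b in zip(z_start, z_end)])
    return path


def mean_vector(vectors: Sequence[Sequence[float]]) -> Vector:
    """Component-wise mean of a non-empty collection of equally sized vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def attribute_vector(with_attr: Sequence[Sequence[float]],
                     without_attr: Sequence[Sequence[float]]) -> Vector:
    """Assumed definition: mean code with the attribute minus mean code without it."""
    mean_with = mean_vector(with_attr)
    mean_without = mean_vector(without_attr)
    return [a - b for a, b in zip(mean_with, mean_without)]


def apply_attribute(z: Sequence[float], attr: Sequence[float], strength: float = 1.0) -> Vector:
    """Shift a latent vector along the attribute direction by `strength`."""
    return [x + strength * d for x, d in zip(z, attr)]


# Hypothetical 2-D latent codes, standing in for the output of a trained encoder.
z_a = [0.0, 1.0]
z_b = [2.0, 3.0]
path = interpolate(z_a, z_b, 3)

attr = attribute_vector([[1.0, 2.0], [3.0, 2.0]], [[0.0, 0.0], [0.0, 2.0]])
shifted = apply_attribute([0.0, 0.0], attr, strength=0.5)
```

Each vector in `path`, and the vector `shifted`, would then be passed to the decoder to obtain a generated sample.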

Related Organizations

Institut des sciences de l'information et de leurs interactions
France
Sorbonne Paris Cité
France
Sorbonne University
France
LIP6
France
French National Centre for Scientific Research
France


Impact by BIP!

- Selected citations: 0. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.


Green