Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
versions View all 2 versions
addClaim

WikiMuTe: A web-sourced dataset of semantic descriptions for music audio

Authors: Weck, Benno; Kirchhoff, Holger; Grosche, Peter; Xavier, Serra;

WikiMuTe: A web-sourced dataset of semantic descriptions for music audio

Abstract

This upload contains the supplementary material for our paper presented at the MMM2024 conference. Dataset The dataset contains rich text descriptions for music audio files collected from Wikipedia articles. The audio files are freely accessible and available for download through the URLs provided in the dataset. Example A few hand-picked, simplified examples of the dataset. file aspects sentences 🔈 Bongo sound.wav ['bongoes', 'percussion instrument', 'cumbia', 'drums'] ['a loop of bongoes playing a cumbia beat at 99 bpm'] 🔈 Example of double tracking in a pop-rock song (3 guitar tracks).ogg ['bass', 'rock', 'guitar music', 'guitar', 'pop', 'drums'] ['a pop-rock song'] 🔈 OriginalDixielandJassBand-JazzMeBlues.ogg ['jazz standard', 'instrumental', 'jazz music', 'jazz'] ['Considered to be a jazz standard', 'is an jazz composition'] 🔈 Colin Ross - Etherea.ogg ['chirping birds', 'ambient percussion', 'new-age', 'flute', 'recorder', 'single instrument', 'woodwind'] ['features a single instrument with delayed echo, as well as ambient percussion and chirping birds', 'a new-age composition for recorder'] 🔈 Belau rekid (instrumental).oga ['instrumental', 'brass band'] ['an instrumental brass band performance'] ... ... ... Dataset structure We provide three variants of the dataset in the data folder. All are described in the paper. all.csv contains all the data we collected, without any filtering. filtered_sf.csv contains the data obtained using the self-filtering method. filtered_mc.csv contains the data obtained using the MusicCaps dataset method. File structure Each CSV file contains the following columns: file: the name of the audio file pageid: the ID of the Wikipedia article where the text was collected from aspects: the short-form (tag) description texts collected from the Wikipedia articles sentences: the long-form (caption) description texts collected from the Wikipedia articles audio_url: the URL of the audio file url: the URL of the Wikipedia article where the text was collected from Citation If you use this dataset in your research, please cite the following paper: @inproceedings{wikimute, title = {WikiMuTe: {A} Web-Sourced Dataset of Semantic Descriptions for Music Audio}, author = {Weck, Benno and Kirchhoff, Holger and Grosche, Peter and Serra, Xavier}, booktitle = "MultiMedia Modeling", year = "2024", publisher = "Springer Nature Switzerland", address = "Cham", pages = "42--56", doi = {10.1007/978-3-031-56435-2_4}, url = {https://doi.org/10.1007/978-3-031-56435-2_4},} License The data is available under the Creative Commons Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) license. Each entry in the dataset contains a URL linking to the article, where the text data was collected from.

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Related to Research communities