
In recent years, the guitar has received increased attention from the music information retrieval (MIR) community driven by the challenges posed by its diverse playing techniques and sonic characteristics. Mainly fueled by deep learning approaches, progress has been limited by the scarcity and limited annotations of datasets. To address this, we present the Guitar On Audio and Tablatures (GOAT) dataset, comprising 5.9 hours of unique high-quality direct input audio recordings of electric guitars from a variety of different guitars and players. We also present an effective data augmentation strategy using guitar amplifiers which delivers near-unlimited tonal variety, of which we provide a starting 29.5 hours of audio. Each recording is annotated using guitar tablatures, a guitar-specific symbolic format supporting string and fret numbers, as well as numerous playing techniques. For this we utilise both the Guitar Pro format, a software for tablature playback and editing, and a text-like token encoding. Furthermore, we present competitive results using GOAT for MIDI transcription and preliminary results for a novel approach to automatic guitar tablature transcription. We hope that GOAT opens up the possibilities to train novel models on a wide variety of guitar-related MIR tasks, from synthesis to transcription to playing technique detection.
To be published in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2025
[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], FOS: Computer and information sciences, Sound (cs.SD), Audio effects, Style transfer, Synthesis, Guitar, Sound, Artificial Intelligence (cs.AI), Artificial Intelligence, Audio and Speech Processing (eess.AS), Flow matching, FOS: Electrical engineering, electronic engineering, information engineering, [INFO.INFO-IR] Computer Science [cs]/Information Retrieval [cs.IR], Audio and Speech Processing
[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], FOS: Computer and information sciences, Sound (cs.SD), Audio effects, Style transfer, Synthesis, Guitar, Sound, Artificial Intelligence (cs.AI), Artificial Intelligence, Audio and Speech Processing (eess.AS), Flow matching, FOS: Electrical engineering, electronic engineering, information engineering, [INFO.INFO-IR] Computer Science [cs]/Information Retrieval [cs.IR], Audio and Speech Processing
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
