
Segmenting continuous sensory input into coherent segments and subsegments is an important part of perception. Music is no exception. By shaping the acoustic properties of music during performance, musicians can strongly influence the perceived segmentation. Two main techniques musicians employ are the modulation of tempo and dynamics. Such variations carry important information for segmentation and lend themselves well to numerical analysis methods. In this article, based on tempo or loudness modulations alone, we propose a novel end-to-end Bayesian framework using dynamic programming to retrieve a musician's expressed segmentation. The method computes the credence of all possible segmentations of the recorded performance. The output is summarized in two forms: as a beat-by-beat profile revealing the posterior credence of plausible boundaries, and as expanded credence segment maps, a novel representation that converts readily to a segmentation lattice but retains information about the posterior uncertainty on the exact position of segments’ endpoints. To compare any two segmentation profiles, we introduce a method based on unbalanced optimal transport. Experimental results on the MazurkaBL dataset show that despite the drastic dimension reduction from the input data, the segmentation recovery is sufficient for deriving musical insights from comparative examination of recorded performances. This Bayesian segmentation method thus offers an alternative to binary boundary detection and finds multiple hypotheses fitting information from recorded music performances.
bayesian segmentation, comparative analysis, Bayesian inference, 610, [STAT.AP] Statistics [stat]/Applications [stat.AP], music performance, computational algorithm, Psychology, M1-5000, probabilistic segmentation, musical interpretation, music expressivity, dynamic programming, segmentation, [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD], 004, BF1-990, [SHS.MUSIQ] Humanities and Social Sciences/Musicology and performing arts, optimal transport, musical prosody, expressive performance, Music
bayesian segmentation, comparative analysis, Bayesian inference, 610, [STAT.AP] Statistics [stat]/Applications [stat.AP], music performance, computational algorithm, Psychology, M1-5000, probabilistic segmentation, musical interpretation, music expressivity, dynamic programming, segmentation, [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD], 004, BF1-990, [SHS.MUSIQ] Humanities and Social Sciences/Musicology and performing arts, optimal transport, musical prosody, expressive performance, Music
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
