
pmid: 20431125
The capacity for real-time synchronization and coordination is a common ability among trained musicians performing a music score that presents an interesting challenge for machine intelligence. Compared to speech recognition, which has influenced many music information retrieval systems, music's temporal dynamics and complexity pose challenging problems to common approximations regarding time modeling of data streams. In this paper, we propose a design for a real-time music-to-score alignment system. Given a live recording of a musician playing a music score, the system is capable of following the musician in real time within the score and decoding the tempo (or pace) of its performance. The proposed design features two coupled audio and tempo agents within a unique probabilistic inference framework that adaptively updates its parameters based on the real-time context. Online decoding is achieved through the collaboration of the coupled agents in a Hidden Hybrid Markov/semi-Markov framework, where prediction feedback of one agent affects the behavior of the other. We perform evaluations for both real-time alignment and the proposed temporal model. An implementation of the presented system has been widely used in real concert situations worldwide and the readers are encouraged to access the actual system and experiment the results.
[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Realtime Processing, Anticipatory Systems, Time Factors, [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing, Antescofo, [INFO] Computer Science [cs], Informatique musicale, Pattern Recognition, Automated, [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Artificial Intelligence, Humans, score following, Real-time systems, Hidden Markov Models, Score Following, [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing, [INFO.INFO-MM] Computer Science [cs]/Multimedia [cs.MM], Stochastic Processes, Models, Statistical, Anticipatory Modeling, Signal Processing, Computer-Assisted, [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG], Duration Models, [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD], [INFO.INFO-IA] Computer Science [cs]/Computer Aided Engineering, Markov Chains, [SHS.MUSIQ] Humanities and Social Sciences/Musicology and performing arts, [INFO.INFO-OH] Computer Science [cs]/Other [cs.OH], Audio to Score Alignment, antescofo, Auditory Perception, [INFO.INFO-HC] Computer Science [cs]/Human-Computer Interaction [cs.HC], Algorithms, Music
[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Realtime Processing, Anticipatory Systems, Time Factors, [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing, Antescofo, [INFO] Computer Science [cs], Informatique musicale, Pattern Recognition, Automated, [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Artificial Intelligence, Humans, score following, Real-time systems, Hidden Markov Models, Score Following, [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing, [INFO.INFO-MM] Computer Science [cs]/Multimedia [cs.MM], Stochastic Processes, Models, Statistical, Anticipatory Modeling, Signal Processing, Computer-Assisted, [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG], Duration Models, [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD], [INFO.INFO-IA] Computer Science [cs]/Computer Aided Engineering, Markov Chains, [SHS.MUSIQ] Humanities and Social Sciences/Musicology and performing arts, [INFO.INFO-OH] Computer Science [cs]/Other [cs.OH], Audio to Score Alignment, antescofo, Auditory Perception, [INFO.INFO-HC] Computer Science [cs]/Human-Computer Interaction [cs.HC], Algorithms, Music
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 60 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
