
This dataset comprises audio excerpts and manually done annotations of the melodic phrases in Carnatic and Hindustani music. This dataset can be used to develop and evaluate approaches for computing melodic similarity between short-time melodic patterns in Indian art music. This dataset is divided into two parts, one for Carnatic music (CMD), and the other for Hindustani music (HMD). There are two versions of the dataset available: Original version These two datasets, CMD and HMD are compiled originally by the authors of iswar2013 and ross2012, respectively. Though, they have evolved over time and have been recompiled along with the extracted audio features. Please cite if you use the material shared here in your research work. Gulati, S., Serrà, J., & Serra, X. (2015). An evaluation of methodologies for melodic similarity in audio recordings of Indian art music. In Proceedings of the 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 678–682. Brisbane, Australia. [Postprint PDF@MTG] Improved version It was found that several instances of melodic phrases were not marked in the annotations. The missing phrases have been added in the improved version of the dataset. Please cite the following publication if you use the material shared here in your research work. Gulati, S., Serrà, J., & Serra, X. (2015). Improving melodic similarity in Indian art music using culture-specific melodic characteristics. In Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR), pp. 680–686. Málaga, Spain. [Postprint PDF@MTG] Dataset structure The dataset is divided into two parts, Carnatic and Hindustani. Carnatic has 23 folders for each song. In each folder, there are the following files named: .mp3: Performance audio. .anot: Contains the original annotations. .anotEdit1: Contains the improved annotations. .flatSegNyas: Contains nyas annotations. .pitch: Contains original pitch annotations. .pitchSilIntrpPP: Contains improved pitch annotations. .tonic: Tonic of the performance. .tonicFine: Finetuned tonic of the performance. Hindustani has 9 folders for each song. In each folder, there are the following files named: .wav: Performance audio. .anot: Contains the original annotations. .anotEdit4: Contains the improved annotations. .flatSegNyas: Contains nyas annotations. .tpe: Contains original pitch annotations. .tpe5msSilIntrpPP: Contains improved pitch annotations. .tonic: Tonic of the performance. .tonicFine: Finetuned tonic of the performance. Annotation file contains tab separated values with format as: Mirdata This dataset is included in mirdata. Use the following code snippet to access the dataset in mirdata. # Import midata import mirdata # Initialize dataset dataset_name = 'iam_melodic_similarity' data_home = 'mirdata/dataset' dataset = mirdata.initialize(dataset_name, data_home=data_home) # Download dataset dataset.download() # Validate dataset dataset.validate() # Load dataset as a dictionary with track ids as keys and track objects as values data = dataset.load_tracks() Contact If you have any questions or comments about the dataset, please feel free to email: mtg-info@upf.edu
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
