
doi: 10.3390/app10124176
handle: 11577/3349059 , 11585/784601
In this work, we combine a Siamese neural network and different clustering techniques to generate a dissimilarity space that is then used to train an SVM for automated animal audio classification. The animal audio datasets used are (i) birds and (ii) cat sounds, which are freely available. We exploit different clustering methods to reduce the spectrograms in the dataset to a number of centroids that are used to generate the dissimilarity space through the Siamese network. Once computed, we use the dissimilarity space to generate a vector space representation of each pattern, which is then fed into an support vector machine (SVM) to classify a spectrogram by its dissimilarity vector. Our study shows that the proposed approach based on dissimilarity space performs well on both classification problems without ad-hoc optimization of the clustering methods. Moreover, results show that the fusion of CNN-based approaches applied to the animal audio classification problem works better than the stand-alone CNNs.
Technology, QH301-705.5, T, Physics, QC1-999, pattern recognition, dissimilarity space, Engineering (General). Civil engineering (General), Chemistry, audio classification, ensemble of classifiers, Animal audio; Audio classification; Dissimilarity space; Ensemble of classifiers; Pattern recognition; Siamese network, siamese network, TA1-2040, Biology (General), QD1-999, animal audio
Technology, QH301-705.5, T, Physics, QC1-999, pattern recognition, dissimilarity space, Engineering (General). Civil engineering (General), Chemistry, audio classification, ensemble of classifiers, Animal audio; Audio classification; Dissimilarity space; Ensemble of classifiers; Pattern recognition; Siamese network, siamese network, TA1-2040, Biology (General), QD1-999, animal audio
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 27 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
