
doi: 10.1121/1.4782791
Two sets of generalized, perceptual-based features are investigated for use in classifying animal vocalizations. Since many species, especially mammals, share similar physical sound perception mechanisms which vary in size, two features sets commonly used in human speech processing, mel-frequency cepstral coefficients (MFCCs) and perceptual linear prediction (PLP) analysis, are modified for use in other species. One modification made to the feature extraction process is incorporating the frequency range of hearing and length of the basilar membrane of the animal in order to correctly determine the width and location of the critical band filters used for signal processing. Experimentally determined critical bands (equivalent rectangular bandwidth) and equal loudness curves (audiograms) can also be incorporated directly into the feature extraction process. Experiments are performed on African elephant (Loxodonta africana) vocalizations using a hidden Markov model (HMM) based classifier showing increased classification accuracy when using features sets based on the specific animals perceptual abilities compared to the original human perception-based feature sets.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
