
handle: 10023/21545
AbstractVideo data are widely collected in ecological studies but manual annotation is a challenging and time-consuming task, and has become a bottleneck for scientific research. Classification models based on convolutional neural networks (CNNs) have proved successful in annotating images, but few applications have extended these to video classification. We demonstrate an approach that combines a standard CNN summarizing each video frame with a recurrent neural network (RNN) that models the temporal component of video. The approach is illustrated using two datasets: one collected by static video cameras detecting seal activity inside coastal salmon nets, and another collected by animal-borne cameras deployed on African penguins, used to classify behaviour. The combined RNN-CNN led to a relative improvement in test set classification accuracy over an image-only model of 25% for penguins (80% to 85%), and substantially improved classification precision or recall for four of six behaviour classes (12–17%). Image-only and video models classified seal activity with equally high accuracy (90%). Temporal patterns related to movement provide valuable information about animal behaviour, and classifiers benefit from including these explicitly. We recommend the inclusion of temporal information whenever manual inspection suggests that movement is predictive of class membership.
QA75, Image classification, QA75 Electronic computers. Computer science, QH301 Biology, automated detection, Video classification, 510, Automated detection, QH301, animal‐borne video, QH540-549.5, GC, Animal-borne video, Ecology, deep learning, Deep learning, DAS, neural networks, 004, GC Oceanography, video classification, Neural networks, image classification
QA75, Image classification, QA75 Electronic computers. Computer science, QH301 Biology, automated detection, Video classification, 510, Automated detection, QH301, animal‐borne video, QH540-549.5, GC, Animal-borne video, Ecology, deep learning, Deep learning, DAS, neural networks, 004, GC Oceanography, video classification, Neural networks, image classification
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 16 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
