3D Convolutional Neural Networks for Human Action Recognition

Type: Article, Conference object

Published: 01 Jan 2013

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 35, pages 221-231 (issn: 0162-8828, eissn: 2160-9292)

Authors: Shuiwang Ji; Wei Xu; Ming Yang; Kai Yu

doi: 10.1109/tpami.2012.59

pmid: 22392705


Abstract

We consider the automated recognition of human actions in surveillance videos. Most current methods build classifiers based on complex handcrafted features computed from the raw inputs. Convolutional neural networks (CNNs) are a type of deep model that can act directly on the raw inputs. However, such models are currently limited to handling 2D inputs. In this paper, we develop a novel 3D CNN model for action recognition. This model extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames. The developed model generates multiple channels of information from the input frames, and the final feature representation combines information from all channels. To further boost the performance, we propose regularizing the outputs with high-level features and combining the predictions of a variety of different models. We apply the developed models to recognize human actions in the real-world environment of airport surveillance videos, and they achieve superior performance in comparison to baseline methods.
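The 3D convolution described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `conv3d_valid`, the nested-list layout `[frame][row][col]`, and the toy kernel are assumptions made for illustration, and the nonlinearity, multiple channels, and subsampling layers of a full model are omitted. As is conventional in CNNs, the kernel is applied without flipping (cross-correlation).

```python
from typing import List

# A stack of frames, indexed as [frame][row][col]. This layout is an
# assumption of this sketch, not something specified by the paper.
Volume = List[List[List[float]]]


def conv3d_valid(volume: Volume, kernel: Volume, bias: float = 0.0) -> Volume:
    """Slide a 3D kernel over a frame stack (stride 1, no padding)."""
    T, H, W = len(volume), len(volume[0]), len(volume[0][0])
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    if kt > T or kh > H or kw > W:
        raise ValueError("kernel must not exceed the volume in any dimension")

    out: Volume = []
    for t in range(T - kt + 1):            # temporal positions
        plane = []
        for y in range(H - kh + 1):        # vertical positions
            row = []
            for x in range(W - kw + 1):    # horizontal positions
                acc = bias
                # Each output value mixes pixels from kt adjacent frames,
                # which is how motion information enters the feature map.
                for dt in range(kt):
                    for dy in range(kh):
                        for dx in range(kw):
                            acc += kernel[dt][dy][dx] * volume[t + dt][y + dy][x + dx]
                row.append(acc)
            plane.append(row)
        out.append(plane)
    return out


# Toy input: three 2x2 frames in which pixels switch on over time.
frames = [
    [[0, 0], [0, 0]],
    [[0, 1], [0, 0]],
    [[0, 1], [1, 0]],
]

# A 2x1x1 temporal-difference kernel: next frame minus current frame.
diff_kernel = [[[-1.0]], [[1.0]]]

motion = conv3d_valid(frames, diff_kernel)
print(motion)  # each output frame marks where a pixel changed between frames
```

With a kernel that spans more than one frame, the output is nonzero only where the frames differ, which a 2D convolution applied to each frame separately cannot detect.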

Related Organizations

Baidu (China), China (People's Republic of)
Facebook (Israel), Israel
Old Dominion University, United States

Keywords

Imaging, Three-Dimensional; Movement; Subtraction Technique; Image Interpretation, Computer-Assisted; Neural Networks, Computer; Algorithms; Decision Support Techniques; Pattern Recognition, Automated

Impact by BIP!

selected citations: 4K. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).

popularity: Top 0.01%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.

influence: Top 0.01%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).

impulse: Top 0.01%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Open access route: bronze

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering
