Learning Sparse Representations for Human Action Recognition

Publication: Article, 01 Aug 2012
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 34, pages 1576-1588 (ISSN: 0162-8828, eISSN: 2160-9292)

Authors: Tanaya Guha; Rabab K. Ward

doi: 10.1109/tpami.2011.253

pmid: 22745001


Abstract

This paper explores the effectiveness of sparse representations, obtained by learning an overcomplete basis (dictionary), for action recognition in videos. Although this work concentrates on recognizing human movements (physical actions as well as facial expressions), the proposed approach is fairly general and can be used to address other classification problems. To model human actions, three overcomplete dictionary learning frameworks are investigated. An overcomplete dictionary is constructed from a set of spatio-temporal descriptors (extracted from the video sequences) in such a way that each descriptor is represented by a linear combination of a small number of dictionary elements. This leads to a more compact and richer representation of the video sequences than existing methods that involve clustering and vector quantization. For each framework, a novel classification algorithm is proposed. This work also presents the idea of a new local spatio-temporal feature that is distinctive, scale invariant, and fast to compute. The proposed approach repeatedly achieves state-of-the-art results on several public data sets containing various physical actions and facial expressions.
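To make the central idea concrete, the following is a minimal sketch of sparse coding over an overcomplete dictionary, assuming NumPy. It is a generic illustration, not the paper's method: the abstract does not specify the three dictionary learning frameworks or their classification algorithms. The functions `omp`, `classify`, and `random_dictionary` are hypothetical names, the dictionaries are random rather than learned, and the residual-based class decision is only one common way such representations can be used for classification.

```python
import numpy as np

def omp(D, x, k):
    """Greedy Orthogonal Matching Pursuit: approximate x using at most k columns (atoms) of D."""
    x = x.astype(float)
    residual = x.copy()
    support = []                      # indices of the atoms selected so far
    sol = np.zeros(0)
    for _ in range(k):
        # Select the atom most correlated with the current residual.
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j in support:
            break
        support.append(j)
        # Refit the coefficients of all selected atoms by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    coef = np.zeros(D.shape[1])
    coef[support] = sol
    return coef

def classify(x, dictionaries, k):
    """Return the index of the class dictionary with the smallest k-sparse reconstruction error."""
    errors = [np.linalg.norm(x - D @ omp(D, x, k)) for D in dictionaries]
    return int(np.argmin(errors))

rng = np.random.default_rng(0)

def random_dictionary(n_dims, n_atoms):
    """Stand-in for a learned dictionary (assumption): random atoms with unit norm."""
    D = rng.standard_normal((n_dims, n_atoms))
    return D / np.linalg.norm(D, axis=0)

# Overcomplete: 50 atoms for 20-dimensional descriptors.
D = random_dictionary(20, 50)
x = 2.0 * D[:, 3] - 1.5 * D[:, 17]    # a descriptor built from two atoms
a = omp(D, x, k=3)                    # sparse code with at most 3 nonzeros

# Two hypothetical class-specific dictionaries; the query is an atom of class 1.
class_dicts = [random_dictionary(20, 50), random_dictionary(20, 50)]
query = class_dicts[1][:, 7].copy()
label = classify(query, class_dicts, k=3)
```

In contrast to vector quantization, which assigns each descriptor to a single cluster centre, the sparse code `a` spreads the descriptor over a few atoms with real-valued weights, which is the sense in which the abstract describes the representation as more compact and richer.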

Related Organizations

University of British Columbia
Canada

Keywords

Databases, Factual; Movement; Video Recording; Models, Theoretical; Pattern Recognition, Automated; Facial Expression; Artificial Intelligence; Terminology as Topic; Image Processing, Computer-Assisted; Humans; Dancing; Algorithms; Sports

Impact by BIP!

selected citations: 275. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 1%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 1%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 0.1%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.