Personalizing Human Video Pose Estimation

Type: Article, Preprint, Conference object
Date: 01 Jun 2016
Embargo end date: 01 Jan 2015
Country: United Kingdom
Publisher: IEEE
Journal: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Funded by: UKRI | Learning to Recognise Dyn...

Authors: James Charles; Tomas Pfister; Derek R. Magee; David C. Hogg; Andrew Zisserman

doi: 10.1109/cvpr.2016.334, 10.48550/arxiv.1511.06676

arXiv: 1511.06676

Abstract

We propose a personalized ConvNet pose estimator that automatically adapts itself to the uniqueness of a person's appearance to improve pose estimation in long videos. We make the following contributions: (i) we show that given a few high-precision pose annotations, e.g. from a generic ConvNet pose estimator, additional annotations can be generated throughout the video using a combination of image-based matching for temporally distant frames, and dense optical flow for temporally local frames; (ii) we develop an occlusion aware self-evaluation model that is able to automatically select the high-quality and reject the erroneous additional annotations; and (iii) we demonstrate that these high-quality annotations can be used to fine-tune a ConvNet pose estimator and thereby personalize it to lock on to key discriminative features of the person's appearance. The outcome is a substantial improvement in the pose estimates for the target video using the personalized ConvNet compared to the original generic ConvNet. Our method outperforms the state of the art (including top ConvNet methods) by a large margin on two standard benchmarks, as well as on a new challenging YouTube video dataset. Furthermore, we show that training from the automatically generated annotations can be used to improve the performance of a generic ConvNet on other benchmarks.
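As a rough illustration of contributions (i) and (ii), the sketch below propagates joint annotations from one frame to the next with a precomputed dense flow field, then keeps only candidates whose quality score passes a threshold. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the array layouts, the nearest-pixel flow lookup, and the simple score threshold (standing in for the occlusion-aware self-evaluation model) are all hypothetical, and the image-based matching for temporally distant frames is not shown.

```python
import numpy as np


def propagate_joints(joints, flow):
    """Move joints from frame t to frame t+1 using a dense flow field.

    joints: (J, 2) float array of (x, y) pixel coordinates (assumed layout).
    flow:   (H, W, 2) float array where flow[y, x] = (dx, dy) (assumed layout).
    """
    h, w = flow.shape[:2]
    # Nearest-pixel lookup of the flow vector under each joint,
    # clipped so joints near the border stay inside the image.
    xs = np.clip(np.rint(joints[:, 0]).astype(int), 0, w - 1)
    ys = np.clip(np.rint(joints[:, 1]).astype(int), 0, h - 1)
    return joints + flow[ys, xs]


def select_annotations(candidates, scores, threshold=0.5):
    """Keep candidate poses whose quality score reaches the threshold.

    The score is a hypothetical stand-in for a learned self-evaluation
    model; here it is simply supplied by the caller.
    """
    keep = scores >= threshold
    return candidates[keep], keep


# Toy example: every pixel moves 2 px right and 1 px up.
flow = np.zeros((50, 60, 2))
flow[..., 0] = 2.0
flow[..., 1] = -1.0
joints = np.array([[10.0, 20.0], [30.0, 40.0]])
moved = propagate_joints(joints, flow)

# Two candidate poses with made-up quality scores.
candidates = np.stack([joints, moved])
scores = np.array([0.9, 0.2])
kept, mask = select_annotations(candidates, scores)
```

In a full pipeline the retained poses would serve as extra training annotations for fine-tuning the pose estimator on the target video, which is contribution (iii) of the abstract.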

CVPR 2016

Country

United Kingdom

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

Impact by BIP!

selected citations: 53. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.