Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks

Publication: Article, Conference object, 01 Jun 2014, France
Publisher: IEEE
Journal: 2014 IEEE Conference on Computer Vision and Pattern Recognition
Funded by: EC | ACTIVIA

Authors: Oquab, Maxime; Bottou, Léon; Laptev, Ivan; Sivic, Josef

doi: 10.1109/cvpr.2014.222

Abstract

Convolutional neural networks (CNNs) have recently shown outstanding image classification performance in the large-scale visual recognition challenge (ILSVRC2012). The success of CNNs is attributed to their ability to learn rich mid-level image representations, as opposed to the hand-designed low-level features used in other image classification methods. Learning CNNs, however, amounts to estimating millions of parameters and requires a very large number of annotated image samples. This property currently prevents the application of CNNs to problems with limited training data. In this work we show how image representations learned with CNNs on large-scale annotated datasets can be efficiently transferred to other visual recognition tasks with a limited amount of training data. We design a method to reuse layers trained on the ImageNet dataset to compute a mid-level image representation for images in the PASCAL VOC dataset. We show that, despite differences in image statistics and tasks in the two datasets, the transferred representation leads to significantly improved results for object and action classification, outperforming the current state of the art on the PASCAL VOC 2007 and 2012 datasets. We also show promising results for object and action localization.
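
The general idea of reusing trained layers can be illustrated with a toy sketch. This is not the authors' network or training procedure: the "pretrained" weights, the two-dimensional inputs, and the tiny target dataset below are all made up for illustration, and the names `frozen_features`, `predict`, and `train_new_layer` are hypothetical. The sketch only shows the pattern of keeping transferred weights fixed while fitting a new layer on a small target dataset.

```python
import math

def frozen_features(x, W):
    """Transferred layer: ReLU(W x). W stands in for pretrained weights and is never updated."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in W]

def predict(feat, v, b):
    """New target-task layer: a logistic unit on top of the frozen features."""
    z = sum(vi * fi for vi, fi in zip(v, feat)) + b
    return 1.0 / (1.0 + math.exp(-z))

def train_new_layer(data, W, epochs=200, lr=0.5):
    """Fit only the new layer (v, b) by gradient descent on the log loss; W stays fixed."""
    v, b = [0.0] * len(W), 0.0
    for _ in range(epochs):
        for x, y in data:
            f = frozen_features(x, W)
            err = predict(f, v, b) - y  # gradient of the log loss w.r.t. the pre-sigmoid score
            v = [vi - lr * err * fi for vi, fi in zip(v, f)]
            b -= lr * err
    return v, b

# Assumption: made-up "pretrained" weights and a four-example target dataset.
W_PRETRAINED = [[1.0, -1.0], [-1.0, 1.0]]
TARGET_DATA = [([2.0, 0.0], 1), ([1.5, 0.5], 1), ([0.0, 2.0], 0), ([0.5, 1.5], 0)]

v, b = train_new_layer(TARGET_DATA, W_PRETRAINED)
```

Only `v` and `b` change during training, so the number of parameters estimated from the small target dataset is much smaller than the number held in the reused layer.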

Country

France

Related Organizations

French National Centre for Scientific Research (France)
École Normale Supérieure (France)
PSL Research University (France)
Microsoft Research New York City (United States)
French Institute for Research in Computer Science and Automation (France)

Keywords

[INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]

Impact by BIP!

selected citations: 2K. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 0.01%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 0.01%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 0.01%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.