descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Jun 2016Embargo end date: 01 Jan 2016Publisher:IEEEJournal:2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Jacobsen, J.-H.; van Gemert, J.; Lou, Z.; Smeulders, A.W.M.;

doi: 10.1109/cvpr.2016.286 , 10.48550/arxiv.1605.02971

arXiv: http://arxiv.org/abs/1605.02971

handle: 11245.1/b363c1e9-631a-4c0f-9541-1c485098bb4a

Structured Receptive Fields in CNNs

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Learning powerful feature representations with CNNs is hard when training data are limited. Pre-training is one way to overcome this, but it requires large datasets sufficiently similar to the target domain. Another option is to design priors into the model, which can range from tuned hyperparameters to fully engineered representations like Scattering Networks. We combine these ideas into structured receptive field networks, a model which has a fixed filter basis and yet retains the flexibility of CNNs. This flexibility is achieved by expressing receptive fields in CNNs as a weighted sum over a fixed basis which is similar in spirit to Scattering Networks. The key difference is that we learn arbitrary effective filter sets from the basis rather than modeling the filters. This approach explicitly connects classical multiscale image analysis with general CNNs. With structured receptive field networks, we improve considerably over unstructured CNNs for small and medium dataset scenarios as well as over Scattering for large datasets. We validate our findings on ILSVRC2012, Cifar-10, Cifar-100 and MNIST. As a realistic small dataset example, we show state-of-the-art classification results on popular 3D MRI brain-disease datasets where pre-training is difficult due to a lack of large public datasets in a similar domain.

Reason for update: i) Fix Reference for "Deep roto-translation scattering for object classification" by Oyallon and Mallat. ii) Fixed two minor typos. iii) Removed implicit assumption in equation (4) where scale is represented with diffusion time and adapted to rest of paper where scale is represented with standard deviation, to avoid possible confusion

Related Organizations

Delft University of Technology - Faculty of Applied Sciences - Department of Chemical Engineering
Netherlands
University of Amsterdam
Netherlands
Delft University of Technology
Netherlands
UNIVERSITEIT VAN AMSTERDAM
Netherlands

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, 500, 510

1 Research products, page 1 of 1

gfnn software on GitHub
IsRelatedTo

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	44
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%