Scene Parsing With Integration of Parametric and Non-Parametric Models

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Other literature type 01 May 2016Embargo end date: 01 Jan 2016Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Image Processing, volume 25, pages 2,379-2,391 (issn: 1057-7149, eissn: 1941-0042,

Copyright policy )

Authors: Bing Shuai; Zhen Zuo; Gang Wang 0012; Bing Wang 0003;

doi: 10.1109/tip.2016.2533862 , 10.48550/arxiv.1604.05848

pmid: 26929044

arXiv: 1604.05848

Scene Parsing With Integration of Parametric and Non-Parametric Models

- Summary
- Subjects
- Metrics

Abstract

We adopt Convolutional Neural Networks (CNNs) to be our parametric model to learn discriminative features and classifiers for local patch classification. Based on the occurrence frequency distribution of classes, an ensemble of CNNs (CNN-Ensemble) are learned, in which each CNN component focuses on learning different and complementary visual patterns. The local beliefs of pixels are output by CNN-Ensemble. Considering that visually similar pixels are indistinguishable under local context, we leverage the global scene semantics to alleviate the local ambiguity. The global scene constraint is mathematically achieved by adding a global energy term to the labeling energy function, and it is practically estimated in a non-parametric framework. A large margin based CNN metric learning method is also proposed for better global belief estimation. In the end, the integration of local and global beliefs gives rise to the class likelihood of pixels, based on which maximum marginal inference is performed to generate the label prediction maps. Even without any post-processing, we achieve state-of-the-art results on the challenging SiftFlow and Barcelona benchmarks.

13 Pages, 6 figures, IEEE Transactions on Image Processing (T-IP) 2016

Related Organizations

Nanyang Technological University
Singapore

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Image processing (compression, reconstruction, etc.) in information and communication theory

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	15
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%