Single View Stereo Matching

Publication type: Article, Preprint, Conference object
Published: 01 Jun 2018
Embargo end date: 01 Jan 2018
Publisher: IEEE
Journal: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition

Authors: Yue Luo; Jimmy S. J. Ren; Mude Lin; Jiahao Pang; Wenxiu Sun; Hongsheng Li; Liang Lin

doi: 10.1109/cvpr.2018.00024 , 10.48550/arxiv.1803.02612

arXiv: 1803.02612

Abstract

Previous monocular depth estimation methods take a single view and directly regress the expected results. Although recent advances have been made by applying geometrically inspired loss functions during training, the inference procedure does not explicitly impose any geometric constraint. These models therefore rely purely on the quality of the data and the effectiveness of learning to generalize. This leads either to suboptimal results or to a demand for huge amounts of expensive ground-truth labelled data to produce reasonable results. In this paper, we show for the first time that the monocular depth estimation problem can be reformulated as two sub-problems, a view synthesis procedure followed by stereo matching, with two intriguing properties: i) geometric constraints can be explicitly imposed during inference; ii) the demand for labelled depth data can be greatly alleviated. We show that the whole pipeline can still be trained in an end-to-end fashion and that this new formulation plays a critical role in advancing performance. The resulting model outperforms all previous monocular depth estimation methods, as well as the stereo block matching method, on the challenging KITTI dataset while using only a small amount of real training data. The model also generalizes well to other monocular depth estimation benchmarks. We also discuss the implications and advantages of solving monocular depth estimation using stereo methods.
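
The two-stage formulation described in the abstract (synthesize a second view, then run stereo matching, then read depth off the disparity) can be illustrated with a toy sketch. This is not the authors' model: the abstract describes a learned pipeline trained end-to-end, whereas here the view synthesis stage is replaced by a fixed horizontal shift and the stereo stage by plain sum-of-absolute-differences block matching. All function names, the image size, and the focal length and baseline values are assumptions chosen for illustration. The final step uses the standard rectified-stereo relation depth = focal length x baseline / disparity.

```python
import random

INVALID_COST = 1e6  # penalty when a candidate disparity points outside the right image


def synthesize_right_view(left, disparity):
    """Stand-in for the view synthesis stage (assumption: one constant integer
    disparity). In a rectified pair, right pixel x shows left pixel x + disparity."""
    w = len(left[0])
    return [[row[x + disparity] if x + disparity < w else 0.0 for x in range(w)]
            for row in left]


def block_match(left, right, max_disp, radius):
    """Stand-in for the stereo matching stage: for every left pixel, pick the
    disparity with the lowest sum of absolute differences over a square window."""
    h, w = len(left), len(left[0])
    disp = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            best_cost, best_d = float("inf"), 0
            for d in range(max_disp + 1):
                cost = 0.0
                for dy in range(-radius, radius + 1):
                    for dx in range(-radius, radius + 1):
                        yy = min(max(y + dy, 0), h - 1)  # clamp window at borders
                        xx = min(max(x + dx, 0), w - 1)
                        if xx - d < 0:
                            cost += INVALID_COST
                        else:
                            cost += abs(left[yy][xx] - right[yy][xx - d])
                if cost < best_cost:
                    best_cost, best_d = cost, d
            disp[y][x] = best_d
    return disp


def disparity_to_depth(disp, focal_px, baseline_m):
    """Geometric constraint applied at inference: depth = f * B / d."""
    return [[focal_px * baseline_m / d if d > 0 else float("inf") for d in row]
            for row in disp]


# Illustrative values only (hypothetical camera parameters and scene).
TRUE_DISP, MAX_DISP, RADIUS = 4, 8, 2
FOCAL_PX, BASELINE_M = 720.0, 0.54
H, W = 12, 40

random.seed(0)
left = [[random.random() for _ in range(W)] for _ in range(H)]  # textured "left" image
right = synthesize_right_view(left, TRUE_DISP)                  # stage 1: view synthesis
disp = block_match(left, right, MAX_DISP, RADIUS)               # stage 2: stereo matching
depth = disparity_to_depth(disp, FOCAL_PX, BASELINE_M)          # disparity to metric depth
```

In this sketch the recovered disparity is reliable only for columns at least `TRUE_DISP + RADIUS` pixels from the left border, since pixels closer to the edge have no valid match in the synthesized view. The point of the decomposition is that the depth value comes from an explicit geometric relation between two views rather than from direct regression.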

Spotlight in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018

Related Organizations

Sun Yat-sen University
China (People's Republic of)
Chinese University of Hong Kong
China (People's Republic of)
SenseTime
Hong Kong

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

Impact by BIP!

selected citations: 135 (derived from selected sources; an alternative to the "influence" indicator)
popularity: Top 1% (the "current" impact or attention of the article in the research community, based on the underlying citation network)
influence: Top 10% (the overall impact of the article in the research community, based on the underlying citation network, diachronically)
impulse: Top 1% (the initial momentum of the article directly after its publication, based on the underlying citation network)

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering
