2014 IEEE Conference on Computer Vision and Pattern Recognition 2014
DOI: 10.1109/cvpr.2014.210
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Detect Ground Control Points for Improving the Accuracy of Stereo Matching

Abstract: While machine learning has been instrumental to the ongoing progress in most areas of computer vision, it has not been applied to the problem of stereo matching with similar frequency or success. We present a supervised learning approach for predicting the correctness of stereo matches based on a random forest and a set of features that capture various forms of information about each pixel.We show highly competitive results in predicting the correctness of matches and in confidence estimation, which allows us … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
139
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 104 publications
(139 citation statements)
references
References 25 publications
0
139
0
Order By: Relevance
“…A set of active matchers is selected from a potentially very large initial pool and random forest classifiers are trained to select the disparity that is most likely to be correct for each pixel of the left image. The classifiers, even with a small number of features compared to [14,34], are always able to surpass the accuracy of the best matcher in the active set. Our method combines eight matchers and achieves an error rate before post-processing (5.82%) that is significantly smaller than that of the best individual matcher (8.06%).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…A set of active matchers is selected from a potentially very large initial pool and random forest classifiers are trained to select the disparity that is most likely to be correct for each pixel of the left image. The classifiers, even with a small number of features compared to [14,34], are always able to surpass the accuracy of the best matcher in the active set. Our method combines eight matchers and achieves an error rate before post-processing (5.82%) that is significantly smaller than that of the best individual matcher (8.06%).…”
Section: Discussionmentioning
confidence: 99%
“…It uses a number of features computed on the images, disparity maps and matching cost volume and a random forest classifier that is able to reject false matches with high accuracy. Spyropoulos et al [34] use a random forest to predict match correctness and to select ground control points which provide constraints that improve the overall accuracy. Zbontar and LeCun [40] trained a convolutional neural network (CNN) to predict whether two image patches match or not.…”
Section: Related Workmentioning
confidence: 99%
“…Haeusler et al [14] trained a random forest classifier using a number of confidence measures as features to make predictions about the correctness of the outputs of the semi-global matching algorithm. Spyropoulos et al [34] used a similar classification approach, but also demonstrated that such a classifier can be used to select ground control points, which in turn can help improve the accuracy of the input disparity maps. Park and Yoon [29] also use a number of confidence measures as features in a random forest classifier that predicts the correctness of WTA disparities.…”
Section: Related Workmentioning
confidence: 99%
“…Recent research on confidence estimation [14,34,29] integrates multiple sources of information within a discriminative learning framework, instead of relying on a single feature for each pixel [11,16]. As expected, taking into account multiple features leads to improved performance since different failure modes can be detected, while individual features typically respond to one failure mode.…”
Section: Introductionmentioning
confidence: 99%
“…The patch consists in a two channels. 1st channel is coming from an idea that neighboring pixels on a disparity map D 1 which have consistent disparities are more likely to be correct matching [7]. In 2nd channel, a disparity D 2 from another image is considered such that the matches from left to right image should be consistent with those from right to left [1].…”
mentioning
confidence: 99%