2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings.
DOI: 10.1109/cvpr.2003.1211480
|View full text |Cite
|
Sign up to set email alerts
|

3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints

Abstract: This paper presents a novel representation for three-dimensional objects in terms of affine-invariant image patches and their spatial relationships. Multi-view constraints associated with groups of patches are combined with a normalized representation of their appearance to guide matching and reconstruction, allowing the acquisition of true three-dimensional affine and Euclidean models from multiple images and their recognition in a single photograph taken from an arbitrary viewpoint. The proposed approach doe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
70
0

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 90 publications
(70 citation statements)
references
References 23 publications
0
70
0
Order By: Relevance
“…3 This allows the use of a sparse set of stereo training views (7 to 12 pairs in our experiments) for the modeling. We also extend to 3D object models the idea proposed in [6] in the image matching domain, and augment the model patches associated with interest points of [8] (called primary patches from now on) with more general secondary patches. This allows us to cover the object densely, utilize all the available texture information in the training images, and effectively handle clutter and occlusion in recognition tasks.…”
Section: Introductionmentioning
confidence: 99%
See 4 more Smart Citations
“…3 This allows the use of a sparse set of stereo training views (7 to 12 pairs in our experiments) for the modeling. We also extend to 3D object models the idea proposed in [6] in the image matching domain, and augment the model patches associated with interest points of [8] (called primary patches from now on) with more general secondary patches. This allows us to cover the object densely, utilize all the available texture information in the training images, and effectively handle clutter and occlusion in recognition tasks.…”
Section: Introductionmentioning
confidence: 99%
“…In this case, the 3D model explicitly integrates the various model views, but the determination of the 3D position and orientation of a patch on the object requires it to be visible in three or more training images [8], and hence requires a large number of closely separated training images for modeling the object. Also, [8] only makes use of patches centered at interest points, so the model constructed is sparse and does not encode all the available information in the training images. We tackle these issues by using calibrated stereo pairs to construct partial 3D object models and then register these models together to form a full model.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations