Object removal by exemplar-based inpainting

Criminisi, Antonio; Pérez, Patrick; Toyama, Kentaro

doi:10.1109/cvpr.2003.1211538

Cited by 659 publications

(570 citation statements)

References 23 publications

Supporting

Mentioning

565

Contrasting

Unclassified


“…In Fg-Mask, however, no appearance information about the background is available. We therefore apply the technique of [28] to inpaint this area, which, to the best of our knowledge, remains the most mature method when it comes to depth completion without intensity information. This yields two baselines, which we will refer to as Baseline-1 (semantic segmentation followed by [29] + [28]) and Baseline-2 (semantic segmentation followed by [27] + [28]).…”

Section: Methods (mentioning)

confidence: 99%

“…We therefore apply the technique of [28] to inpaint this area, which, to the best of our knowledge, remains the most mature method when it comes to depth completion without intensity information. This yields two baselines, which we will refer to as Baseline-1 (semantic segmentation followed by [29] + [28]) and Baseline-2 (semantic segmentation followed by [27] + [28]). To compare the different algorithms, we make use of the following metrics: 1) visible-rmse: the root-mean-square error (rmse) for the entire depth map; 2) hidden-rmse: the rmse for the depth map hallucinated underneath the ground truth foreground mask.…”

Section: Methods (mentioning)

confidence: 99%
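
The two metrics named in the statement above can be sketched as a masked root-mean-square error. This is a minimal illustration, not the citing paper's evaluation code: the function name `rmse`, the toy arrays, and the use of a single predicted depth map for both metrics are assumptions made here (the paper evaluates a visible and a hidden layer).

```python
import numpy as np

def rmse(pred, gt, mask=None):
    """Root-mean-square error between two depth maps.

    If `mask` is given, only pixels where the mask is True are compared.
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)
    if mask is None:
        mask = np.ones(gt.shape, dtype=bool)
    mask = np.asarray(mask, dtype=bool)
    if not mask.any():
        raise ValueError("mask selects no pixels")
    diff = pred[mask] - gt[mask]
    return float(np.sqrt(np.mean(diff ** 2)))

# Toy data (hypothetical): a flat scene at depth 10 with a 2x2 foreground mask.
gt = np.full((4, 4), 10.0)
fg_mask = np.zeros((4, 4), dtype=bool)
fg_mask[1:3, 1:3] = True

# The prediction is off by 2 only underneath the foreground mask.
pred = gt.copy()
pred[fg_mask] += 2.0

visible_rmse = rmse(pred, gt)          # error over the entire depth map
hidden_rmse = rmse(pred, gt, fg_mask)  # error underneath the foreground mask only
```

Restricting the error to the foreground mask isolates the hallucinated region, which would otherwise be averaged out by the much larger number of correctly measured pixels.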

“…In [27], depth completion is formulated within a total variation framework where image cues guide the completion process. A different approach to depth completion consists of treating a depth map as an intensity image, and relying on standard image inpainting algorithms, such as [28] and [29]. All the above-mentioned methods focus on depth completion from a single view and aim at completing the visible scene information only. By contrast, some approaches have proposed to exploit multiple views [30,14] and thus can handle the fact that parts of the scene are hidden in some views, albeit not all of them.…”

Section: Related Work (mentioning)

confidence: 99%


Building Scene Models by Completing and Hallucinating Depth and Semantics

Liu; Salzmann (2016)

Computer Vision – ECCV 2016


Abstract. Building 3D scene models has been a longstanding goal of computer vision. The great progress in depth sensors brings us one step closer to achieving this in a single shot. However, depth sensors still produce imperfect measurements that are sparse and contain holes. While depth completion aims at tackling this issue, it ignores the fact that some regions of the scene are occluded by the foreground objects. Building a scene model would therefore require hallucinating the depth behind these objects. In contrast with existing methods that either rely on manual input, or focus on the indoor scenario, we introduce a fully-automatic method to jointly complete and hallucinate depth and semantics in challenging outdoor scenes. To this end, we develop a two-layer model representing both the visible information and the hidden one. At the heart of our approach lies a formulation based on the Mumford-Shah functional, for which we derive an effective optimization strategy. Our experiments show that our approach can accurately fill the large holes in the input depth maps, segment the different kinds of objects in the scene, and hallucinate the depth and semantics behind the foreground objects.


“…More specifically, it requires that the pixel values of the missing regions follow the same statistical or geometric structures as the rest of the image. In the literature, based on different assumptions on the (local or global) statistics or structures of the input image, many methods which can directly or indirectly deal with image completion problems have been developed for different types of images or textures: from the synthesis of stationary stochastic random textures [1][2][3], to the inpainting of piece-wise smooth natural images [4][5][6], and to the synthesis of highly symmetric and highly-structured regular textures [7]. In this paper, we focus on the class of images or textures whose structures have very low intrinsic dimensionality or complexity.…”

Section: Introduction (mentioning)

confidence: 99%

Repairing Sparse Low-Rank Texture

Liang; Ren; Zhang; et al. (2012)

Lecture Notes in Computer Science


Abstract. In this paper, we show how to harness both low-rank and sparse structures in regular or near regular textures for image completion. Our method leverages recent convex optimization techniques for low-rank and sparse signal recovery and can automatically and correctly repair the global structure of a corrupted texture, even without precise information about the regions to be completed. Through extensive simulations, we show our method can complete and repair textures corrupted by errors with both random and contiguous supports better than existing low-rank matrix recovery methods. Through experimental comparisons with existing image completion systems (such as Photoshop), our method demonstrates a significant advantage over local patch based texture synthesis techniques in dealing with large corruption, non-uniform texture, and large perspective deformation.
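
For orientation, the generic convex program for low-rank plus sparse recovery (principal component pursuit) is written out below. It is shown only as the standard form that this family of methods builds on; the abstract does not state the paper's exact objective, and the symbols D (observed texture), A (low-rank part), E (sparse corruption), and the weight λ are notation assumed here.

```latex
\min_{A,\,E} \; \|A\|_{*} + \lambda \,\|E\|_{1}
\quad \text{subject to} \quad D = A + E
```

Here \|A\|_{*} is the nuclear norm (the sum of the singular values of A), a convex surrogate for rank, and \|E\|_{1} is the entrywise l1 norm, a convex surrogate for the number of corrupted pixels.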


“…Exemplar-based algorithms, e.g. [13], fill in missing image data or remove certain image objects by searching for similar regions or structures in the image and completing the missing data or overwriting the data to be removed according to the information found there. A very popular denoising approach that is motivated by the inpainting method of Efros and Leung [15] is the so-called nonlocal means (NL means) algorithm for image denoising that has been proposed by Buades et al. [7,8].…”

Section: Introduction (mentioning)

confidence: 99%
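
The search-and-copy step described in the statement above can be sketched as follows. This is a simplified illustration under stated assumptions, not the method of Criminisi et al.: it fills one patch by exhaustive sum-of-squared-differences search over fully known patches and omits the priority-based fill order of the cited paper. The names `best_exemplar` and `fill_patch` and the striped toy image are hypothetical.

```python
import numpy as np

def best_exemplar(image, known, center, half=1):
    """Return the centre of the fully known patch that best matches the patch
    at `center`, comparing only the pixels already known in the target."""
    r, c = center
    target = image[r - half:r + half + 1, c - half:c + half + 1]
    valid = known[r - half:r + half + 1, c - half:c + half + 1]
    best, best_cost = None, np.inf
    h, w = image.shape
    for i in range(half, h - half):
        for j in range(half, w - half):
            window = (slice(i - half, i + half + 1), slice(j - half, j + half + 1))
            if not known[window].all():
                continue  # source patches must contain no missing pixels
            cost = float(np.sum(((image[window] - target) ** 2)[valid]))
            if cost < best_cost:
                best, best_cost = (i, j), cost
    if best is None:
        raise ValueError("no fully known source patch available")
    return best

def fill_patch(image, known, center, half=1):
    """Copy the missing pixels of the target patch from its best exemplar."""
    image, known = image.copy(), known.copy()
    r, c = center
    si, sj = best_exemplar(image, known, center, half)
    dst = (slice(r - half, r + half + 1), slice(c - half, c + half + 1))
    src = (slice(si - half, si + half + 1), slice(sj - half, sj + half + 1))
    missing = ~known[dst]
    patch = image[dst]                 # a view into the copied image
    patch[missing] = image[src][missing]
    known[dst] = True
    return image, known

# Toy data (hypothetical): vertical stripes with one missing pixel.
truth = np.tile(np.arange(8) % 2, (8, 1)).astype(float)
image = truth.copy()
known = np.ones(truth.shape, dtype=bool)
known[4, 3] = False
image[4, 3] = 0.0  # placeholder value inside the hole

filled, filled_known = fill_patch(image, known, (4, 3))
```

On the striped toy image the hole is restored exactly, because some fully known patch agrees with every known pixel around it.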

Rotationally invariant similarity measures for nonlocal image denoising

Grewenig; Zimmer; Weickert (2011)

Journal of Visual Communication and Image Representation


Many natural or texture images contain structures that appear several times in the image. One of the denoising filters that successfully take advantage of such repetitive regions is the nonlocal means filter. It is simple and yields very good denoising results. Unfortunately, the block matching within the standard nonlocal means filter is not able to handle rotation or mirroring. Rotated or mirrored instances are not detected as variations of the corresponding original structures. In this paper, we analyse two natural approaches for a rotationally invariant similarity measure that will be used as an alternative to, or a modification of, the well-known block matching algorithm in nonlocal means denoising. The first approach is based on similarity distances computed with the help of moment invariants, whereas the second one estimates the rotation angle, rotates the block via interpolation, and then uses a standard block matching. In contrast to the standard method, the presented algorithms can find similar regions or patches in an image even if they appear in several rotated or mirrored instances. With this modification, the nonlocal means filter is able to find more suitable regions for its weighted average.
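
The limitation described in this abstract can be illustrated with a small sketch. It is not either of the paper's two approaches: as a simplifying assumption, the invariance here is restricted to the eight 90-degree rotations and mirrorings of a block, which need no moment invariants, angle estimation, or interpolation. The function names and the bandwidth parameter `h` are hypothetical.

```python
import numpy as np

def block_distance(p, q):
    """Standard block matching: mean squared difference of two patches."""
    return float(np.mean((p - q) ** 2))

def invariant_distance(p, q):
    """Smallest block distance over the 8 rotated/mirrored versions of q."""
    candidates = []
    for k in range(4):
        rotated = np.rot90(q, k)
        candidates.append(rotated)
        candidates.append(np.fliplr(rotated))
    return min(block_distance(p, cand) for cand in candidates)

def nl_weight(distance, h=0.1):
    """Nonlocal means weight: patches at small distance get weight near 1."""
    return float(np.exp(-distance / (h * h)))

# Toy data (hypothetical): a vertical edge and the same edge rotated by 90 degrees.
patch = np.array([[1.0, 0.0, 0.0],
                  [1.0, 0.0, 0.0],
                  [1.0, 0.0, 0.0]])
rotated_patch = np.rot90(patch)

d_standard = block_distance(patch, rotated_patch)
d_invariant = invariant_distance(patch, rotated_patch)
w_standard = nl_weight(d_standard)
w_invariant = nl_weight(d_invariant)
```

Standard block matching treats the rotated edge as a dissimilar patch and gives it a negligible weight, while the invariant distance is zero, so the patch would contribute fully to the weighted average.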
