2021
DOI: 10.48550/arxiv.2103.15068
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ManhattanSLAM: Robust Planar Tracking and Mapping Leveraging Mixture of Manhattan Frames

Abstract: In this paper, a robust RGB-D SLAM system is proposed to utilize the structural information in indoor scenes, allowing for accurate tracking and efficient dense mapping on a CPU. Prior works have used the Manhattan World (MW) assumption to estimate low-drift camera pose, in turn limiting the applications of such systems. This paper, in contrast, proposes a novel approach delivering robust tracking in MW and non-MW environments. We check orthogonal relations between planes to directly detect Manhattan Frames, m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(12 citation statements)
references
References 39 publications
0
12
0
Order By: Relevance
“…Despite its advantages, this method can not reduce the long-term rotation error as the MW assumption does. Another solution is proposed in [13], where the authors use either a decoupled or a non-decoupled tracking strategy depending on whether the scene meets the MW assumption. These strategies permit these works to not only focus on a specific environment.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Despite its advantages, this method can not reduce the long-term rotation error as the MW assumption does. Another solution is proposed in [13], where the authors use either a decoupled or a non-decoupled tracking strategy depending on whether the scene meets the MW assumption. These strategies permit these works to not only focus on a specific environment.…”
Section: Related Workmentioning
confidence: 99%
“…This assumption is fundamentally used during the tracking stage [8]- [12]. Nonetheless, these methods do not usually take into account that some indoor environments are not strictly adhering to this assumption, leading to degradation in accuracy or even to tracking failures [13]. Additionally, most of these works rely on planes to estimate and track the MA.…”
Section: Introductionmentioning
confidence: 99%
“…Based on RGB-D images, tracking and mapping methods [25] are used to build global 3D maps that are bridges for complete scene understanding systems. Multi-featurebased trackers [17], [18] achieve robust estimations in indoor scenes, but those mapping parts aim to maintain sparse features for removing camera pose drift rather than reconstructing dense maps. Different from those methods, KinectFision [26] and BundleFusion [27] focus their mind on dense reconstruction by using GPUs to obtain on-the-fly 3D scene reconstruction.…”
Section: Scene Understanding From Rgb-d Sequencesmentioning
confidence: 99%
“…Compared to the existing 3D segmentation networks [3], [9] and scene graph generation approach [15], our dense semantic reconstruction method has a more complete function that adopts continuous RGB-D frames and outputs dense semantic 3D models. Based on 2D segmentation algorithms [6], [7] and our camera trackers [17], [18], we maintain a semantic sparse map that saves the probability of each object. Since semantic predictions from partial views are not as reliable as full ones', the semantic sparse map is used to correct wrong 2D segments existed in bad cases.…”
Section: Introductionmentioning
confidence: 99%
“…In indoor environments, orthogonal or parallel planar regions (walls, floor and ceiling) can form a layout that satisfies the Manhattan World (MW) assumption. Some SLAM systems [2,6,14,15,30,33,46] employ such structural features for fast dense map reconstruction or low drift localization. On the other hand, planar surface representations are also utilized to solve the occlusionawareness in indoor scene understanding [11].…”
Section: Introductionmentioning
confidence: 99%