2021
DOI: 10.48550/arxiv.2102.10820
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SALT: A Semi-automatic Labeling Tool for RGB-D Video Sequences

Dennis Stumpf,
Stephan Krauß,
Gerd Reis
et al.

Abstract: Large labeled data sets are one of the essential basics of modern deep learning techniques. Therefore, there is an increasing need for tools that allow to label large amounts of data as intuitively as possible. In this paper, we introduce SALT, a tool to semi-automatically annotate RGB-D video sequences to generate 3D bounding boxes for full six Degrees of Freedom (DoF) object poses, as well as pixel-level instance segmentation masks for both RGB and depth. Besides bounding box propagation through various inte… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 22 publications
0
5
0
Order By: Relevance
“…Both datasets use the Kinect Azure RGB-D camera. The annotation of the data was done manually, assisted by some semi-automatic functionalities of our annotation tool, such as the transfer of bounding box proposals between subsequent frames [21]. Table 1 summarizes the attributes of the two datasets.…”
Section: Datasetsmentioning
confidence: 99%
“…Both datasets use the Kinect Azure RGB-D camera. The annotation of the data was done manually, assisted by some semi-automatic functionalities of our annotation tool, such as the transfer of bounding box proposals between subsequent frames [21]. Table 1 summarizes the attributes of the two datasets.…”
Section: Datasetsmentioning
confidence: 99%
“…SALT [12] proposes to use a GrabCut based approach to speed up labeling of RGB-D data. Similarly, their method does not require object or environment models.…”
Section: B 3d Annotationmentioning
confidence: 99%
“…Stumpf et al [20] describe the layout of images with 3D bounding boxes using an RGB-D camera. Images were labeled using the information about the current depth-map of the frame.…”
Section: Related Workmentioning
confidence: 99%