CSA6D: Channel-Spatial Attention Networks for 6D Object Pose Estimation

Chen, Tao; Gu, Dongbing

doi:10.1007/s12559-021-09966-y

Cited by 8 publications

(5 citation statements)

References 41 publications

“…He et al [12] incorporate a channel-level attention module for the adaptive feature fusion into U-Net and calculate distances between pixels and keypoints using prior distance augmented loss. Another related architecture based on the channel spatial attention network (CSA6D) is proposed by Chen and Gu [40] to estimate the 6D object pose from RGB-D images.…”

Section: Voting-based Methods (mentioning)

confidence: 99%

“…Here, we compare our results with 6D pose estimation approaches using a single RGB image, which are state of the art in this research area. The comparisons have been carried out against PVNet [8], DPVL [11], ASPP-DF-PVNet with L+ loss [7], and PDAL-AFAM approach of He et al (2021) [12] and some previous approaches such as PoseCNN [5], SSD6D [1], YOLO6D [3], BB8 [29], CDPN [32], DPOD [31], Pix2Pose [33], and CSA6D [40]. The results are evaluated using ADD (-S) and 2D-Projection metrics on LINEMOD and occlusion LINEMOD datasets.…”

Section: Comparisons With State Of the Art (mentioning)

confidence: 99%

“…We include only those results for comparisons that are provided by other methods as 2D projection-based results are not reported by some methods, so we do not include those in Tables 3 and 4. CSA6D [40] reported only 2D projection-based results on the LINEMOD dataset, so we only include those. Table 3 shows the comparison of our method with a number of other methods for pose estimation on the LINEMOD dataset concerning the 2D projection metric.…”

Section: Comparisons Using 2D Projection Metric (mentioning)

confidence: 99%

A Robust Convolutional Neural Network for 6D Object Pose Estimation from RGB Image with Distance Regularization Voting Loss

Ullah, Daradkeh, et al. 2022

Scientific Programming

Six-degree (6D) pose estimation of objects is important for robot manipulation but at the same time challenging when dealing with occluded and textureless objects. To overcome this challenge, the proposed method presents an end-to-end robust network for real-time 6D pose estimation of rigid objects using the RGB image. In this proposed method, a fully convolutional network with a features pyramid is developed that effectively boosts the accuracy of pixelwise labeling and direction unit vector field that take part in the voting process for object keypoints estimation. The network further takes into account measuring the distance between pixel and keypoint, which aims to help select accurate hypotheses in the RANSAC process. This avoids hypothesis deviations caused by the errors due to direction unit vectors in cases of distant pixels from keypoints. A vectorial distance regularization loss function is used to help Perspective-n-Point find 2D-3D correspondences between 3D object keypoints and their estimated corresponding 2D counterparts. Experiments are performed on widely used LINEMOD and occlusion LINEMOD datasets with ADD (-S) and 2D projection evaluation metrics. The results show that our method improves pose estimation performance compared to the state-of-the-art while still achieving real-time efficiency.
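The ADD(-S) and 2D projection metrics named in this abstract can be illustrated with a minimal sketch. It assumes the commonly used definitions: ADD is the mean distance between model points under the ground-truth and estimated poses, ADD-S is the mean closest-point distance (used for symmetric objects), and the 2D projection error is the mean pixel distance between the projected model points. The function names, the pinhole intrinsics, and the toy points are hypothetical and not taken from the paper.

```python
import math

def transform(points, R, t):
    """Apply rotation R (3x3 nested lists) and translation t to 3D points."""
    return [
        tuple(sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3))
        for p in points
    ]

def add_metric(points, R_gt, t_gt, R_est, t_est):
    """ADD: mean distance between corresponding transformed model points."""
    gt = transform(points, R_gt, t_gt)
    est = transform(points, R_est, t_est)
    return sum(math.dist(a, b) for a, b in zip(gt, est)) / len(points)

def add_s_metric(points, R_gt, t_gt, R_est, t_est):
    """ADD-S: mean distance from each ground-truth point to its closest estimated point."""
    gt = transform(points, R_gt, t_gt)
    est = transform(points, R_est, t_est)
    return sum(min(math.dist(a, b) for b in est) for a in gt) / len(points)

def project(points_cam, fx, fy, cx, cy):
    """Pinhole projection of camera-frame points (assumes z > 0)."""
    return [(fx * x / z + cx, fy * y / z + cy) for x, y, z in points_cam]

def projection_error(points, R_gt, t_gt, R_est, t_est, fx, fy, cx, cy):
    """2D projection metric: mean pixel distance between projected model points."""
    gt = project(transform(points, R_gt, t_gt), fx, fy, cx, cy)
    est = project(transform(points, R_est, t_est), fx, fy, cx, cy)
    return sum(math.dist(a, b) for a, b in zip(gt, est)) / len(points)

# Toy example (hypothetical values): identity rotation, 1 cm translation error.
IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
model_points = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (0.0, 0.1, 0.0)]
add = add_metric(model_points, IDENTITY, (0.0, 0.0, 1.0), IDENTITY, (0.01, 0.0, 1.0))
add_s = add_s_metric(model_points, IDENTITY, (0.0, 0.0, 1.0), IDENTITY, (0.01, 0.0, 1.0))
pixel_err = projection_error(
    model_points, IDENTITY, (0.0, 0.0, 1.0), IDENTITY, (0.01, 0.0, 1.0),
    fx=500.0, fy=500.0, cx=320.0, cy=240.0,
)
```

In common practice a pose is counted as correct when ADD(-S) is below 10% of the object diameter, or when the 2D projection error is below 5 pixels. The thresholds actually used in the cited work should be checked against the paper itself.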

“…There has been great progress in reconstructing or estimating the pose of a single hand [KS12,GRL*19,IMB*18,CCY*21,ZLM*19] or objects [HHFS19, KMT*17, PLH*19, ZSI19, LF20, ZHMW22, LZXQ21, YJLF22, CG22, ZBB21, SHCM21] alone over recent decades. Lacking good datasets labeling hands and objects together, early work on hand‐object interaction focused on recovering either the hand [RKK09, RKI*14] or object [TG15] pose in a interaction.…”

Section: Related Work (mentioning)

confidence: 99%

Joint Hand and Object Pose Estimation from a Single RGB Image using High‐level 2D Constraints

Song, Martin 2022

Computer Graphics Forum

Joint pose estimation of human hands and objects from a single RGB image is an important topic for AR/VR, robot manipulation, etc. It is common practice to determine both poses directly from the image; some recent methods attempt to improve the initial poses using a variety of contact‐based approaches. However, few methods take the real physical constraints conveyed by the image into consideration, leading to less realistic results than the initial estimates. To overcome this problem, we make use of a set of high‐level 2D features which can be directly extracted from the image in a new pipeline which combines contact approaches and these constraints during optimization. Our pipeline achieves better results than direct regression or contact‐based optimization: they are closer to the ground truth and provide high quality contact.

“…With recent advances in 3D scanning technologies, it becomes convenient to obtain 3D raw data. As the fundamental 3D representation, point cloud has attracted extensive attention for various 3D applications [1,2]. Recently, researchers focus on exploiting Convolution Neural Networks (CNNs) to process 3D point cloud, which can be generally categorized into three types: projection-based methods [3,4,5], voxelization-based methods [6,7], and point-based methods [8,9,10,11].…”

Section: Introduction (mentioning)

confidence: 99%

PointGS: Bridging and fusing geometric and semantic space for 3D point cloud analysis

Jiang, Huang, Wu, et al. 2023

Information Fusion
