Marker-Less 3d Object Recognition and 6d Pose Estimation for Homogeneous Textureless Objects: An RGB-D Approach

Hajari, Nasim; Lugo, Gabriel; Sharma, Harsh; Cheng, Irene

doi:10.3390/s20185098

Cited by 5 publications

(3 citation statements)

References 69 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Moreover, it introduces a new loss function, known as the transformer loss for 3D coordinate regression, helping in resolving object symmetry issues. Meanwhile, Hajari et al [ 23 ] proposed a method, based on point cloud template matching, to realize some progress in position estimation of weakly textured objects. Within the pose estimation task, it is challenging to cover all object poses during training by just using real data; thus, acquiring pose labels with ground truth values is difficult to realize in several scenarios.…”

Section: Related Workmentioning

confidence: 99%

“…Advances in DL techniques have led to significant progress not only in the areas of target detection [ 1 , 2 , 3 ] and image segmentation [ 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11 ], but also significant progress has been made in pose estimation using these techniques. They can be classified based on the types of datasets into (1) approaches relying on real datasets [ 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 ]; and (2) approaches based on synthetic data [ 24 , 25 , 26 , 27 , 28 , 29 , 30 , 31 , 32 ]. However, the need for labeled real datasets raises a challenge due to the time-consuming and labor-intensive nature of their production, resulting in high dataset production costs [ 33 ].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset

Zheng,

Zhang,

Zhang

et al. 2023

Sensors

View full text Add to dashboard Cite

Due to the difficulty in generating a 6-Degree-of-Freedom (6-DoF) object pose estimation dataset, and the existence of domain gaps between synthetic and real data, existing pose estimation methods face challenges in improving accuracy and generalization. This paper proposes a methodology that employs higher quality datasets and deep learning-based methods to reduce the problem of domain gaps between synthetic and real data and enhance the accuracy of pose estimation. The high-quality dataset is obtained from Blenderproc and it is innovatively processed using bilateral filtering to reduce the gap. A novel attention-based mask region-based convolutional neural network (R-CNN) is proposed to reduce the computation cost and improve the model detection accuracy. Meanwhile, an improved feature pyramidal network (iFPN) is achieved by adding a layer of bottom-up paths to extract the internalization of features of the underlying layer. Consequently, a novel convolutional block attention module–convolutional denoising autoencoder (CBAM–CDAE) network is proposed by presenting channel attention and spatial attention mechanisms to improve the ability of AE to extract images’ features. Finally, an accurate 6-DoF object pose is obtained through pose refinement. The proposed approach is compared to other models using the T-LESS and LineMOD datasets. Comparison results demonstrate the proposed approach outperforms the other estimation models.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset

Zheng,

Zhang,

Zhang

et al. 2023

Sensors

View full text Add to dashboard Cite

show abstract

“…Many works have been interested in adopting AI in industrial vision applications. The work carried out in [4] provides a methodology to recognize the class of an object while estimating its 6D pose with RGB-D data. Specifically, the proposed model adopts a global approach, first recognizing an object and the region of interest (ROI) from RGB images.…”

Section: Introductionmentioning

confidence: 99%