“…Advances in DL techniques have led to significant progress not only in the areas of target detection [ 1 , 2 , 3 ] and image segmentation [ 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11 ], but also significant progress has been made in pose estimation using these techniques. They can be classified based on the types of datasets into (1) approaches relying on real datasets [ 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 ]; and (2) approaches based on synthetic data [ 24 , 25 , 26 , 27 , 28 , 29 , 30 , 31 , 32 ]. However, the need for labeled real datasets raises a challenge due to the time-consuming and labor-intensive nature of their production, resulting in high dataset production costs [ 33 ].…”