Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments

Yamada, Jun; Collins, Jack; Posner, Ingmar

doi:10.48550/arxiv.2303.03365

Cited by 2 publications

(5 citation statements)

References 34 publications

“…Meanwhile, Lee et al (2020) quantify uncertainty in pose estimation to determine a binary switching strategy using model-based or RL policies. Additionally, Yamada et al (2023) implemented an object-centric generative model to identify goals for motion planning and a skill transition network to facilitate the movement of the end-effector from its terminal state in motion planning to viable starting states of a sample-efficient RL policy. However, these methods require the model of the object, in particular, the manual specification of a goal state in the robot's frame and control policy design (Yamada et al, 2023).…”

Section: Figure (mentioning)

confidence: 99%

“…Additionally, Yamada et al (2023) implemented an object-centric generative model to identify goals for motion planning and a skill transition network to facilitate the movement of the end-effector from its terminal state in motion planning to viable starting states of a sample-efficient RL policy. However, these methods require the model of the object, in particular, the manual specification of a goal state in the robot's frame and control policy design (Yamada et al, 2023). Additionally, they face difficulties in providing comprehensive guidance in both free space and contact-rich regions due to the limited motion planning in tasks that require environmental interaction and the scarcity of visual servoing in addressing geometric constraints.…”

Section: Figure (mentioning)

confidence: 99%

“…In response to these challenges, this study proposes a novel OEC task representation within imitation learning (IL) and residual RL frameworks, which are tailored to enable the learning of adaptive operations from minimal demonstrations and interactions. This approach builds upon these foundations of the prior vision model from the model-based methods (Lee et al, 2020; Shi et al, 2021a; Yamada et al, 2023) and the mathematical model from the imitation learning-based methods (Carvalho et al, 2022; Davchev et al, 2022). Our approach distinguishes itself by: (1) streamlining robot programming through extracting via-points from demonstrated end-effector trajectories for task representation, thereby simplifying the reconfiguration costs and improving adaptability.…”

Section: Imitation Learning-based Base Policy (mentioning)

confidence: 99%

Extended residual learning with one-shot imitation learning for robotic assembly in semi-structured environment

Wang, Su, Sun et al. 2024

Front. Neurorobot.

Introduction: Robotic assembly tasks require precise manipulation and coordination, often necessitating advanced learning techniques to achieve efficient and effective performance. While residual reinforcement learning with a base policy has shown promise in this domain, existing base policy approaches often rely on hand-designed full-state features and policies or extensive demonstrations, limiting their applicability in semi-structured environments.

Methods: In this study, we propose an innovative Object-Embodiment-Centric Imitation and Residual Reinforcement Learning (OEC-IRRL) approach that leverages an object-embodiment-centric (OEC) task representation to integrate vision models with imitation and residual learning. By utilizing a single demonstration and minimizing interactions with the environment, our method aims to enhance learning efficiency and effectiveness. The proposed method involves three key steps: creating an object-embodiment-centric task representation, employing imitation learning for a base policy using via-point movement primitives for generalization to different settings, and utilizing residual RL for uncertainty-aware policy refinement during the assembly phase.

Results: Through a series of comprehensive experiments, we investigate the impact of the OEC task representation on base and residual policy learning and demonstrate the effectiveness of the method in semi-structured environments. Our results indicate that the approach, requiring only a single demonstration and less than 1.2 h of interaction, improves success rates by 46% and reduces assembly time by 25%.

Discussion: This research presents a promising avenue for robotic assembly tasks, providing a viable solution without the need for specialized expertise or custom fixtures.

“…This enables the models to better understand the underlying structure of a scene and capture the relationships between its constituent objects. Early works have conducted unsupervised scene inference and generation in 2D (MONet [22], Slot Attention [9], GENESIS [23], GENESIS-V2 [10]), and for robotics applications using APEX [24], [25]. In both [24], [25] the 2D OCGM, APEX, is utilised for object matching using the learned object-centric latent representation in an object rearrangement task in simulation and a peg-in-hole task in the real world respectively.…”

Section: Related Work (mentioning)

confidence: 99%

“…Early works have conducted unsupervised scene inference and generation in 2D (MONet [22], Slot Attention [9], GENESIS [23], GENESIS-V2 [10]), and for robotics applications using APEX [24], [25]. In both [24], [25] the 2D OCGM, APEX, is utilised for object matching using the learned object-centric latent representation in an object rearrangement task in simulation and a peg-in-hole task in the real world respectively. Nevertheless, the 2D reconstruction and 2D bounding boxes predicted by such OCGM are of limited use in a 3D world.…”

Section: Related Work (mentioning)

confidence: 99%

APEX: Unsupervised, Object-Centric Scene Segmentation and Tracking for Robot Manipulation

Jones, Engelcke et al. 2021

2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Recent advances in unsupervised learning for object detection, segmentation, and tracking hold significant promise for applications in robotics. A common approach is to frame these tasks as inference in probabilistic latent-variable models. In this paper, however, we show that the current state-of-the-art struggles with visually complex scenes such as typically encountered in robot manipulation tasks. We propose APEX, a new latent-variable model which is able to segment and track objects in more realistic scenes featuring objects that vary widely in size and texture, including the robot arm itself. This is achieved by a principled mask normalisation algorithm and a high-resolution scene encoder. To evaluate our approach, we present results on the real-world Sketchy dataset. This dataset, however, does not contain ground truth masks and object IDs for a quantitative evaluation. We thus introduce the Panda Pushing Dataset (P2D) which shows a Panda arm interacting with objects on a table in simulation and which includes ground-truth segmentation masks and object IDs for tracking. In both cases, APEX comprehensively outperforms the current state-of-the-art in unsupervised object segmentation and tracking. We demonstrate the efficacy of our segmentations for robot skill execution on an object arrangement task, where we also achieve the best or comparable performance among all the baselines.
