2018
DOI: 10.48550/arxiv.1811.08067
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reinforcement Learning of Active Vision for Manipulating Objects under Occlusions

Abstract: We consider artificial agents that learn to jointly control their gripper and camera in order to reinforcement learn manipulation policies in the presence of occlusions from distractor objects. Distractors often occlude the object of interest and cause it to disappear from the field of view. We propose hand/eye controllers that learn to move the camera to keep the object within the field of view and visible, in coordination to manipulating it to achieve the desired goal, e.g., pushing it to a target location. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…In [34] an active vision dataset and a reinforcement learning based baseline to explore the environment to detect objects are proposed. [35] presents a method to jointly learn a policy for both grasping and viewing. The method uses a simulated environment which detects objects depending on the occlusion rate.…”
Section: Related Workmentioning
confidence: 99%
“…In [34] an active vision dataset and a reinforcement learning based baseline to explore the environment to detect objects are proposed. [35] presents a method to jointly learn a policy for both grasping and viewing. The method uses a simulated environment which detects objects depending on the occlusion rate.…”
Section: Related Workmentioning
confidence: 99%
“…We do not plan based on detailed geometric models, but instead leverage coarser geometric information, together with semantic information, and between hierarchical levels of the 3D scene, to guide the search. Recent works used neural networks to visually reason about piles of objects and learn how to manipulate it with the objective of finding a target object [1,8,[23][24][25]. Different to ours, their method is restricted to the manipulation of the pile and does not leverage semantic cues.…”
Section: Related Workmentioning
confidence: 99%
“…Occlusion reasoning in robotics: Object occlusions pose a huge challenge for robot learning: even simple distractors that occasionally occlude the object can cause state-of-the-art RL algorithms to fail [22]. Theoretically, occlusions can be modeled using the framework of POMDP [23], but usually such a formulation is intractable to solve, especially when the states are image observations.…”
Section: Related Workmentioning
confidence: 99%
“…Other than POMDP, there has been lots of work aiming to solve the occlusion problem. Cheng, et al [22] use active vision which learns a policy to move the camera to avoid occlusions. To track the possibly occluded pixels, Ebert, et al [24] use a Conv-LSTM with temporal skip connections to copy pixels from prior images in the history.…”
Section: Related Workmentioning
confidence: 99%