2020
DOI: 10.48550/arxiv.2010.14406
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Transporter Networks: Rearranging the Visual World for Robotic Manipulation

Abstract: Robotic manipulation can be formulated as inducing a sequence of spatial displacements: where the space being moved can encompass an object, part of an object, or end effector. In this work, we propose the Transporter Network, a simple model architecture that rearranges deep features to infer spatial displacements from visual input -which can parameterize robot actions. It makes no assumptions of objectness (e.g. canonical poses, models, or keypoints), it exploits spatial symmetries, and is orders of magnitude… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
81
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(81 citation statements)
references
References 32 publications
0
81
0
Order By: Relevance
“…We evaluated the transformer and Assistive Tele-op system across a variety of tasks representing industrial [18,31], household [4,8], and caregiving [7,10] task scenarios. For this, we used both existing data from the Roboturk [18] simulation dataset covering pick-and-place and nut assembly tasks, as well as new data for other tasks that we collected in VR.…”
Section: Discussionmentioning
confidence: 99%
“…We evaluated the transformer and Assistive Tele-op system across a variety of tasks representing industrial [18,31], household [4,8], and caregiving [7,10] task scenarios. For this, we used both existing data from the Roboturk [18] simulation dataset covering pick-and-place and nut assembly tasks, as well as new data for other tasks that we collected in VR.…”
Section: Discussionmentioning
confidence: 99%
“…Variations on this approach have been explored in [22,23]. The FCN approach has been adapted to a variety of different manipulation tasks with different action primitives [24,25,26,27,28,1,29,30,31]. In this paper, we extend the work above by proposing new equivariant architectures for the spatial action space setting.…”
Section: Related Workmentioning
confidence: 99%
“…Recent work demonstrates the use of data augmentation improves the data efficiency and the policy's performance in reinforcement learning [7,8,9]. In the context of robotics, data augmentation is often used to generate additional samples [1,10,11]. In contrast to learning the equivariance property using data augmentation, our work utilizes the equivariant network to hard code the symmetries in the structure of the network to achieve better sample efficiency.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, with the growing interest of the research community for IL, LfD or ORL, a lot of datasets have been released. They mainly focus on robotics [15,44,48,40], some with a particular focus on human data [31,32,39]. Some works include datasets for discrete action-environments like games [19,28].…”
Section: Related Workmentioning
confidence: 99%