2019
DOI: 10.1126/scirobotics.aav3123
|View full text |Cite
|
Sign up to set email alerts
|

See, feel, act: Hierarchical learning for complex manipulation skills with multisensory fusion

Abstract: Humans are able to seamlessly integrate tactile and visual stimuli with their intuitions to explore and execute complex manipulation skills. They not only see but also feel their actions. Most current robotic learning methodologies exploit recent progress in computer vision and deep learning to acquire data-hungry pixel-to-action policies. These methodologies do not exploit intuitive latent structure in physics or tactile signatures. Tactile reasoning is omnipresent in the animal kingdom, yet it is underdevelo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
68
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
2

Relationship

1
9

Authors

Journals

citations
Cited by 102 publications
(69 citation statements)
references
References 21 publications
(28 reference statements)
1
68
0
Order By: Relevance
“…Such compositional processes (more generally) might be instantiated in nature in terms of the computational principles or neurobiological underpinnings of any adaptive decision-making process. Modular composition may arise, and aid efficacy or efficiency of such processes, in several different settings; for example, composing multiple sensory modalities, such as tactile and visual stimulation (Fazeli et al, 2019); arranging dominance-relations to form a hierarchical representation of a social group (Seyfarth and Cheney, 2018); cognitive reasoning involving hierarchically organised decision-making (Sarafyazd and Jazayeri, 2019); or other such functionaldemand protocols in nature, such as the availability of food, density of populations, and presence of predators in migratory species (Hopcraft et al, 2014).…”
Section: Resultsmentioning
confidence: 99%
“…Such compositional processes (more generally) might be instantiated in nature in terms of the computational principles or neurobiological underpinnings of any adaptive decision-making process. Modular composition may arise, and aid efficacy or efficiency of such processes, in several different settings; for example, composing multiple sensory modalities, such as tactile and visual stimulation (Fazeli et al, 2019); arranging dominance-relations to form a hierarchical representation of a social group (Seyfarth and Cheney, 2018); cognitive reasoning involving hierarchically organised decision-making (Sarafyazd and Jazayeri, 2019); or other such functionaldemand protocols in nature, such as the availability of food, density of populations, and presence of predators in migratory species (Hopcraft et al, 2014).…”
Section: Resultsmentioning
confidence: 99%
“…Reinforcement learning has been previously applied to manipulation learning [21]- [24]. Recently, much research has focused on using deep reinforcement learning to make robots play building block games [25], [26]and significant advances were made. In addition, significant amount of work has been done on robot grasbing [27], [28], door opening [29], navigating [30].…”
Section: B Related Workmentioning
confidence: 99%
“…Yet few algorithms endow robots with a similar capability. While the utility of multimodal data has frequently been shown in robotics [7,20,54,58,66], the proposed manipulation strategies often rely on handcrafted features or prior knowledge about how to solve a task. This makes many of these methods task-specific.…”
Section: Introductionmentioning
confidence: 99%