2019 International Conference on Robotics and Automation (ICRA) 2019
DOI: 10.1109/icra.2019.8794213
|View full text |Cite
|
Sign up to set email alerts
|

Task-Driven Estimation and Control via Information Bottlenecks

Abstract: This paper presents a reinforcement learning approach to synthesizing task-driven control policies for robotic systems equipped with rich sensory modalities (e.g., vision or depth). Standard reinforcement learning algorithms typically produce policies that tightly couple control actions to the entirety of the system's state and rich sensor observations. As a consequence, the resulting policies can often be sensitive to changes in taskirrelevant portions of the state or observations (e.g., changing background c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 66 publications
(76 reference statements)
0
10
0
Order By: Relevance
“…In direct data-driven control design, we would like to use the input-output behavior data matrix " U J Y J ‰ J directly in place of the model p P in (7). The key idea here is to avoid identifying a model of input-output behaviors and to directly search for the optimal behavior for the task (2) within the span of observed behaviors contained in the data matrix " U J Y J ‰ J .…”
Section: A Direct Data-driven Controlmentioning
confidence: 99%
See 1 more Smart Citation
“…In direct data-driven control design, we would like to use the input-output behavior data matrix " U J Y J ‰ J directly in place of the model p P in (7). The key idea here is to avoid identifying a model of input-output behaviors and to directly search for the optimal behavior for the task (2) within the span of observed behaviors contained in the data matrix " U J Y J ‰ J .…”
Section: A Direct Data-driven Controlmentioning
confidence: 99%
“…The above is the sense in which the direct data-driven control design formulation employs implicitly an estimate p P direct of the true behavior model P as stated earlier, and can be connected to the general data-driven control design formulation (7). We now obtain the minimizer p u direct in (10) as:…”
Section: A Direct Data-driven Controlmentioning
confidence: 99%
“…Literature Review: Recent work has applied information bottleneck theory [32] to build controllers that focus on actionable, task-relevant visual inputs for robust, generalizable navigation and grasping policies [27,26,29]. In contrast, we introduce a novel algorithm for co-designing communication and machine perception, which uses pre-trained task modules to learn salient, efficiently-computable representations.…”
Section: Train-time Onlymentioning
confidence: 99%
“…A different line of work learns task-centric memory representations via information bottlenecks [22,23]. These approaches seek policies with "low complexity" as defined in terms of the information contained in the memory representation.…”
Section: Related Workmentioning
confidence: 99%
“…These approaches seek policies with "low complexity" as defined in terms of the information contained in the memory representation. For example, in [23], the objective is to minimize the information content about the state in the memory representation. Our work, instead, defines memory complexity in terms of the dimension; such a measure of complexity is more physically meaningful and tied to the robotic system's onboard memory constraints.…”
Section: Related Workmentioning
confidence: 99%