Minimum-information LQG control part I: Memoryless controllers

Fox, Roy; Tishby, Naftali

doi:10.1109/cdc.2016.7799131

Cited by 14 publications

(12 citation statements)

References 37 publications

“…Combining equation (13) and (14) yields (12). We now show that $K_{\varphi}(0^T) \le \log_2(T) + c_0$, for some constant $c_0$ independent of $T$, where $0^T$ is the string consisting of $T$ zeros.…”

Section: Appendix (mentioning)

confidence: 65%

“…In the second category, a low-complexity policy is instead obtained directly. Here, notable methods include policy distillation [30], VC-dimension constraints [16], concise finite-state machine plans [23], [24], low-memory policies through sparsity constraints [7], and information-theoretic approaches such as KL-regularisation [27], [35], mutual information regularisation with variations [33], [12], [34], and minimal specification complexity [11], [10]. Our work belongs to this second category and resembles [23], [24], [11], [10] the most, but differ since we consider Kolmogorov complexity.…”

Section: B Contribution (mentioning)

confidence: 99%

Computing Complexity-aware Plans Using Kolmogorov Complexity

Stefansson, Johansson

2021

Preprint

In this paper, we introduce complexity-aware planning for finite-horizon deterministic finite automata with rewards as outputs, based on Kolmogorov complexity. Kolmogorov complexity is considered since it can detect computational regularities of deterministic optimal policies. We present a planning objective yielding an explicit trade-off between a policy's performance and complexity. It is proven that maximising this objective is non-trivial in the sense that dynamic programming is infeasible. We present two algorithms obtaining low-complexity policies, where the first algorithm obtains a low-complexity optimal policy, and the second algorithm finds a policy maximising performance while maintaining local (stagewise) complexity constraints. We evaluate the algorithms on a simple navigation task for a mobile robot, where our algorithms yield low-complexity policies that concur with intuition.
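The appendix excerpt quoted for this paper bounds the complexity of the all-zeros string by roughly log2(T) bits. The sketch below illustrates only the intuition behind such a bound: a fixed decoder plus the binary expansion of T is enough to reproduce the string. The function names are hypothetical, and this is not the construction used in the cited paper; true Kolmogorov complexity is uncomputable and depends on the chosen reference machine, so the fixed decoder here merely plays the role of the additive constant.

```python
def encode_zero_string(T: int) -> str:
    """Describe the string of T zeros by the binary expansion of T (hypothetical helper)."""
    if T < 1:
        raise ValueError("T must be a positive integer")
    return format(T, "b")


def decode_zero_string(description: str) -> str:
    """Fixed decoder: read T in binary and emit T zeros."""
    return "0" * int(description, 2)


def description_length_bits(T: int) -> int:
    """Number of bits in the description, equal to floor(log2(T)) + 1."""
    return len(encode_zero_string(T))


if __name__ == "__main__":
    for T in (1, 8, 1000, 2**20):
        bits = description_length_bits(T)
        assert decode_zero_string(encode_zero_string(T)) == "0" * T
        print(f"T={T}: string length {T}, description length {bits} bits")
```

The description grows logarithmically in T while the described string grows linearly, which is the kind of computational regularity the abstract refers to.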

“…In information theory, Shannon [82] also introduced a different notion of a separation principle, to explain coding via two (separate) phases of source compression and channel coding [45]. The connections between Shannon's work and the separation principle in control theory have become more clear in recent years, thanks to a growing literature showing how Shannon's definition captures and potentially generalises the results from control theory, see for instance [89,88,30]. Here however, the focus will be on the principle traditionally described in control theory for LQG in continuous systems [54,105], under the following standard assumptions [105,5,14,3,85]:…”

Section: Linear Quadratic Gaussian (LQG) Control (mentioning)

confidence: 99%

On Kalman-Bucy filters, linear quadratic control and active inference

Baltieri, Buckley

2020

Preprint

Linear Quadratic Gaussian (LQG) control is a framework first introduced in control theory that provides an optimal solution to linear problems of regulation in the presence of uncertainty. This framework combines Kalman-Bucy filters for the estimation of hidden states with Linear Quadratic Regulators for the control of their dynamics. Nowadays, LQG is also a common paradigm in neuroscience, where it is used to characterise different approaches to sensorimotor control based on state estimators, forward and inverse models. According to this paradigm, perception can be seen as a process of Bayesian inference and action as a process of optimal control. Recently, active inference has been introduced as a process theory derived from a variational approximation of Bayesian inference problems that describes, among others, perception and action in terms of (variational and expected) free energy minimisation. Active inference relies on a mathematical formalism similar to LQG, but offers a rather different perspective on problems of sensorimotor control in biological systems based on a process of biased perception. In this note we compare the mathematical treatments of these two frameworks for linear systems, focusing on their respective assumptions and highlighting their commonalities and technical differences.
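The combination of a state estimator with a linear quadratic regulator described in this abstract can be sketched for a scalar system. Note the assumptions: the cited note treats continuous-time Kalman-Bucy filters, whereas this is a discrete-time analogue chosen for brevity, and all parameter values, cost weights, and function names are illustrative rather than taken from any of the papers listed here.

```python
import random


def lqr_gain(a, b, q_cost, r_cost, iters=500):
    """Steady-state LQR gain via fixed-point iteration of the scalar Riccati equation."""
    p = q_cost
    for _ in range(iters):
        p = q_cost + a * a * p - (a * b * p) ** 2 / (r_cost + b * b * p)
    return a * b * p / (r_cost + b * b * p)


def kalman_gain(a, c, w_var, v_var, iters=500):
    """Steady-state Kalman gain from the scalar prediction-variance recursion."""
    s = w_var
    for _ in range(iters):
        s = a * a * s - (a * c * s) ** 2 / (c * c * s + v_var) + w_var
    return s * c / (c * c * s + v_var)


def simulate(a=1.2, b=1.0, c=1.0, w_var=0.1, v_var=0.1, steps=200, seed=0):
    """Run x' = a x + b u + w, y = c x + v under certainty-equivalent control.

    Returns the average stage cost x^2 + u^2 (unit cost weights assumed).
    """
    rng = random.Random(seed)
    L = lqr_gain(a, b, 1.0, 1.0)
    K = kalman_gain(a, c, w_var, v_var)
    x, x_pred, total = 1.0, 0.0, 0.0
    for _ in range(steps):
        y = c * x + rng.gauss(0.0, v_var ** 0.5)    # noisy observation
        x_hat = x_pred + K * (y - c * x_pred)       # estimation (filter update)
        u = -L * x_hat                              # control acts on the estimate
        total += x * x + u * u
        x = a * x + b * u + rng.gauss(0.0, w_var ** 0.5)
        x_pred = a * x_hat + b * u                  # one-step prediction
    return total / steps


if __name__ == "__main__":
    print("LQR gain:", lqr_gain(1.2, 1.0, 1.0, 1.0))
    print("Kalman gain:", kalman_gain(1.2, 1.0, 0.1, 0.1))
    print("average cost:", simulate())
```

The two gains are computed independently of each other, which reflects the separation of estimation and control in LQG: the regulator gain depends only on the dynamics and cost, and the filter gain only on the dynamics and noise variances.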

“…Our work differs in that we provide analytic (i.e., model-based) methods for finding such representations and policies and we explicitly characterize the resulting robustness. Another branch of work considers the construction of LQG policies that achieve a performance goal while minimizing an information-theoretic quantity such as the mutual information between inputs and outputs [26], [27] or Massey's directed information [28], [29]. In contrast to these works, our derivation handles nonlinear systems and also presents robustness results for the resulting controllers, which have not been discussed to our knowledge in existing literature.…”

Section: A Related Work (mentioning)

confidence: 99%

Task-Driven Estimation and Control via Information Bottlenecks

Pacelli, Majumdar

2019

2019 International Conference on Robotics and Automation (ICRA)

This paper presents a reinforcement learning approach to synthesizing task-driven control policies for robotic systems equipped with rich sensory modalities (e.g., vision or depth). Standard reinforcement learning algorithms typically produce policies that tightly couple control actions to the entirety of the system's state and rich sensor observations. As a consequence, the resulting policies can often be sensitive to changes in task-irrelevant portions of the state or observations (e.g., changing background colors). In contrast, the approach we present here learns to create a task-driven representation that is used to compute control actions. Formally, this is achieved by deriving a policy gradient-style algorithm that creates an information bottleneck between the states and the task-driven representation; this constrains actions to only depend on task-relevant information. We demonstrate our approach in a thorough set of simulation results on multiple examples including a grasping task that utilizes depth images and a ball-catching task that utilizes RGB images. Comparisons with a standard policy gradient approach demonstrate that the task-driven policies produced by our algorithm are often significantly more robust to sensor noise and task-irrelevant changes in the environment.
