2022
DOI: 10.1109/access.2022.3148127
|View full text |Cite
|
Sign up to set email alerts
|

A Unifying View of Estimation and Control Using Belief Propagation With Application to Path Planning

Abstract: The use of estimation techniques on stochastic models to solve control problems is an emerging paradigm that falls under the rubric of Active Inference (AI) and Control as Inference (CAI). In this work, we use probability propagation on factor graphs to show that various algorithms proposed in the literature can be seen as specific composition rules in a factor graph. We show how this unified approach, presented both in probability space and in log of the probability space, provides a very general framework th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…which, for example, can be derived from message propagation through the use of the classic sum-product rule [22,24]. Note that these are the only functions needed for our purposes because the optimal policy at time t can be obtained by By rigorously applying Bayes' theorem and marginalization, the various messages propagate within the network and contribute to the determination of the posteriors through the simple multiplication of the relative forward and backward messages [14,25]. In particular, from Figure 2, it can be seen that the calculation of the distribution for the policy at time t can generally be rewritten as…”
Section: The Factor Graphmentioning
confidence: 99%
See 1 more Smart Citation
“…which, for example, can be derived from message propagation through the use of the classic sum-product rule [22,24]. Note that these are the only functions needed for our purposes because the optimal policy at time t can be obtained by By rigorously applying Bayes' theorem and marginalization, the various messages propagate within the network and contribute to the determination of the posteriors through the simple multiplication of the relative forward and backward messages [14,25]. In particular, from Figure 2, it can be seen that the calculation of the distribution for the policy at time t can generally be rewritten as…”
Section: The Factor Graphmentioning
confidence: 99%
“…where R(s t , a t ) = ln p(a t ) + r(s t , a t ). Although, in the classic sum-product algorithm, these blocks correspond to a marginalization process, it is still possible to demonstrate that the simple reassignment of different procedures to them allows one to obtain different types of algorithms within the same model [25]. Supplementary Appendix S1 presents various algorithms that can be used simply by modifying the function within the previous blocks, and which, therefore, show the generality of this framework, while Table 1 summarizes the related equations by setting Q(s t , a t ) Q (S t ,A t ) (1) (s t , a t ) and V(s t ) V S t (s t ).…”
Section: The Factor Graphmentioning
confidence: 99%