Discrete time nonlinear filters with informative observations are stable

Handel, Ramon van

doi:10.1214/ecp.v13-1423

Cited by 21 publications

(27 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Note that this result is an improvement over [28,Theorem 2.2]. The theorem concluded bounded Lipschitz merging of the filter in expectation, which is weaker than weak convergence in expectation.…”

Section: Proof [Theorem 42]mentioning

confidence: 66%

“…The result of Blackwell and Dubins [3] pairs with uniform observability, in that (3.2) directly follows from Blackwell and Dubins. Then uniform observability would imply filter stability in bounded Lipschitz distance [28]. van Handel proves this in [28], however the author only studied the measurement channel where h(x, z) = f (x) + z where f −1 is uniformly continuous and Z must have an everywhere non-zero characteristic function (e.g.…”

Section: Notation and Preliminariesmentioning

confidence: 99%

“…However, we would like to consider more stringent notions of stability for the filter, as well as stability in relative entropy for both the predictor and filter. Under different assumptions specific results can be developed , for example [28,Lemma 4.2] which establishes the total variation merging of the filter in expectation from that of the predictor using non-degeneracy. However, by examining the form of the Radon Nikodym derivative of P µ and P ν restricted and conditioned on different sigma fields, we can gain significant insight into how these different notions of stability relate to one another.…”

Section: Total Variation Merging Of the Onementioning

confidence: 99%

See 2 more Smart Citations

Stability of Non-Linear Filters, Observability and Relative Entropy

McDonald

Yüksel

2018

2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

View full text Add to dashboard Cite

Filter stability is a classical problem for partially observed Markov processes (POMP). For a POMP, an incorrectly initialized non-linear filter is said to be stable if the filter eventually corrects itself with the arrival of new measurement information. In the literature, studies on the stability of non-linear filters either focus on the ergodic properties on the hidden Markov process, or the informativeness/observability properties of the measurement channel. While notions of observability exist in the literature, they are often difficult to verify and specific examples of observable systems are mostly restricted to additive noise models with additional strict regularity properties. In this paper, we introduce a general definition of observability for stochastic non-linear dynamical systems and compare it with related findings in the literature. Our observability notion involves a functional characterization which is easily computed for a variety of systems as we demonstrate. Under this observability definition we establish filter stability results for a variety of criteria including weak merging and total variation merging, both in expectation and in an almost sure sense, as well as relative entropy. We consider the implications between these notions, which unify various results in the literature in a concise manner. Our conditions, and the examples we study, complement and generalize the existing results on filter stability.

show abstract

Section: Proof [Theorem 42]mentioning

confidence: 66%

Section: Notation and Preliminariesmentioning

confidence: 99%

Section: Total Variation Merging Of the Onementioning

confidence: 99%

See 1 more Smart Citation

Stability of Non-Linear Filters, Observability and Relative Entropy

McDonald

Yüksel

2018

2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

View full text Add to dashboard Cite

show abstract

“…The problems we consider are also related to, in the control-free context, the theory and applications of non-linear filtering with incorrect initial specifications. Here, the problem is to identify conditions on when an incorrectly initialized non-linear filter asymptotically gets corrected with the accumulation of additional measurements; these often require strong ergodicity properties of the Markov process [8,9,11] or regularity properties (such as absolute continuity) of incorrect prior with respect to the true one and conditions on the measurement processes [24].…”

Section: Introductionmentioning

confidence: 99%

Robustness to Incorrect Priors in Infinite Horizon Stochastic Control

Kara

Yüksel

2018

2018 IEEE Conference on Decision and Control (CDC)

View full text Add to dashboard Cite

We study the continuity properties of optimal solutions to stochastic control problems with respect to initial probability measures and applications of these to the robustness of optimal control policies applied to systems with incomplete or incorrect priors. It is shown that for single and multi-stage optimal cost problems, continuity and robustness cannot be established under weak convergence in general, but that the optimal expected cost is continuous in the priors under the convergence in total variation under mild conditions. By imposing further assumptions on the measurement models, robustness and continuity also hold under weak convergence of priors. We thus obtain robustness results and bounds on the mismatch error that occurs due to the application of a control policy which is designed for an incorrectly estimated prior as the incorrect prior converges to the true one. Positive and negative practical implications of these results in empirical learning for stochastic control are presented, where almost surely weak convergence of i.i.d. empirical measures occurs but stronger notions of convergence, such as total variation convergence, in general, do not.where P is the (prior) distribution of the initial state X 0 , andwhere T (·|x, u) is a stochastic kernel from X× U to X and Q(·|x) is a stochastic kernel from X to Y.We let the objective of the agent be the minimization of the cost for the static or single stage case,over the set of admissible policies γ ∈ Γ, where c : X × U → R is a Borel-measurable stagewise cost function and E Q,γ P denotes the expectation with initial state probability measure P and measurement channel Q under policy γ. Note that P ∈ P(X), where we let P(X) denote the set of probability measures on X.For the multi-stage case, we will discuss the discounted cost infinite horizon setting, with the following cost criterion to be minimized.for some β ∈ (0, 1).We define the optimal cost for the single-stage and the discounted infinite horizon as a function of the priors asProof. We use that β (P, Q).From inequalities (2.4), (2.5) and (2.6) we have that |J β (P, Q, γ * Pn ) − J β (P n , Q, γ * Pn )| is upper bounded as P n (dx 0 ) − P (dx 0 ) T V 1 1−β c ∞ . The analysis is then complete by considering Theorem 2.13.We now develop a robustness result under weak convergence of priors for multi-stage case. First, we give a lemma showing that for any multi-stage setting with a controlled Markov chain satisfying Assumption 2.3, the cost at any time stage is continuous in priors under weak convergence. THEOREM 3.3. Under Assumption 2.3, as P n → P weakly, we have, |J β (P, T , γ * Pn ) − J * β (P, T )| → 0Proof. We use the following bound again,

show abstract

“…This is a mild condition in classical filtering models that serves mainly to rule out the singular case of noiseless observations: for example, the addition of any observation noise to the above counterexample would render the filter ergodic. On the other hand, even in the noiseless case, ergodicity is inherited in the absence of certain symmetries that are closely related to systems-theoretic notions of observability [48,49,51,9]. One can therefore conclude that while there exist elementary examples where the ergodicity of the model fails to be inherited by the filter, such examples must be very fragile as they require both a singular observation structure and the presence of unusual symmetries, either of which is readily broken by a small perturbation of the model.…”

Section: Introductionmentioning

confidence: 99%

Phase transitions in nonlinear filtering

Rebeschini

Handel

2015

Electron. J. Probab.

Self Cite

View full text Add to dashboard Cite

It has been established under very general conditions that the ergodic properties of Markov processes are inherited by their conditional distributions given partial information. While the existing theory provides a rather complete picture of classical filtering models, many infinite-dimensional problems are outside its scope. Far from being a technical issue, the infinite-dimensional setting gives rise to surprising phenomena and new questions in filtering theory. The aim of this paper is to discuss some elementary examples, conjectures, and general theory that arise in this setting, and to highlight connections with problems in statistical mechanics and ergodic theory. In particular, we exhibit a simple example of a uniformly ergodic model in which ergodicity of the filter undergoes a phase transition, and we develop some qualitative understanding as to when such phenomena can and cannot occur. We also discuss closely related problems in the setting of conditional Markov random fields.

show abstract

Discrete time nonlinear filters with informative observations are stable

Cited by 21 publications

References 14 publications

Stability of Non-Linear Filters, Observability and Relative Entropy

Stability of Non-Linear Filters, Observability and Relative Entropy

Robustness to Incorrect Priors in Infinite Horizon Stochastic Control

Phase transitions in nonlinear filtering

Contact Info

Product

Resources

About