Uniform Error and Posterior Variance Bounds for Gaussian Process Regression with Application to Safe Control

Lederer, Armin; Umlauft, Jonas; Hirche, Sandra

doi:10.48550/arxiv.2101.05328

Cited by 3 publications

(4 citation statements)

References 31 publications


“…We tune the hyper-parameters by solving the optimization problem (43). Besides, it is shown by Lederer et al. [33,34] that the approximation errors of GPR uniformly converge to zero with a sufficient amount of well-distributed training data.…”

Section: Learning of the Value Function: Nonnegativity-Enforced GPR (mentioning)

confidence: 99%

“…GPR stands out from many machine learning techniques for its ability to generalize well to small training sets and to provide a measure of its own inaccuracy [33,34]. Recalling the procedure in Section 3, at each iteration t, by concatenating the q training input data and the corresponding training output data into matrices Z_t and Y_t, respectively, the following training data dictionary  t is obtained:…”

Section: Learning of the Value Function: Nonnegativity-Enforced GPR (mentioning)

confidence: 99%


Learning‐based model predictive control under value iteration with finite approximation errors

Lin, Xia, Sun et al. 2023

Intl J Robust & Nonlinear


This paper proposes a novel learning‐based model predictive control (LMPC) scheme for discrete‐time nonlinear systems. It overcomes the challenge of manually designing the terminal conditions for traditional MPC and enhances the control performance. The scheme employs the value iteration (VI) in reinforcement learning (RL), and autonomously designs the terminal cost by iteratively performing value function learning and policy update under known dynamics and constraints. In contrast to the existing schemes that combine RL with MPC, the proposed scheme explicitly considers the approximation errors in each iteration. Further, a rigorous theoretical analysis is provided, including the convergence of VI, the stability and performance of the closed‐loop system. In addition, the influences of the prediction horizon and the initial terminal cost on performance are also investigated. Simulation results of a linear system verify the theoretical properties of the LMPC and show that it achieves (near‐)optimal performance. Moreover, its unique superiority over traditional MPC is fully demonstrated in a nonholonomic vehicle regulation example.


“…Note that the bound l_σ depends neither on the chosen regulator parameters nor on the initial conditions. Consider now the jump dynamics: under Assumption 2.5 of a Lipschitz continuous kernel, we can explicitly derive an upper bound on the value of σ_+^2 at each jump (see [35, Theorem 1])…”

Section: Gaussian Process-Based Adaptive Regulation (mentioning)

confidence: 99%

Adaptive Nonlinear Regulation via Gaussian Process

Gentilini, Bin, Marconi 2022

Preprint


The paper deals with the problem of output regulation of nonlinear systems by presenting a learning-based adaptive internal model-based design strategy. We borrow from the adaptive internal model design technique recently proposed in [1] and extend it by means of a Gaussian process regressor. The learning-based adaptation is performed by following an "event-triggered" logic so that hybrid tools are used to analyse the resulting closed-loop system. Unlike the approach proposed in [1] where the friend is supposed to belong to a specific finite-dimensional model set, here we only require smoothness of the ideal steady-state control action. The paper also presents numerical simulations showing how the proposed method outperforms previous approaches.


“…An alternative hypothesis is to take the support of the prior distribution of the GP as the belief space from which to seek the true function. This hypothesis has been employed in stochastic bandit problems based on GPs [33,34] and more recently has been used to establish general interpretable bounds for basic GP models [19,35]. The sample space is the largest possible space of candidate functions, and leads to bounds that can be approximated for common settings with relative ease, in comparison to the RKHS approaches.…”

Section: Error Bounds for the Univariate ResGP (mentioning)

confidence: 99%

Residual Gaussian Process: A Tractable Nonparametric Bayesian Emulator for Multi-fidelity Simulations

Wei, Shah, Wang et al. 2021

Preprint


Challenges in multi-fidelity modelling relate to accuracy, uncertainty estimation and high dimensionality. A novel additive structure is introduced in which the highest fidelity solution is written as a sum of the lowest fidelity solution and residuals between the solutions at successive fidelity levels, with Gaussian process priors placed over the low fidelity solution and each of the residuals. The resulting model is equipped with a closed-form solution for the predictive posterior, making it applicable to advanced, high-dimensional tasks that require uncertainty estimation. Its advantages are demonstrated on univariate benchmarks and on three challenging multivariate problems. It is shown how active learning can be used to enhance the model, especially with a limited computational budget. Furthermore, error bounds are derived for the mean prediction in the univariate case.
