“…Therefore, in the following, we only consider noninformative prior distributions, such as uniform distributions, which typically are used when there is no prior information about the parameters. By applying the Bayes theorem and given training data, D = (x, y), the prior distribution can be updated to a posterior probability distribution p(θ|D) = p(θ|x, y) (Gawlikowski et al, 2021). Then, given new observations of the input data, x', a probability distribution over a new realization, y', of the model output is given by:…”