“…The learning of kernel functions in operators is such a problem: given data tpu k , f k qu N k"1 in suitable function spaces, we would like to learn an optimal kernel function φ fitting the operator R φ puq " f to the data. Such a need for learning operators between function spaces has become vital in applications ranging from integral operators solving PDEs and image processing (see e.g., [12,20,24,29]), nonlinear operators in mean-field equation of interacting particle systems in [22,26], homogenized nonlocal operators (see e.g., [25,38,39]), just to name a few. Since there is often limited information to derive a parametric form, the kernel has to be learnt in a nonparametric fashion.…”