Abstract:We consider the problem of data classification where the training set consists of just a few data points. We explore this phenomenon mathematically and reveal key relationships between the geometry of an AI model's feature space, the structure of the underlying data distributions, and the model's generalisation capabilities. The main thrust of our analysis is to reveal the influence on the model's generalisation capabilities of nonlinear feature transformations mapping the original data into high, and possibly… Show more
“…The problem of learning from a small number of examples is strongly related to the recently discovered phenomenon of dimensionality blessing (Gorban et al, 2016) as opposed to the "curse of dimensionality" (Sutton et al, 2022). The connection between these two problems has a fundamental mathematical nature (Gorban and Tyukin, 2017).…”
“…The problem of learning from a small number of examples is strongly related to the recently discovered phenomenon of dimensionality blessing (Gorban et al, 2016) as opposed to the "curse of dimensionality" (Sutton et al, 2022). The connection between these two problems has a fundamental mathematical nature (Gorban and Tyukin, 2017).…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.