Topological Properties of the Set of Functions Generated by Neural Networks of Fixed Size

Petersen, Philipp; Raslan, Mones; Voigtlaender, Felix

doi:10.1007/s10208-020-09461-0

Cited by 56 publications

(58 citation statements)

References 36 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…MAP means a polynomial model fit was carried out first, DATA means that the ISF was directly fitted to the data, O(α) indicates that order α polynomials were used deep neural networks [5,12], or other kinds of nonlinear approximation methods [9], that allow to represent high-dimensional functions with reasonable efficiency as opposed to polynomials. The challenge with nonlinear approximations, in particular with neural networks, is that they can be difficult to fit to data, because the dis-tance between parameters that provide small improvements in accuracy can be large and therefore not easy to find [21]. Nevertheless, deep neural networks have enabled great advances in many fields of engineering and therefore this approach will be explored elsewhere.…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Invariant spectral foliations with applications to model order reduction and synthesis

Szalai

2020

Nonlinear Dyn

View full text Add to dashboard Cite

The paper introduces a technique that decomposes the dynamics of a nonlinear system about an equilibrium into low-order components, which then can be used to reconstruct the full dynamics. This is a nonlinear analogue of linear modal analysis. The dynamics is decomposed using Invariant Spectral Foliation (ISF), which is defined as the smoothest invariant foliation about an equilibrium and hence unique under general conditions. The conjugate dynamics of an ISF can be used as a reduced order model. An ISF can be fitted to vibration data without carrying out a model identification first. The theory is illustrated on a analytic example and on free-vibration data of a clamped-clamped beam.

show abstract

Section: Discussionmentioning

confidence: 99%

“…We can accurately identify the dynamics on an ISF and determine its instantaneous damping ratio (22) and angular frequency (21). It is, however, not possible to attach a unique amplitude to a leaf within a foliation.…”

Section: The Backbone and Damping Curves Of An Isfmentioning

confidence: 99%

Invariant spectral foliations with applications to model order reduction and synthesis

Szalai

2020

Nonlinear Dyn

View full text Add to dashboard Cite

show abstract

“…We would like to emphasize that the finite-dimensional problem (5) is not any easier than the infinite-dimensional problem (4); they simply require different techniques. In particular, our results do not follow from the results in [3,7,14] for infinitedimensional spaces -we will have more to say about this in Section 2.…”

Section: Introductionmentioning

confidence: 58%

Best k-Layer Neural Network Approximations

2021

View full text Add to dashboard Cite

We show that the empirical risk minimization (ERM) problem for neural networks has no solution in general. Given a training set s 1 , . . . , s n ∈ R p with corresponding responses t 1 , . . . , t n ∈ R q , fitting a k-layer neural network ν θ : R p → R q involves estimation of the weights θ ∈ R m via an ERM:We show that even for k = 2, this infimum is not attainable in general for common activations like ReLU, hyperbolic tangent, and sigmoid functions. In addition, we deduce that if one attempts to minimize such a loss function in the event when its infimum is not attainable, it necessarily results in values of θ diverging to ±∞. We will show that for smooth activations σ(x) = 1/ 1 + exp(−x) and σ(x) = tanh(x), such failure to attain an infimum can happen on a positive-measured subset of responses. For the ReLU activation σ(x) = max(0, x), we completely classify cases where the ERM for a best two-layer neural network approximation attains its infimum. In recent applications of neural networks, where overfitting is commonplace, the failure to attain an infimum is avoided by ensuring that the system of equations t i = ν θ (s i ), i = 1, . . . , n, has a solution. For a two-layer ReLU-activated network, we will

show abstract

“…The construction of Example 4 parallels the formulation given in [46,47]. However, in [47] elements of F are referred to as neural networks and functions in N N (F , ) are called their realizations.…”

Section: Remarkmentioning

confidence: 95%

The Universal Approximation Property

Kratsios

2021

Ann Math Artif Intell

View full text Add to dashboard Cite

The universal approximation property of various machine learning models is currently only understood on a case-by-case basis, limiting the rapid development of new theoretically justified neural network architectures and blurring our understanding of our current models’ potential. This paper works towards overcoming these challenges by presenting a characterization, a representation, a construction method, and an existence result, each of which applies to any universal approximator on most function spaces of practical interest. Our characterization result is used to describe which activation functions allow the feed-forward architecture to maintain its universal approximation capabilities when multiple constraints are imposed on its final layers and its remaining layers are only sparsely connected. These include a rescaled and shifted Leaky ReLU activation function but not the ReLU activation function. Our construction and representation result is used to exhibit a simple modification of the feed-forward architecture, which can approximate any continuous function with non-pathological growth, uniformly on the entire Euclidean input space. This improves the known capabilities of the feed-forward architecture.

show abstract

Topological Properties of the Set of Functions Generated by Neural Networks of Fixed Size

Cited by 56 publications

References 36 publications

Invariant spectral foliations with applications to model order reduction and synthesis

Invariant spectral foliations with applications to model order reduction and synthesis

Best k-Layer Neural Network Approximations

The Universal Approximation Property

Contact Info

Product

Resources

About