A Bio-Inspired Incremental Learning Architecture for Applied Perceptual Problems

Gepperth, Alexander; Karaoguz, Cem

doi:10.1007/s12559-016-9389-5

Cited by 134 publications

(90 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However to be fair, this parameter search would also have to be performed for convolutional neural network (CNN) and is a property of all deep architectures based on local receptive fields. Given that prototype-based methods in machine learning have a number of highly desirable properties, such as online and incremental learning capacity [7,6], a simple probabilistic interpretation [12] and a natural way of processing multi-class problems, the reduction of resource requirements even when treating complex visual problems seems an important step towards wide-spread use of prototype-based machine learning methods.…”

Section: Resultsmentioning

confidence: 99%

“…1, we use a prototype-based learning algorithm which is loosely based on the self-organizing map model, see [7]. Inputs are represented by graded neural activities arranged in maps organized on a two-dimensional grid lattice.…”

Section: Methodsmentioning

confidence: 99%

“…For any map X, the map activity z X ij ∈ Z X at position (i, j) is then derived from the Euclidean distance between the unit prototype w X ij and the current input x: where, as described in [7], g κ (·) is a Gaussian function with an adaptive parameter κ that converts distances into the [0, 1] interval, and f(·) is a monotonous non-linear transfer function, defined as :…”

Section: Methodsmentioning

confidence: 99%

“…A very popular prototype-based method in computer vision in is particle filtering [5], where a continuous, evolving probability density function is described and updated as a set of prototypes (here denoted particles) whose local density represents local probability density. Prototype-based methods are well suited for incremental learning [6,7] since prototypes have a very obvious interpretation, and can thus be manipulated easily, e.g., by adding, adapting or removing prototypes (see [8] for a precise definition of incremental learning).…”

Section: Introductionmentioning

confidence: 99%

“…An obvious problem of such flat architectures is the curse of dimensionality: complex probability distributions in high-dimensional spaces may conceivably require a great number of prototypes to be well approximated, so the memory requirements of flat prototype-based learning can become excessive depending on the problem at hand [9]. This study generalizes "flat" prototype-based learning as presented in [7] to a "deep" architecture (see Fig. 1), with localized receptive fields in the lower layers, just as it is the case in convolutional neural networks (CNNs, see [10]).…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Computational Advantages of Deep Prototype-Based Learning

Hecht

Gepperth

2016

Artificial Neural Networks and Machine Learning – ICANN 2016

Self Cite

View full text Add to dashboard Cite

Abstract. We present a deep prototype-based learning architecture which achieves a performance that is competitive to a conventional, shallow prototype-based model but at a fraction of the computational cost, especially w.r.t. memory requirements. As prototype-based classification and regression methods are typically plagued by the exploding number of prototypes necessary to solve complex problems, this is an important step towards efficient prototype-based classification and regression. We demonstrate these claims by benchmarking our deep prototype-based model on the well-known MNIST dataset.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Computational Advantages of Deep Prototype-Based Learning

Hecht

Gepperth

2016

Artificial Neural Networks and Machine Learning – ICANN 2016

Self Cite

View full text Add to dashboard Cite

show abstract

Materials Research at Shanghai Jiao Tong University

Chen

Feng

2015

Advanced Materials

View full text Add to dashboard Cite

Transformer architectures have exhibited remarkable performance in image super-resolution (SR). Since the quadratic computational complexity of the self-attention (SA) in Transformer, existing methods tend to adopt SA in a local region to reduce overheads. However, the local design restricts the global context exploitation, which is critical for accurate image reconstruction. In this work, we propose the Recursive Generalization Transformer (RGT) for image SR, which can capture global spatial information and is suitable for high-resolution images. Specifically, we propose the recursive-generalization self-attention (RG-SA). It recursively aggregates input features into representative feature maps, and then utilizes cross-attention to extract global information. Meanwhile, the channel dimensions of attention matrices (query, key, and value) are further scaled for a better trade-off between computational overheads and performance. Furthermore, we combine the RG-SA with local self-attention to enhance the exploitation of the global context, and propose the hybrid adaptive integration (HAI) for module integration. The HAI allows the direct and effective fusion between features at different levels (local or global). Extensive experiments demonstrate that our RGT outperforms recent state-of-the-art methods.

show abstract