2010 18th Euromicro Conference on Parallel, Distributed and Network-Based Processing
DOI: 10.1109/pdp.2010.43
Performance and Scalability of GPU-Based Convolutional Neural Networks

Cited by 202 publications (95 citation statements); References 12 publications.
“…Most previous GPU-based CNN implementations [11], [12] were hard-coded to satisfy GPU hardware constraints, whereas ours is flexible and fully online (i.e., weight updates after each image). Other flexible implementations [13] are not fully exploiting the latest GPUs. It allows for training large CNNs within days instead of months, such that we can investigate the influence of various structural parameters by exploring large parameter spaces [14] and performing error analysis on repeated experiments.…”
Section: Introduction
confidence: 99%
“…Moreover, a nonlinear activation (e.g. sigmoid, hyperbolic tangent, rectified linear units) function is taken outside the convolutional layer to strengthen the non-linearity [44]. Specifically, the major operations performed in the CNN can be summarized as:…”
Section: A Convolutional Neural Network (CNN)
confidence: 99%
“…The CNN also provides partial resistance and robustness to geometric distortions and transformations, and other 2D shape variations [2]. Hence, the CNN is specifically designed to cope with shortcomings of the traditional feature extractor that is characterized by being static, is designed independently of the trainable classifier, and is not part of the training procedure [3]. A final benefit of CNNs is that they are relatively easier to train since they have fewer parameters than fully connected MLP neural networks with the same number of hidden layers.…”
Section: Raw Input
confidence: 99%
“…The concepts of local receptive field, weight sharing, and spatial subsampling mentioned above are the three principle architectural ideas behind the design of a CNN [2,6]. In weight sharing topology, all neurons in a feature map use the same incoming set of weights (kernel weights), and feature extraction is performed by convolving the image with these kernels [3,11].…”
Section: Background on Convolutional Neural Network
confidence: 99%
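The weight-sharing idea in the quote — every neuron in a feature map reusing the same kernel — is what makes the parameter count independent of image size, which also explains the earlier remark that CNNs have fewer parameters than fully connected MLPs. A small arithmetic sketch (the image and kernel sizes are illustrative assumptions):

```python
image_h, image_w = 28, 28        # e.g. an MNIST-sized input
kernel_h, kernel_w = 5, 5
out_h, out_w = image_h - kernel_h + 1, image_w - kernel_w + 1

# Fully connected: each of the 24x24 output neurons has its own
# weight for every input pixel.
fc_params = (out_h * out_w) * (image_h * image_w)

# Weight sharing: one 5x5 kernel (plus a bias) is reused by every
# neuron in the feature map.
shared_params = kernel_h * kernel_w + 1

print(fc_params, shared_params)   # 451584 vs 26
```

The gap (451,584 weights versus 26) is why the sharing topology both shrinks the model and lets training time stay manageable as images grow.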