Identity Mappings in Deep Residual Networks

He, Kai; Zhang, Xiangyu; Ren, Shouxin; Sun, Jian

doi:10.1007/978-3-319-46493-0_38

Cited by 7,492 publications

(6,142 citation statements)

References 12 publications

Supporting

Mentioning

5,708

Contrasting

Unclassified

Order By: Relevance

“…Conditional computation [Bengio et al, 2015] and adaptive computation [Graves, 2016] propose to adjust the amount of computational cost by using a policy to select data. Many of these static and dynamic techniques are used in standard deep architectures such as ResNet [He et al, 2016a] and Inception [Szegedy et al, 2017], usually with a loss of accuracy. Different from these static and dynamic techniques, our method explicitly formulates the testtime efficiency as an amortized constrained sequential decision problem such that the expected computational cost, in terms of FLOPs cost, can be greatly reduced with even improved accuracy by adaptively assigning training examples with various difficulty to their best classifiers.…”

Section: Related Workmentioning

confidence: 99%

“…Deep residual network [He et al, 2016a] has been widely used in image classification field since it was proposed. We choose ResNet as our model's baseline because we can easily build a sequence of networks from shallow to deep by adjusting the number of units in each block.…”

Section: Cascaded Classifiers Using Resnetmentioning

confidence: 99%

“…Since the first success of deep convolutional neural network in the ImageNet challenge, more complex architectures [Simonyan and Zisserman, 2014;He et al, 2016a;Szegedy et al, 2017] have been proposed to further improve performance, but often at the cost of more expensive computation. However, in many real-world scenarios, such as vision-based robotics and mobile vision applications, we encounter a significant constraint of energy or computational cost for real-time inference.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Energy-efficient Amortized Inference with Cascaded Deep Classifiers

Guan

Liu

et al. 2018

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence

View full text Add to dashboard Cite

Deep neural networks have been remarkable successful in various AI tasks but often cast high computation and energy cost for energy-constrained applications such as mobile sensing. We address this problem by proposing a novel framework that optimizes the prediction accuracy and energy cost simultaneously, thus enabling effective cost-accuracy trade-off at test time. In our framework, each data instance is pushed into a cascade of deep neural networks with increasing sizes, and a selection module is used to sequentially determine when a sufficiently accurate classifier can be used for this data instance. The cascade of neural networks and the selection module are jointly trained in an end-toend fashion by the REINFORCE algorithm to optimize a trade-off between the computational cost and the predictive accuracy. Our method is able to simultaneously improve the accuracy and efficiency by learning to assign easy instances to fast yet sufficiently accurate classifiers to save computation and energy cost, while assigning harder instances to deeper and more powerful classifiers to ensure satisfiable accuracy. Moreover, we demonstrate our method's effectiveness with extensive experiments on CIFAR-10/100, ImageNet32x32 and original ImageNet dataset.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Cascaded Classifiers Using Resnetmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Energy-efficient Amortized Inference with Cascaded Deep Classifiers

Guan

Liu

et al. 2018

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence

View full text Add to dashboard Cite

show abstract

“…One of such architectures is a novel ultra-deep residual learning network (ResNet) [4]. This architecture can be implemented by adding so called 'shortcut connections' [5] which skip one or more layers. They perform a mapping so that their outputs are added to the outputs of the stacked layes.…”

Section: Convolutional Neural Networkmentioning

confidence: 99%

Artificial Neural Networks and Machine Learning – ICANN 2017

2017

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. Images are an important data source for diagnosis and treatment of oral diseases. The manual classification of images may lead to misdiagnosis or mistreatment due to subjective errors. In this paper an image classification model based on Convolutional Neural Network is applied to Quantitative Light-induced Fluorescence images. The deep neural network outperforms other state of the art shallow classification models in predicting labels derived from three different dental plaque assessment scores. The model directly benefits from multi-channel representation of the images resulting in improved performance when, besides the Red colour channel, additional Green and Blue colour channels are used.

show abstract

“…For one, spelling errors are quite prevalent in learners' written production (Kochmar, 2011). Additionally, spelling errors have been shown to be influenced by phonological L1 transfer (Grigonytė and Hammarberg, 2014 (He et al, 2015(He et al, , 2016. Such skip connections facilitate error propagation to earlier layers in the network, which allows for building deeper networks.…”

Section: Spelling Featuresmentioning

confidence: 99%

Neural Networks and Spelling Features for Native Language Identification

Bjerva¹,

Grigonytė²,

Östling³

et al. 2017

Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications

View full text Add to dashboard Cite

We present the RUG-SU team's submission at the Native Language Identification Shared Task 2017. We combine several approaches into an ensemble, based on spelling error features, a simple neural network using word representations, a deep residual network using word and character features, and a system based on a recurrent neural network. Our best system is an ensemble of neural networks, reaching an F1 score of 0.8323. Although our system is not the highest ranking one, we do outperform the baseline by far.

show abstract

Identity Mappings in Deep Residual Networks

Cited by 7,492 publications

References 12 publications

Energy-efficient Amortized Inference with Cascaded Deep Classifiers

Energy-efficient Amortized Inference with Cascaded Deep Classifiers

Artificial Neural Networks and Machine Learning – ICANN 2017

Neural Networks and Spelling Features for Native Language Identification

Contact Info

Product

Resources

About