In this paper we investigate image classification under computational resource limits at test time. Two such settings are: 1. anytime classification, where the network's prediction for a test example is progressively updated, facilitating the output of a prediction at any time; and 2. budgeted batch classification, where a fixed amount of computation is available to classify a set of examples and can be spent unevenly across "easier" and "harder" inputs. In contrast to most prior work, such as the popular Viola and Jones algorithm, our approach is based on convolutional neural networks. We train multiple classifiers with varying resource demands, which we apply adaptively at test time. To maximally re-use computation between the classifiers, we incorporate them as early exits into a single deep convolutional neural network and inter-connect them with dense connectivity. To facilitate high-quality classification early on, we use a two-dimensional multi-scale network architecture that maintains coarse- and fine-level features throughout the network. Experiments on three image-classification tasks demonstrate that our framework substantially improves upon the existing state of the art in both settings.

However, the requirements of such competitions differ from those of real-world applications, which tend to incentivize resource-hungry models with high computational demands at inference time. For example, the COCO 2016 competition was won by a large ensemble of computationally intensive CNNs¹, a model likely far too computationally expensive for any resource-aware application. Although much smaller models would also obtain decent error rates, very large, computationally intensive models seem necessary to correctly classify the hard examples that make up the bulk of the remaining misclassifications of modern algorithms. To illustrate this point, Figure 1 shows two images of horses.
The left image depicts a horse in a canonical pose and is easy to classify, whereas the right image is taken from a rare viewpoint and likely lies in the tail of the data distribution. Computationally intensive models are needed to classify such tail examples correctly, but are wasteful when applied to canonical images such as the left one. In real-world applications, computation translates directly into power consumption, which should be minimized for environmental and economic reasons, and is a scarce commodity on mobile devices.

¹ http://image-net.org/challenges/talks/2016/GRMI-COCO-slidedeck.pdf
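The adaptive evaluation idea described above can be sketched in a few lines: each early-exit classifier produces logits, and evaluation stops at the first exit whose softmax confidence clears a threshold, so "easy" inputs spend less computation than "hard" ones. The sketch below is a minimal illustration, not the paper's implementation; the function name, the threshold value, and the hand-written logits standing in for real network outputs are all assumptions for demonstration purposes.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def early_exit_predict(exit_logits, threshold=0.9):
    """Return (predicted class, number of exits evaluated).

    exit_logits: logits from successive early-exit classifiers,
    ordered from cheapest to most expensive. Evaluation stops at
    the first exit whose top softmax probability >= threshold;
    otherwise the final (full-depth) classifier decides.
    """
    for i, logits in enumerate(exit_logits):
        probs = softmax(logits)
        conf = max(probs)
        if conf >= threshold:
            return probs.index(conf), i + 1
    probs = softmax(exit_logits[-1])
    return probs.index(max(probs)), len(exit_logits)

# Hypothetical logits: an "easy" input is confident at the first exit,
# a "hard" input only at the second (deeper) exit.
easy = [[4.0, 0.1, 0.1], [5.0, 0.1, 0.1]]
hard = [[1.0, 0.9, 0.8], [6.0, 0.1, 0.1]]

print(early_exit_predict(easy))  # stops after 1 exit
print(early_exit_predict(hard))  # needs both exits
```

Under a fixed batch budget, the saved computation on confident early exits is what allows the deeper, more expensive classifiers to be reserved for tail examples like the right-hand horse image.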