2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
DOI: 10.1109/cvpr.2018.00716
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices

Abstract: We introduce an extremely computation-efficient CNN architecture named ShuffleNet, which is designed specially for mobile devices with very limited computing power (e.g., 10-150 MFLOPs). The new architecture utilizes two new operations, pointwise group convolution and channel shuffle, to greatly reduce computation cost while maintaining accuracy. Experiments on ImageNet classification and MS COCO object detection demonstrate the superior performance of ShuffleNet over other structures, e.g. lower top-1 error (…)
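The channel shuffle operation named in the abstract can be sketched in a few lines: split the channels into groups, then interleave them so that each group in the next layer's group convolution receives channels from every group of the previous one. This is a minimal NumPy sketch assuming an NCHW tensor layout; the function name and signature are illustrative, not taken from the authors' code release.

```python
import numpy as np

def channel_shuffle(x, groups):
    """Interleave channels across groups (ShuffleNet-style shuffle).

    x      : array of shape (batch, channels, height, width)
    groups : number of groups used by the surrounding group convolutions
    """
    n, c, h, w = x.shape
    assert c % groups == 0, "channel count must be divisible by groups"
    # Reshape to (batch, groups, channels_per_group, H, W),
    # swap the group and per-group axes, then flatten back.
    x = x.reshape(n, groups, c // groups, h, w)
    x = x.transpose(0, 2, 1, 3, 4)
    return x.reshape(n, c, h, w)
```

With 6 channels and 3 groups, channels ordered `[0, 1, 2, 3, 4, 5]` come out as `[0, 2, 4, 1, 3, 5]`, so a following group convolution mixes information from all original groups.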


Cited by 6,527 publications (3,782 citation statements)
References 48 publications
“…Our method is a 'one-stop-shop' workflow: we collected large patient cohorts for individual tumor types, partitioning each cohort into […]tion of microsatellite instability (MSI) in colorectal cancer as a clinically relevant benchmark task 20 […] and sampled a large hyperparameter space with different commonly used deep learning models 16,18,20,21 . Unexpectedly, 'inception' 23 and 'resnet' 24 networks, which had been the previous de facto standard, were markedly outperformed by 'densenet' 25 and 'shufflenet' 14 […] were highly significantly detectable from histology alone, reaching AUCs of up to 0.82 in a three-fold patient-level cross-validation (Fig. 1e).…”
mentioning
confidence: 94%
“…They visualize the feature maps extracted by different filters and view each filter as a visual unit focusing on different visual components. […] of the ResNet-50 [28], while saving more than 75% of the parameters and 50% of the computation time. In the literature, approaches for compressing deep networks can be classified into five categories: parameter pruning [26,29,30,31], parameter quantizing [32,33,34,35,36,37,38,39,40,41], low-rank parameter factorization [42,43,44,45,46], transferred/compact convolutional filters [47,48,49,50], and knowledge distillation [51,52,53,54,55,56]. Parameter pruning and quantizing mainly focus on eliminating redundancy in the model parameters, respectively by removing redundant/uncritical parameters or by compressing the parameter space (e.g.…”
mentioning
confidence: 99%
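Of the five compression categories the excerpt lists, parameter pruning is the simplest to illustrate. The sketch below shows magnitude-based pruning, one common pruning criterion; it is a generic illustration of the category, not an implementation from any of the works cited above, and the function name is ours.

```python
import numpy as np

def magnitude_prune(weights, sparsity):
    """Zero out the smallest-magnitude fraction of a weight tensor.

    weights  : NumPy array of model parameters
    sparsity : fraction in [0, 1) of entries to set to zero
    """
    flat = np.abs(weights).ravel()
    k = int(len(flat) * sparsity)
    if k == 0:
        return weights.copy()
    # Threshold at the k-th smallest magnitude; everything at or
    # below it is treated as redundant/uncritical and removed.
    threshold = np.partition(flat, k - 1)[k - 1]
    mask = np.abs(weights) > threshold
    return weights * mask
```

In practice pruning is usually followed by fine-tuning to recover accuracy, and the surviving weights may additionally be quantized, which is the second category in the taxonomy.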
“…Thus, the Depthwise-STFT separable layer has a lower space-time complexity when compared to depthwise separable convolutions. Furthermore, we show experimentally that the proposed layer achieves better performance than many state-of-the-art models based on depthwise separable convolutions, such as MobileNet [6,7] and ShuffleNet [8,9].…”
Section: Introductionmentioning
confidence: 93%
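The complexity advantage that the last excerpt attributes to separable layers is easy to quantify for the standard depthwise separable case: a k×k depthwise convolution plus a 1×1 pointwise convolution costs roughly a factor of 1/C_out + 1/k² of a full k×k convolution. A small sketch of this standard multiply-accumulate count (the function names are ours):

```python
def conv_flops(h, w, c_in, c_out, k):
    # Multiply-accumulates for a standard k x k convolution
    # producing an h x w output map (stride 1, padding ignored).
    return h * w * c_in * c_out * k * k

def depthwise_separable_flops(h, w, c_in, c_out, k):
    # A k x k depthwise filter per input channel, followed by
    # a 1 x 1 pointwise convolution that mixes channels.
    return h * w * c_in * k * k + h * w * c_in * c_out
```

For a 32x32 map with 64 input and output channels and k = 3, the separable form needs about 13% of the full convolution's operations, matching the 1/C_out + 1/k² ratio; ShuffleNet's pointwise group convolution cuts the dominant 1x1 term further by a factor of the group count.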