2020
DOI: 10.1016/j.neunet.2020.05.034

Block-term tensor neural networks

Abstract: Deep neural networks (DNNs) have achieved outstanding performance in a wide range of applications, e.g., image classification, natural language processing, etc. Despite the good performance, the huge number of parameters in DNNs brings challenges to efficient training of DNNs and also their deployment in low-end devices with limited computing resources. In this paper, we explore the correlations in the weight matrices, and approximate the weight matrices with the low-rank block-term tensors. We name the new co…
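The core idea in the abstract, representing a fully connected layer's weight matrix as a sum of low-rank Tucker ("block") terms, can be sketched in a few lines. The NumPy snippet below is only an illustration under assumed, made-up sizes (the modes m1, m2, n1, n2, the rank r, the block count, and the helper bt_reconstruct are all hypothetical), not the paper's actual layer or training procedure; it shows how a block-term format reconstructs the weight and how many parameters it stores compared with the dense matrix.

```python
import numpy as np

# Hypothetical sizes: W is (m1*m2) x (n1*n2), approximated by `blocks` Tucker
# terms, each with rank r in every mode (an illustrative block-term format).
m1, m2, n1, n2, r, blocks = 4, 8, 4, 8, 2, 3

rng = np.random.default_rng(0)
cores   = [rng.standard_normal((r, r, r, r)) for _ in range(blocks)]        # learned in practice
factors = [[rng.standard_normal((dim, r)) for dim in (m1, m2, n1, n2)]      # learned in practice
           for _ in range(blocks)]

def bt_reconstruct(cores, factors):
    """Sum of Tucker terms: W[i1,i2,j1,j2] = sum_b G_b x_1 U1 x_2 U2 x_3 U3 x_4 U4."""
    W = np.zeros((m1, m2, n1, n2))
    for G, (U1, U2, U3, U4) in zip(cores, factors):
        W += np.einsum('abcd,ia,jb,kc,ld->ijkl', G, U1, U2, U3, U4)
    return W.reshape(m1 * m2, n1 * n2)

W_hat = bt_reconstruct(cores, factors)
x = rng.standard_normal(n1 * n2)
y = W_hat @ x                                            # forward pass with the compressed weights

dense_params = m1 * m2 * n1 * n2                         # 1024
bt_params = blocks * (r**4 + r * (m1 + m2 + n1 + n2))    # 192
print(y.shape, dense_params, bt_params)
```

In practice the factorized weight is never reconstructed explicitly; the input is contracted directly with the small factors and cores, which is where the memory and compute savings come from.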

Cited by 22 publications (5 citation statements). References 59 publications.

“…Employing TT on Conv layers is introduced in [38], where the 4D kernel tensor is reshaped to size […] and the input feature maps are reshaped to […]. In the feedforward phase, the tensorized input is contracted with each TT-core one by one.…”
Section: Fig. 4 A Fourth-Order Tensor in TT Format
confidence: 99%
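The "contracted with each TT-core one by one" step in this snippet can be made concrete with a small sketch. The NumPy code below uses assumed mode sizes and TT-ranks (in_modes, out_modes, ranks, and tt_forward are hypothetical names) and shows a generic TT-matrix acting on an already-tensorized input, after the reshaping the snippet describes; it is not the cited paper's actual convolution routine.

```python
import numpy as np

# Hypothetical shapes: the weight acts as a TT-matrix with input modes
# (n1, n2, n3) and output modes (m1, m2, m3); TT-core k has shape
# (r_{k-1}, n_k, m_k, r_k) with boundary ranks r_0 = r_3 = 1.
in_modes, out_modes, ranks = (4, 4, 4), (3, 3, 3), (1, 2, 2, 1)

rng = np.random.default_rng(0)
cores = [rng.standard_normal((ranks[k], in_modes[k], out_modes[k], ranks[k + 1]))
         for k in range(len(in_modes))]

def tt_forward(cores, x):
    """Contract the tensorized input with each TT-core one by one."""
    t = x[..., None]                         # append the dummy boundary rank r_0 = 1
    for G in cores:                          # G: (r_{k-1}, n_k, m_k, r_k)
        # absorb one input mode and the current rank, emit one output mode and the next rank
        t = np.einsum('n...r,rnms->...ms', t, G)
    return t[..., 0]                         # drop the dummy boundary rank r_d = 1

x = rng.standard_normal(in_modes)            # tensorized input
y = tt_forward(cores, x)
print(y.shape)                               # (3, 3, 3)
```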
“…In this setting, other types of decompositions have also been explored, including Tensor-Ring [131] and Block-Term Decomposition [132]. This strategy has also been extended to parametrize other types of layers [133].…”
Section: A. Parameterizing Fully-Connected Layers
confidence: 99%
“…The groundbreaking works (Novikov et al., 2015; Garipov et al., 2016) demonstrate that the low-order parameter structures can be efficiently compressed via tensor-train decomposition (Oseledets, 2011) by first reshaping the structures into a higher-order tensor. This idea is later extended in two directions: tensor-train decomposition is used to compress LSTM/GRU layers in recurrent neural networks (Yang et al., 2017), higher-order recurrent neural networks (Yu et al., 2017; Su et al., 2020), and 3D convolutional layers (Wang et al., 2020); other decompositions are also explored for better compression, such as tensor-ring decomposition (Zhao et al., 2016) and block-term decomposition (Ye et al., 2020). et al. (2015) proposed to train the student network with the teacher network's logits (the vector before the softmax layer).…”
Section: Model Compression of Neural Network
confidence: 99%
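As a rough, back-of-the-envelope illustration of the compression-by-reshaping idea this snippet attributes to Novikov et al. (2015): the layer size, mode sizes, and uniform TT-rank below are invented for illustration and do not come from any of the cited papers.

```python
# Hypothetical example: a 1024 x 1024 dense layer vs. the same matrix stored as a
# TT-matrix after reshaping into five input modes and five output modes of size 4,
# with a uniform internal TT-rank r (boundary ranks are 1).
in_modes  = [4, 4, 4, 4, 4]      # 4**5 = 1024 input units
out_modes = [4, 4, 4, 4, 4]      # 4**5 = 1024 output units
r = 8

dense_params = 1024 * 1024       # 1,048,576 parameters in the dense matrix
ranks = [1] + [r] * (len(in_modes) - 1) + [1]
tt_params = sum(ranks[k] * in_modes[k] * out_modes[k] * ranks[k + 1]
                for k in range(len(in_modes)))   # 3,328 for these sizes

print(dense_params, tt_params)
```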