2022
DOI: 10.48550/arxiv.2205.08099
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep Neural Network, a Survey

Abstract: State-of-the-art deep learning models have a parameter count that reaches into the billions. Training, storing and transferring such models is energy and time consuming, thus costly. A big part of these costs is caused by training the network. Model compression lowers storage and transfer costs, and can further make training more efficient by decreasing the number of computations in the forward and/or backward pass. Thus, compressing networks also at training time while maintaining a high performance is an imp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 51 publications
0
1
0
Order By: Relevance
“…Over the years, CNN architectures have progressed and developed alternative network designs with the addition of new features and operations to make them more effective; examples include DenseNet 4 ResNet 5 and EfficientNet 6 . Many authors have reviewed or investigated the effectiveness of untrained components in CNN architectures 7,8,9,10,11 . In this work, we investigate how effective untrained convolutional layers are for feature extraction in a computational pathology task using real world data sourced from an available necrosis detection dataset.…”
Section: Introductionmentioning
confidence: 99%
“…Over the years, CNN architectures have progressed and developed alternative network designs with the addition of new features and operations to make them more effective; examples include DenseNet 4 ResNet 5 and EfficientNet 6 . Many authors have reviewed or investigated the effectiveness of untrained components in CNN architectures 7,8,9,10,11 . In this work, we investigate how effective untrained convolutional layers are for feature extraction in a computational pathology task using real world data sourced from an available necrosis detection dataset.…”
Section: Introductionmentioning
confidence: 99%