2021
DOI: 10.48550/arxiv.2111.04578
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Improved Regularization and Robustness for Fine-tuning in Neural Networks

Abstract: A widely used algorithm for transfer learning is fine-tuning, where a pre-trained model is fine-tuned on a target task with a small amount of labeled data. When the capacity of the pre-trained model is much larger than the size of the target data set, fine-tuning is prone to overfitting and "memorizing" the training labels. Hence, an important question is to regularize fine-tuning and ensure its robustness to noise. To address this question, we begin by analyzing the generalization properties of fine-tuning. W… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 43 publications
0
3
0
Order By: Relevance
“…Model-Based DTL Fine-Tuning [43,[45][46][47][48][49][50] Self-Training [51][52][53] Transformer Mechanism [54][55][56][57][58] Share and fine-tune the parameters of deep learning models Discrepancy-Based DTL Dual-Stream Architecture [59][60][61][62][63][64][65][66][67][68][69][70][71][72] Operate on Image Features [73][74][75] Reduce feature discrepancies between source and target domains by DNN GAN-Based DTL Feature Extraction [76][77][78][79][80][81] Feature Transformation [82][83][84][85][86][87][88][89][90] Extract domain invariant features by generative adversarial networks Relational-Based DTL Cross-Domain Relationship …”
Section: Dtl Approaches Subcategories Brief Descriptionmentioning
confidence: 99%
See 2 more Smart Citations
“…Model-Based DTL Fine-Tuning [43,[45][46][47][48][49][50] Self-Training [51][52][53] Transformer Mechanism [54][55][56][57][58] Share and fine-tune the parameters of deep learning models Discrepancy-Based DTL Dual-Stream Architecture [59][60][61][62][63][64][65][66][67][68][69][70][71][72] Operate on Image Features [73][74][75] Reduce feature discrepancies between source and target domains by DNN GAN-Based DTL Feature Extraction [76][77][78][79][80][81] Feature Transformation [82][83][84][85][86][87][88][89][90] Extract domain invariant features by generative adversarial networks Relational-Based DTL Cross-Domain Relationship …”
Section: Dtl Approaches Subcategories Brief Descriptionmentioning
confidence: 99%
“…One is the fine-tuning model. which fine-tunes the parameters of source-domain networks to achieve good performance in target domains [43,[45][46][47][48][49][50]. The second is the self-training approach, which is adopted to overcome the limitations of the fine-tuning model in the case of data enhancement and annotation increases [51][52][53].…”
Section: Model-based Dtlmentioning
confidence: 99%
See 1 more Smart Citation