2022 International Conference on Image Processing, Computer Vision and Machine Learning (ICICML) 2022
DOI: 10.1109/icicml57342.2022.10009863

Modality-specific Adaptive Scaling Method for Cross-modal Retrieval

Cited by 1 publication (1 citation statement)
References 13 publications
“…Various steps are taken to enhance the effectiveness and accuracy of ViT algorithm including adjustment of model architecture, initialization of weights, and tuning hyperparameters after various iterations of model training. Adaptive scaling [29] was introduced in ViT architecture to enhance model performance through the process of image serialization [30]. Another novelty of this work is introduction of contrastive learning and adaptive scaling in ViT model.…”
Section: Methods
Citation type: mentioning
confidence: 99%