2022
DOI: 10.1007/978-3-031-21648-0_9
|View full text |Cite
|
Sign up to set email alerts
|

Self-supervised Vision Transformers with Data Augmentation Strategies Using Morphological Operations for Writer Retrieval

Abstract: This paper presents an unsupervised approach for writer retrieval based on clustering SIFT descriptors detected at keypoint locations resulting in pseudo-cluster labels. With those cluster labels, a residual network followed by our proposed NetRVLAD, an encoding layer with reduced complexity compared to NetVLAD, is trained on 32×32 patches at keypoint locations. Additionally, we suggest a graph-based reranking algorithm called SGR to exploit similarities of the page embeddings to boost the retrieval performanc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 32 publications
(90 reference statements)
0
1
0
Order By: Relevance
“…These methods rely on hand-crafted features [13][14][15][16][17][18][19][20][21] like SIFT descriptors [15,16,18,21]. Deep learning-based solutions, such as [22,23], tend to work in a very similar way, simply replacing the handcrafted features with learned extractors [17,18].…”
Section: Writer Retrievalmentioning
confidence: 99%
“…These methods rely on hand-crafted features [13][14][15][16][17][18][19][20][21] like SIFT descriptors [15,16,18,21]. Deep learning-based solutions, such as [22,23], tend to work in a very similar way, simply replacing the handcrafted features with learned extractors [17,18].…”
Section: Writer Retrievalmentioning
confidence: 99%