2022
DOI: 10.1007/s13735-022-00233-w
|View full text |Cite
|
Sign up to set email alerts
|

Siamese coding network and pair similarity prediction for near-duplicate image detection

Abstract: Near-duplicate detection in a dataset involves finding the elements that are closest to a new query element according to a given similarity function and proximity threshold. The brute force approach is very computationally intensive as it evaluates the similarity between the queried item and all items in the dataset. The potential application domain is an image sharing website that checks for plagiarism or piracy every time a new image is uploaded. Among the various approaches, near-duplicate detection was eff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 19 publications
1
4
0
Order By: Relevance
“…In these experiments, the backbone is a Siamese network with a 2,048-dimension. As can be seen from Table 3, a threshold of 0.5 yields the best IDF1 of 0.7521, which is in agreement with previous work ( 52 , 53 ).…”
Section: Resultssupporting
confidence: 92%
See 2 more Smart Citations
“…In these experiments, the backbone is a Siamese network with a 2,048-dimension. As can be seen from Table 3, a threshold of 0.5 yields the best IDF1 of 0.7521, which is in agreement with previous work ( 52 , 53 ).…”
Section: Resultssupporting
confidence: 92%
“…Based on an empirical tuning proposed in Fisichella et al ( 52 , 53 ), the distance threshold τ is set to 0.5 in the construction of the graph feature. It is necessary, however, to conduct an ablation study with different thresholds to confirm this decision.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The train-test split is based only on the anchor sequence q and does not consider the comparison sequences . We explored two ANN techniques (FCN and CNN) with two loss functions (triplet loss, and ladder loss) to create vector representations of DNA sequences using a Siamese Neural Network (SNN) [40] architecture as shown in Fig. 3 .…”
Section: Methodsmentioning
confidence: 99%
“…A Siamese neural network (SNN), sometimes called a twin neural network, is an artificial neural network that uses 2 parallel, weight-sharing machine learning models to compute comparable embeddings. The SNN architecture has shown promising results as an FSL approach in computer vision for similarity detection [ 8 ] and duplicate identification [ 9 ]. Yet, its usage in NLP has been understudied, and, to the best of our knowledge, there have not been any studies investigating SNNs for clinical NLP.…”
Section: Introductionmentioning
confidence: 99%