2021
DOI: 10.1007/s11063-021-10537-3
|View full text |Cite
|
Sign up to set email alerts
|

PPIS-JOIN: A Novel Privacy-Preserving Image Similarity Join Method

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 76 publications
0
4
0
Order By: Relevance
“…In the self-attention blocks, the image regions feature and words feature are finally updated as follows, R * = (Att r2r R) × r imp (18) W * = (Att w2w W) × w imp (19) The details of accumulated importance are described in Section 3.3.5.…”
Section: Self-attention With Conditioning Gatesmentioning
confidence: 99%
See 1 more Smart Citation
“…In the self-attention blocks, the image regions feature and words feature are finally updated as follows, R * = (Att r2r R) × r imp (18) W * = (Att w2w W) × w imp (19) The details of accumulated importance are described in Section 3.3.5.…”
Section: Self-attention With Conditioning Gatesmentioning
confidence: 99%
“…At present, multi-modal learning has bridged the gap between visual and language and has been widely concerned [1][2][3][4][5]. Remarkable progress has been made in many multi-modal learning tasks, e.g., image captioning [6][7][8][9], video captioning [10][11][12], cross-modal retrieval [13][14][15][16][17][18][19][20][21][22], and visual question answering(VQA) [7,[23][24][25][26][27][28][29][30][31].…”
Section: Introductionmentioning
confidence: 99%
“…Despite the efficiency of feature learning, the shallow model cannot capture non-linear highlevel semantic features, which impact retrieval accuracy. Thanks for the rapid development of deep learning technology, lots of deep model based cross-modal retrieval approaches have been proposed [5,18,42]. Compared with traditional methods, these deep models have stronger capabilities to learn non-linear semantic correlation from multi-modal contents.…”
Section: Introductionmentioning
confidence: 99%
“…The GraphSAGE model is an inductive learning framework that can efficiently generate unknown vertices embedding by using the attribute information of vertices [6,7]. It is used to derive the user trust relationships from the original social network that hold both local and global information of the social network [8][9][10], and a graph-embedded model-based collaborative filtering recommendation algorithm is proposed. Intuitively, the low-dimensional feature representation of user nodes in social networks can be learned through graph embedding and can be integrated into traditional social network-based recommendation algorithms to address the problems of coarse granularity.…”
Section: Introductionmentioning
confidence: 99%