2022
DOI: 10.48550/arxiv.2210.08908
Preprint

Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

Cited by 1 publication (3 citation statements)
“…Song et al [53] argued that the semantic ambiguity exists in the visual modality. Recently, semantic enhancement based methods have been dramatically developed, such as memory based methods [29], [54] and external tools based methods [3], [15], [17]. For example, Li et al [54] introduced global memory banks to store features across two modalities, which are employed to enhance the feature representations.…”
Section: B. Semantic Enhancement Methods for ITR (mentioning)
confidence: 99%
“…Ji et al [29] proposed a heterogeneous memory enhanced graph reasoning network to learn more discriminative and robust representations. Ge et al [17] designed the intra-modal spatial and semantic graphs to enhance their semantic representations, where the visual scene graphs are generated by the off-the-shelf Neural Motifs [55] tool. The existing Transformer-based methods almost design a type of cross-modal interaction architecture to complement or enhance the modality-specific representation, such as [12], [14], [56].…”
Section: B. Semantic Enhancement Methods for ITR (mentioning)
confidence: 99%