2022
DOI: 10.48550/arxiv.2211.12328
Preprint

A survey on knowledge-enhanced multimodal learning

Cited by 2 publications (4 citation statements)
“…Web knowledge encompasses both external and internal knowledge [66] and offers a significant advantage. Therefore, the work described in this article relies on the web knowledge rather than common sense knowledge sourced from ConceptNet.…”
Section: Related Work a Multimodal Machine Learning For Memes
type: mentioning
confidence: 99%
“…Prior surveys in VL learning [36,37,38,39,40,41,42] do not focus on the collaboration between knowledge and deep learning VL models. An exhaustive presentation of the knowledge-enhanced VL (KVL) topic was presented in [43] for the first time. In the current survey paper, we focus on state-of-the-art endeavors involving transformer models for the VL representation, leading to hybrid approaches when combined with external knowledge.…”
Section: Figure
type: mentioning
confidence: 99%
“…External knowledge sources are divided in two main categories, explicit and implicit [43]. They are both capable of providing factual, commonsense, temporal, lexical or other knowledge senses [44] missing from pre-trained VL models.…”
Section: Types Of External Knowledge
type: mentioning
confidence: 99%