2020
DOI: 10.1109/tcsvt.2019.2940647
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Modal Deep Analysis for Multimedia

Abstract: With the rapid development of Internet and multimedia services in the past decade, a huge amount of usergenerated and service provider-generated multimedia data become available. These data are heterogeneous and multi-modal in nature, imposing great challenges for processing and analyzing them. Multi-modal data consist of a mixture of various types of data from different modalities such as texts, images, videos, audios etc. In this article, we present a deep and comprehensive overview for multi-modal analysis … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(4 citation statements)
references
References 134 publications
(155 reference statements)
0
4
0
Order By: Relevance
“…The closer consumers are to the pandemic-affected region/province or city, the more frequently they search to obtain information through convenient, accessible social media, likely to alleviate the fear of the unknown heightened by the information asymmetry. The multi-modal presentations on social media platforms [49] that are visually and audibly stimulating, the feature that "everyone has a microphone", and the possibility that information is untrue pose a monitoring challenge and could mislead the social media audience. The results show that social media use is a path for regional/province epidemics to affect consumer perceptions of the safety of online food purchases, supporting Hypothesis 3.…”
Section: Regional Effectmentioning
confidence: 99%
“…The closer consumers are to the pandemic-affected region/province or city, the more frequently they search to obtain information through convenient, accessible social media, likely to alleviate the fear of the unknown heightened by the information asymmetry. The multi-modal presentations on social media platforms [49] that are visually and audibly stimulating, the feature that "everyone has a microphone", and the possibility that information is untrue pose a monitoring challenge and could mislead the social media audience. The results show that social media use is a path for regional/province epidemics to affect consumer perceptions of the safety of online food purchases, supporting Hypothesis 3.…”
Section: Regional Effectmentioning
confidence: 99%
“…Many works [19], [20], [24], [43]- [46] consider aggregating information from multi-modal data for various image and video content analysis tasks, such as video action recognition, video anomaly detection and localization, and so on. For example, Liu et al [19] and Dan et al [20] used the skeleton data and RGB frames for action recognition.…”
Section: Multi-modal Fusionmentioning
confidence: 99%
“…Formally, synsets refer to sets of instances having the same meanings. The synset induction task plays an important role in the domain of multimodal machine learning [1][2][3]. Take the image captioning task [4][5][6] as an example, in which the machine algorithm attempts to generate a descriptive sentence for a given image.…”
Section: Introductionmentioning
confidence: 99%