2022
DOI: 10.1007/s11063-021-10713-5
|View full text |Cite
|
Sign up to set email alerts
|

Transformer-Based Interactive Multi-Modal Attention Network for Video Sentiment Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(6 citation statements)
references
References 41 publications
0
4
0
1
Order By: Relevance
“…The ViT model has opened a new era in DL [ 41 , 43 ]. However, the self-attention mechanism of the ViT model has second-order complexity.…”
Section: Methodsmentioning
confidence: 99%
“…The ViT model has opened a new era in DL [ 41 , 43 ]. However, the self-attention mechanism of the ViT model has second-order complexity.…”
Section: Methodsmentioning
confidence: 99%
“…Vision Transformer (ViT) modeli derin öğrenmede yeni bir dönem açmıştır [33,34]. Fakat ViT modelinin öz dikkat mekanizması ikinci dereceden karmaşıklığa sahiptir.…”
Section: Convmixer Ağ Mimarisiunclassified
“…Transformerbased multi-modality cross attention was also applied to enhance the interaction of two MR modalities and better investigate multi-modal paired attention. The head number of multi-head attention was set to eight [10].…”
Section: Pre-processing Of Mr Imagesmentioning
confidence: 99%