Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022
DOI: 10.24963/ijcai.2022/384
|View full text |Cite
|
Sign up to set email alerts
|

GL-RG: Global-Local Representation Granularity for Video Captioning

Abstract: Node classification on graphs can be formulated as the Dirichlet problem on graphs where the signal is given at the labeled nodes, and the harmonic extension is done on the unlabeled nodes. This paper considers a time-dependent version of the Dirichlet problem on graphs and shows how to improve its solution by learning the proper initialization vector on the unlabeled nodes. Further, we show that the improved solution is at par with state-of-the-art methods used for node classification. Finally, we conclude th… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 40 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…The results showed enhanced performance in comparison to the two-stage method in terms of accuracy and inference speed using the YouTube-VIS dataset. Yan et al [14] proposed a Global-Local Representation Granularity (GL-RG) scheme in the domain of video captioning. The proposed approach improved the modeling of the representation of global-local vision across video frames for caption generation.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The results showed enhanced performance in comparison to the two-stage method in terms of accuracy and inference speed using the YouTube-VIS dataset. Yan et al [14] proposed a Global-Local Representation Granularity (GL-RG) scheme in the domain of video captioning. The proposed approach improved the modeling of the representation of global-local vision across video frames for caption generation.…”
Section: Literature Reviewmentioning
confidence: 99%
“…In the past decade, the computer vision community has achieved significant progress in many tasks with the development of deep learning [1]- [4]. Among various visual tasks, instance segmentation [5] has drawn wide attention due to its importance in many emerging applications, such as autonomous driving [6]- [11], augmented reality [12], [13], and video captioning [14], [15]. Technically, it is quite challenging as it is a compound task consisting of both object detection and segmentation, each of which is a difficult task and has been studied for a long time.…”
Section: Introductionmentioning
confidence: 99%