2022
DOI: 10.1007/978-3-031-19781-9_33

Text-Based Temporal Localization of Novel Events

Cited by 5 publications (3 citation statements)
References 72 publications
“…Charades-STA Unseen. Paul et al [9] investigated whether models can perform well on unseen videos that were not encountered before. "Unseen videos" is a term used for videos that contain queries consisting of nouns or verbs that were not included in the training set (i.e., unseen queries).…”
Section: Datasets (mentioning)
confidence: 99%
“…We demonstrate the effectiveness of our model by showing superior performance on Charades-STA [1] and QVHighlights [6] datasets. In addition, we verify the robustness of BM-DETR by conducting comprehensive experiments on three challenging datasets: Charades-CD [7], Charades-CG [8], and Charades-STA Unseen [9], containing out-of-distribution test cases that are representative of real-world scenarios.…”
Section: Introduction (mentioning)
confidence: 99%
“…Fully-Supervised Video Grounding. Many prior research employs supervised methods [21], [22], [23], [24], [25]. To achieve precise moment localization via language description, it is essential for a video grounding model to implement cross-modal alignment of videos and sentences.…”
Section: Related Work (mentioning)
confidence: 99%