2022
DOI: 10.1109/access.2022.3183106
|View full text |Cite
|
Sign up to set email alerts
|

Cascaded MPN: Cascaded Moment Proposal Network for Video Corpus Moment Retrieval

Abstract: Video corpus moment retrieval aims to localize temporal moments corresponding to textual query in a large video corpus. Previous moment retrieval systems are largely grouped into two categories:(1) anchor-based method which presets a set of video segment proposals (via sliding window) and predicts proposal that best matches with the query, and (2) anchor-free method which directly predicts frame-level start-end time of the moment related to the query (via regression). Both methods have their own inherent weakn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 42 publications
0
1
0
Order By: Relevance
“…In an inference, following the work [19], CLNet considers the start-time and end-time distributions (i.e., p, q) via building localization joint probability distributions r in Figure 6. The r is obtained by applying matrix multiplication between p ∈ R L×1 and q ∈ R L×1 as r = pq T ∈ R L×L , where ri,j = pi qj denotes the joint probability that the inference about temporal boundary information would start at index i and end at index j along the frame axis.…”
Section: E Inferencementioning
confidence: 99%
“…In an inference, following the work [19], CLNet considers the start-time and end-time distributions (i.e., p, q) via building localization joint probability distributions r in Figure 6. The r is obtained by applying matrix multiplication between p ∈ R L×1 and q ∈ R L×1 as r = pq T ∈ R L×L , where ri,j = pi qj denotes the joint probability that the inference about temporal boundary information would start at index i and end at index j along the frame axis.…”
Section: E Inferencementioning
confidence: 99%