2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00976
|View full text |Cite
|
Sign up to set email alerts
|

Classifying, Segmenting, and Tracking Object Instances in Video with Mask Propagation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
184
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 170 publications
(184 citation statements)
references
References 32 publications
0
184
0
Order By: Relevance
“…Such scenarios usually require the algorithm to be more robust. As the test set evaluation server is closed, we followed most previous works [ 3 , 10 , 11 , 13 , 14 ] and evaluated our method on the validation set. The evaluation metrics are Average Precision ( ) calculated based on multiple intersection-over-union (IoU) thresholds and Average Recall ( ) defined as the maximum recall given some fixed number of segmented instances per video.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…Such scenarios usually require the algorithm to be more robust. As the test set evaluation server is closed, we followed most previous works [ 3 , 10 , 11 , 13 , 14 ] and evaluated our method on the validation set. The evaluation metrics are Average Precision ( ) calculated based on multiple intersection-over-union (IoU) thresholds and Average Recall ( ) defined as the maximum recall given some fixed number of segmented instances per video.…”
Section: Resultsmentioning
confidence: 99%
“…We also slightly outperform SipMask [ 3 ] in both and runtime. The gap of accuracy between our method and MaskProp [ 11 ] is mainly due to the fact that Maskprop combines multiple networks and post-processing strategies which are actually time-consuming. For STEm-Seg [ 13 ] and VisTR [ 14 ], both methods process the entire video sequence at the same time.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Similarly, Jain et al [9] propagated features instead of segmentation with flow from key frames. A similar idea was applied to instance segmentation by Bertasius and Torresani [2]. Although these methods seem close, the task is different: (1) they heavily rely on dense annotation in time to learn interpolations explicitly while we cannot afford it due to high cost of expert annotation and difficulty in fine-grained annotation of imperfect frames and (2) they benchmark only on visible objects, while we are solely interested in objects with limited visibility.…”
Section: Related Workmentioning
confidence: 99%