2022
DOI: 10.48550/arxiv.2207.14227
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Visual Recognition by Request

Abstract: In this paper, we present a novel protocol of annotation and evaluation for visual recognition. Different from traditional settings, the protocol does not require the labeler/algorithm to annotate/recognize all targets (objects, parts, etc.) at once, but instead raises a number of recognition instructions and the algorithm recognizes targets by request. This mechanism brings two beneficial properties to reduce the burden of annotation, namely, (i) variable granularity: different scenarios can have different le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 48 publications
0
4
0
Order By: Relevance
“…With the recently proposed backbone [92], our methods achieve the new state-of-the-art results with 63.1% PartPQ and 66.5% PWQ. Compared to the recent method using separated vision transformers [57], our methods achieve better results for both ResNet50 and a larger backbone with less GFlops and simpler pipeline. Moreover, compared to Panoptic-PartFormer, Panoptic-PartFormer++ achieve better results on all three metrics, including PQ, PartPQ and PWQ, which can be a new baseline for PPS task.…”
Section: Resultsmentioning
confidence: 89%
See 3 more Smart Citations
“…With the recently proposed backbone [92], our methods achieve the new state-of-the-art results with 63.1% PartPQ and 66.5% PWQ. Compared to the recent method using separated vision transformers [57], our methods achieve better results for both ResNet50 and a larger backbone with less GFlops and simpler pipeline. Moreover, compared to Panoptic-PartFormer, Panoptic-PartFormer++ achieve better results on all three metrics, including PQ, PartPQ and PWQ, which can be a new baseline for PPS task.…”
Section: Resultsmentioning
confidence: 89%
“…We find different backbones perform differently TABLE 3: Experiment Results on CPP. Previous works [5], [57] combine results from commonly used (top), and state-of-theart methods (bottom) for semantic segmentation, instance segmentation, panoptic segmentation, and part segmentation. Metrics split into P and NP are evaluated on scene-level classes with and without parts, respectively.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations