Visual Recognition by Request

Tang, Chufeng; Xie, Lei; Zhang, Xiaopeng; Hu, Xiaolin

doi:10.48550/arxiv.2207.14227

Cited by 1 publication

(4 citation statements)

References 48 publications


“…With the recently proposed backbone [92], our methods achieve the new state-of-the-art results with 63.1% PartPQ and 66.5% PWQ. Compared to the recent method using separated vision transformers [57], our methods achieve better results for both ResNet50 and a larger backbone with less GFlops and simpler pipeline. Moreover, compared to Panoptic-PartFormer, Panoptic-PartFormer++ achieve better results on all three metrics, including PQ, PartPQ and PWQ, which can be a new baseline for PPS task.…”

Section: Results (mentioning)

confidence: 89%

“…We find different backbones perform differently TABLE 3: Experiment Results on CPP. Previous works [5], [57] combine results from commonly used (top), and state-of-the-art methods (bottom) for semantic segmentation, instance segmentation, panoptic segmentation, and part segmentation. Metrics split into P and NP are evaluated on scene-level classes with and without parts, respectively.…”

Section: Results (mentioning)

confidence: 99%

“…Besides, they use existing panoptic segmentation algorithms with part semantic segmentation as an isolated subnetwork. Recently, there is another work [57] formulating PPS tasks as multi-level recognition by request. However, it still uses the two separated models to handle part and things segmentation.…”

Section: Related Work (mentioning)

confidence: 99%

“…2, compared with previous PartPQ and HPQ, our new metric fully considers all five properties including pixel-level evaluation, region-level evaluation, part-whole evaluation, decouples the errors and balance the part-scene segments. Notice that recent work [57] also proposes hierarchical panoptic quality (HPQ) that can measure the accuracy of segmentation in different depths. However, it still does not well balance the ratio of part segments.…”

Section: Part-whole Quality Metric (mentioning)

confidence: 99%


PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation

Li, Xu, Yang, et al. 2023

Preprint


Panoptic Part Segmentation (PPS) unifies panoptic segmentation and part segmentation into one task. Previous works utilize separated approaches to handle thing, stuff, and part predictions without shared computation and task association. We aim to unify these tasks at the architectural level, designing the first end-to-end unified framework named Panoptic-PartFormer. Moreover, we find that the previous metric, PartPQ, is biased toward PQ. To handle both issues, we make the following contributions: Firstly, we design a meta-architecture that decouples part features from thing/stuff features. We model things, stuff, and parts as object queries and directly learn to optimize all three forms of prediction as a unified mask prediction and classification problem. We term our model Panoptic-PartFormer. Secondly, we propose a new metric, Part-Whole Quality (PWQ), to better measure this task from both pixel-region and part-whole perspectives. It can also decouple the error for part segmentation and panoptic segmentation. Thirdly, inspired by Mask2Former, based on our meta-architecture, we propose Panoptic-PartFormer++ and design a new part-whole cross attention scheme to further boost part segmentation quality. We design a new part-whole interaction method using masked cross attention. Finally, the extensive ablation studies and analysis demonstrate the effectiveness of both Panoptic-PartFormer and Panoptic-PartFormer++. Compared with the previous Panoptic-PartFormer, our Panoptic-PartFormer++ achieves 2% PartPQ and 3% PWQ improvements on the Cityscapes PPS dataset and 5% PartPQ on the Pascal Context PPS dataset. On both datasets, Panoptic-PartFormer++ achieves new state-of-the-art results with a significant cost drop of 70% in GFlops and 50% in parameters. Our models can serve as a strong baseline and aid future research in PPS. Code will be available at https://github.com/lxtGH/Panoptic-PartFormer.
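For context on the metrics named in the abstract, PartPQ and PWQ both build on standard panoptic quality (PQ), which factors into a segmentation-quality term (mean IoU over matched segments) and a recognition-quality term (an F1-style score). The following is a minimal sketch of plain per-class PQ only, assuming predicted and ground-truth segments have already been matched at IoU > 0.5; the function name and arguments are hypothetical, and it does not implement PartPQ, HPQ, or PWQ.

```python
def panoptic_quality(matched_ious, num_false_positives, num_false_negatives):
    """Return (PQ, SQ, RQ) for one class.

    matched_ious: IoU of each matched (prediction, ground truth) pair.
    Assumption: matching is done elsewhere; a pair counts as a true
    positive only if its IoU exceeds 0.5, as in standard PQ.
    """
    if any(iou <= 0.5 or iou > 1.0 for iou in matched_ious):
        raise ValueError("matched IoUs must lie in (0.5, 1.0]")

    tp = len(matched_ious)
    denom = tp + 0.5 * num_false_positives + 0.5 * num_false_negatives
    if denom == 0:
        # No segments of this class in prediction or ground truth.
        return 0.0, 0.0, 0.0

    sq = sum(matched_ious) / tp if tp else 0.0  # segmentation quality
    rq = tp / denom                             # recognition quality
    return sq * rq, sq, rq


if __name__ == "__main__":
    pq, sq, rq = panoptic_quality([0.8, 0.6], 1, 1)
    print(f"PQ={pq:.3f} SQ={sq:.3f} RQ={rq:.3f}")
```

Because PQ is the product of the two terms, a change in either mask accuracy or detection counts moves the score, which is the kind of error coupling the abstract says PWQ is designed to separate for parts and wholes.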
