2022
DOI: 10.1109/tcsvt.2022.3193612
|View full text |Cite
|
Sign up to set email alerts
|

MFNet: Multiclass Few-Shot Segmentation Network With Pixel-Wise Metric Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(3 citation statements)
references
References 58 publications
0
3
0
Order By: Relevance
“…It can better discover co-occurring objects in two images through a cross-reference mechanism, helping to complete the task of few-shot segmentation. MFNet [35] is a novel multiplexed (class) encoding and decoding architecture. It successfully combines multi-scale query and multi-class support data into a single query support embedment.…”
Section: Related Workmentioning
confidence: 99%
“…It can better discover co-occurring objects in two images through a cross-reference mechanism, helping to complete the task of few-shot segmentation. MFNet [35] is a novel multiplexed (class) encoding and decoding architecture. It successfully combines multi-scale query and multi-class support data into a single query support embedment.…”
Section: Related Workmentioning
confidence: 99%
“…This requires the model to understand the spatial context, boundaries, and appearance of objects despite the limited amount of labeled data. Existing methods can be categorized into two main groups: meta-learning and metric-based learning [12]- [15]. With meta-learning, the model can quickly adapt to new categories in several sample contexts, thus better handling unseen semantic categories.…”
Section: Introductionmentioning
confidence: 99%
“…However, despite their effectiveness, pure textual information still cannot fully replicate a rich visual perceptual experience. To address this limitation, researchers have turned their attention to various visual language tasks, such as visual quizzing (Li et al; Zhang et al) [6,7], image and video caption generation (Chen F; Ghanimifard and Dobnik; Corniaet al) [8][9][10], and image-based question retrieval (Xin Yuan et al; Lu et al) [11,12]. In human conversational communication, images are crucial in compensating for information that cannot be accurately expressed through text alone.…”
Section: Introductionmentioning
confidence: 99%