2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.472
|View full text |Cite
|
Sign up to set email alerts
|

Fully Convolutional Instance-Aware Semantic Segmentation

Abstract: We present the first fully convolutional end-to-end solution for instance-aware semantic segmentation task. It inherits all the merits of FCNs for semantic segmentation [29] and instance mask proposal [5]. It detects and segments the object instances jointly and simultanoulsy. By the introduction of position-senstive inside/outside score maps, the underlying convolutional representation is fully shared between the two sub-tasks, as well as between all regions of interest. The proposed network is highly integra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
715
0
3

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 1,011 publications
(720 citation statements)
references
References 36 publications
2
715
0
3
Order By: Relevance
“…One-stage instance segmentation methods generate position sensitive maps that are assembled into final masks with positionsensitive pooling [3], [16] or combine semantic segmentation logits and direction prediction logits [17]. Though conceptually faster than two-stage methods, they still require repooling or other non-trivial computations (e.g., mask voting).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…One-stage instance segmentation methods generate position sensitive maps that are assembled into final masks with positionsensitive pooling [3], [16] or combine semantic segmentation logits and direction prediction logits [17]. Though conceptually faster than two-stage methods, they still require repooling or other non-trivial computations (e.g., mask voting).…”
Section: Related Workmentioning
confidence: 99%
“…Our approach might seem surprising, as the general consensus around instance segmentation is that because FCNs are translation invariant, the task needs translation variance added back in [3]. Thus methods like FCIS [3] and Mask R-CNN [2] try to explicitly add translation variance, whether it be by directional maps and position-sensitive repooling, or by putting the mask branch in the second stage so it does not have to deal with localizing instances. In our method, the only translation variance we add is to crop the final mask with the predicted bounding box.…”
Section: Emergent Behaviormentioning
confidence: 99%
“…Many factors contribute to this; among them, large datasets play crucial roles. Visual datasets with labels are used to train and evaluate machine learning models and lead to success in computer vision with novel architectures, such as AlexNet [1], Faster-RCNN [2], and FCIS [3].…”
Section: Introduction Creating Machines That Can Solve Complex Promentioning
confidence: 99%
“…Instead of labeling all pixels, it focuses on the target objects and labels only pixels of those objects. FCIS [25] is a technique developed based on fully convolutional networks (FCN). Mask R-CNN [26] is also created on top of FCN but incorporates with a proposed joint formulation.…”
Section: Deep Learning For Semantic Segmentationmentioning
confidence: 99%