Object Instance Segmentation and Fine-Grained Localization Using Hypercolumns

Hariharan, Bharath; Pont-Tuset, Jordi; Girshick, Ross; Malik, Jitendra

doi:10.1109/tpami.2016.2578328

Cited by 87 publications

(31 citation statements)

References 50 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Faster RCNN [229,230]: Although Fast RCNN significantly sped up the detection process, it still relies on external region proposals, whose computation is exposed as the new speed bottleneck in Fast RCNN. Recent work has shown that CNNs have a remarkable ability to localize objects in CONV layers [317,318,46,200,97], an ability which is weakened in the FC layers. Therefore, the selective search can be replaced by a CNN in producing region proposals.…”

Section: Region Based (Two Stage) Frameworkmentioning

confidence: 99%

“…(1) Detecting with combined features of multiple CNN layers: Many approaches, including Hypercolumns [97], HyperNet [135], and ION [11], combine features from multiple layers before making a prediction. Such feature combination is commonly accomplished via concatenation, a classic neural network idea that concatenates features from different layers, architectures which have recently become popular for semantic segmentation [177,241,97]. As shown in Fig.…”

Section: Handling Of Object Scale Variationsmentioning

confidence: 99%

See 1 more Smart Citation

Deep Learning for Generic Object Detection: A Survey

et al. 2019

View full text Add to dashboard Cite

Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful strategy for learning feature representations directly from data and have led to remarkable breakthroughs in the field of generic object detection. Given this period of rapid evolution, the goal of this paper is to provide a comprehensive survey of the recent achievements in this field brought about by deep learning techniques. More than 300 research contributions are included in this survey, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics. We finish the survey by identifying promising directions for future research.

show abstract

Section: Region Based (Two Stage) Frameworkmentioning

confidence: 99%

Section: Handling Of Object Scale Variationsmentioning

confidence: 99%

Deep Learning for Generic Object Detection: A Survey

et al. 2019

View full text Add to dashboard Cite

show abstract

“…Our approach takes advantage of the previous insights, and consists of a modularized network that exploits both the possibility of segmentation based on combinations of multi-domain information, and the feasibility of producing filters that respond to objects being referred to by processing the linguistic information. Following the spirit of [24][25] [26], we use skip connections between the downsampling process and the upsampling module to output finely-defined segmentations. We employ the concatenation strategy of [3] but include richer visual and language features.…”

Section: Recurrent Multimodal Interactionmentioning

confidence: 99%

Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries

Margffoy-Tuay

Pérez

Botero

et al. 2018

Lecture Notes in Computer Science

156

View full text Add to dashboard Cite

We address the problem of segmenting an object given a natural language expression that describes it. Current techniques tackle this task by either (i) directly or recursively merging linguistic and visual information in the channel dimension and then performing convolutions; or by (ii) mapping the expression to a space in which it can be thought of as a filter, whose response is directly related to the presence of the object at a given spatial coordinate in the image, so that a convolution can be applied to look for the object. We propose a novel method that integrates these two insights in order to fully exploit the recursive nature of language. Additionally, during the upsampling process, we take advantage of the intermediate information generated when downsampling the image, so that detailed segmentations can be obtained. We compare our method against the state-of-the-art approaches in four standard datasets, in which it surpasses all previous methods in six of eight of the splits for this task.

show abstract

“…Convolutional neural networks [8,9], originally proposed by LeCun et al for handwritten digit recognition, have been recently succeeded in image identification, detection, and segmentation tasks [10][11][12][13][14][15]. CNN is proved to have a strong ability in large scale image classification.…”

Section: Convolutional Neural Networkmentioning

confidence: 99%

Multi-Input Convolutional Neural Network for Flower Grading

Sun

Zhu

Wang

et al. 2017

Journal of Electrical and Computer Engineering

View full text Add to dashboard Cite

Flower grading is a significant task because it is extremely convenient for managing the flowers in greenhouse and market. With the development of computer vision, flower grading has become an interdisciplinary focus in both botany and computer vision. A new dataset named BjfuGloxinia contains three quality grades; each grade consists of 107 samples and 321 images. A multi-input convolutional neural network is designed for large scale flower grading. Multi-input CNN achieves a satisfactory accuracy of 89.6% on the BjfuGloxinia after data augmentation. Compared with a single-input CNN, the accuracy of multi-input CNN is increased by 5% on average, demonstrating that multi-input convolutional neural network is a promising model for flower grading. Although data augmentation contributes to the model, the accuracy is still limited by lack of samples diversity. Majority of misclassification is derived from the medium class. The image processing based bud detection is useful for reducing the misclassification, increasing the accuracy of flower grading to approximately 93.9%.

show abstract

Object Instance Segmentation and Fine-Grained Localization Using Hypercolumns

Cited by 87 publications

References 50 publications

Deep Learning for Generic Object Detection: A Survey

Deep Learning for Generic Object Detection: A Survey

Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries

Multi-Input Convolutional Neural Network for Flower Grading

Contact Info

Product

Resources

About