2019
DOI: 10.1613/jair.1.11338
|View full text |Cite
|
Sign up to set email alerts
|

Multi-scale Hierarchical Residual Network for Dense Captioning

Abstract: Recent research on dense captioning based on the recurrent neural network and the convolutional neural network has made a great progress. However, mapping from an image feature space to a description space is a nonlinear and multimodel task, which makes it difficult for the current methods to get accurate results. In this paper, we put forward a novel approach for dense captioning based on hourglass-structured residual learning. Discriminant feature maps are obtained by incorporating dense… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 22 publications
(5 citation statements)
references
References 27 publications
0
5
0
Order By: Relevance
“…To stabilize the training of deep network, we introduce residual network blocks and skip connections to the network, which can increase training efficiency, and achieve high accuracy as well [19]. The residual networks (also known as ResNet) have been successfully applied to a range of diverse areas of research [17,20].…”
Section: Introductionmentioning
confidence: 99%
“…To stabilize the training of deep network, we introduce residual network blocks and skip connections to the network, which can increase training efficiency, and achieve high accuracy as well [19]. The residual networks (also known as ResNet) have been successfully applied to a range of diverse areas of research [17,20].…”
Section: Introductionmentioning
confidence: 99%
“…Hyperparameters are also important in our approach, such as mixture coefficients in Eqs. 3, (4), and (7). The longterm dependency provides a context cue to increase the discriminant capacity for pixel-wise tasks, such as medical image segmentation.…”
Section: Discussionmentioning
confidence: 99%
“…Recently, medical image segmentation has made considerable achievement owing to convolutional neural network with cascade structure 1 and multiscale analysis. [2][3][4] However, accurately classifying each pixel is still challenging in MRI databased brain tumor segmentation. The MRI data have relatively low image contrast, and different types of tumors present ambiguous patterns.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, deep learning networks and especially convolutional neural networks (CNNs) have achieved remarkable success in many computer vision areas, including object recognition [1] [2], semantic segmentation [3] [4] [5], depth estimation [6], object detection [7] [8], etc. Instead of crafting features by humans, CNNs use convolutional layers to automatically extract the features from the input images and provide end-to-end solutions to the perceptional task.…”
Section: Introductionmentioning
confidence: 99%