2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.89
|View full text |Cite
|
Sign up to set email alerts
|

Deformable Convolutional Networks

Abstract: Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in their building modules. In this work, we introduce two new modules to enhance the transformation modeling capability of CNNs, namely, deformable convolution and deformable RoI pooling. Both are based on the idea of augmenting the spatial sampling locations in the modules with additional offsets and learning the offsets from the target tasks, without additional supervision. The… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

9
3,415
2
14

Year Published

2017
2017
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 5,401 publications
(3,440 citation statements)
references
References 50 publications
9
3,415
2
14
Order By: Relevance
“…Existing approaches basically follow two directions: deformable filter and rotating filer. In [3], a deformable convolution filter was introduced to enhance DCNNs' capacity of modeling geometric transformations by allowing free form deformation of the sampling grid with offsets learned from the preceding feature maps. However, the deformable filtering is complicated, because it is always associated with the Region of Interest (RoI) pooling technique originally designed for object detection [4].…”
mentioning
confidence: 99%
“…Existing approaches basically follow two directions: deformable filter and rotating filer. In [3], a deformable convolution filter was introduced to enhance DCNNs' capacity of modeling geometric transformations by allowing free form deformation of the sampling grid with offsets learned from the preceding feature maps. However, the deformable filtering is complicated, because it is always associated with the Region of Interest (RoI) pooling technique originally designed for object detection [4].…”
mentioning
confidence: 99%
“…In Dai et al [19], deformable convolution was achieved by augmenting the input feature map with 2D offsets during convolution. For better understanding from the image processing perspective, we formulized the deformable convolution as follows:…”
Section: Deformable Convolutionmentioning
confidence: 99%
“…In our work, the CNN architecture was developed according to Dai et al [19], but a different training strategy is used. As shown in Figure 2, based on R-FCN that contains fully convolutional feature maps, RoI pooling and RPN, we used ResNet101 ImageNet pre-trained parameters as the initial values and substitute res5, res4b22, res4b21 and res4b20 layers by deformable convolution layers.…”
Section: Deformable R-fcnmentioning
confidence: 99%
See 1 more Smart Citation
“…In Ref. [14], deformable convolutions are used to reformulate the sampling process in convolutions in a learning-based approach. Deformable convolutions can also be regarded as a way of reallocating convolutional weights.…”
Section: Affine Transformation In Deep Networkmentioning
confidence: 99%