2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00610
|View full text |Cite
|
Sign up to set email alerts
|

Reliable and Efficient Image Cropping: A Grid Anchor Based Approach

Abstract: Image cropping aims to improve the composition as well as aesthetic quality of an image by removing extraneous content from it. Existing image cropping databases provide only one or several human-annotated bounding boxes as the groundtruth, which cannot reflect the non-uniqueness and flexibility of image cropping in practice. The employed evaluation metrics such as intersection-over-union cannot reliably reflect the real performance of cropping models, either. This work revisits the problem of image cropping, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
104
0
3

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 62 publications
(107 citation statements)
references
References 36 publications
0
104
0
3
Order By: Relevance
“…Following this idea, Wei et al (2018) increased the number of crops in each image and proposed the first densely annotated image cropping dataset, in which each image has 24 annotated crops with aesthetic scores. Zeng et al (2019) took a further step and presented another densely annotated dataset with about 85 crops for each image. In our method, we use such datasets to train our image cropping model.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Following this idea, Wei et al (2018) increased the number of crops in each image and proposed the first densely annotated image cropping dataset, in which each image has 24 annotated crops with aesthetic scores. Zeng et al (2019) took a further step and presented another densely annotated dataset with about 85 crops for each image. In our method, we use such datasets to train our image cropping model.…”
Section: Related Workmentioning
confidence: 99%
“…Based on the aesthetic score map, we aim to find the best crop by generating a lot of candidate crops and comparing their crop-level scores. Previous methods (Chen et al 2017b;Wei et al 2018;Zeng et al 2019) adopt different strategies in this step. Here, we design a simple yet effective searching strategy to reduce the searching space.…”
Section: Two-stage Search For Image Croppingmentioning
confidence: 99%
“…We show the retargeting results of our method and some previous methods in Figure 4 and Figure 5, where the retargeted size is 50% of the original size. The previous methods; scaling, GAIC [33], seam carving [21], MULTIOP [22], WSSDCNN [3], and Cycle-IR [26] were compared. The retargeting operators of scaling and seam carving were the same as the operators used in our method.…”
Section: Qualitative Evaluationmentioning
confidence: 99%
“…This trend became more pronounced as the number of operators increased. Figure 7: Qualitative comparison when the aesthetic score was used as a reward to generate 75% retargeted images : (a) original image, (b) SCL, (c) GAIC [33], (d) SC [21], (e) MULTIOP [22], (f) Our method, (g) Our method without both the self-play-based reward and the frequency-aware weighted loss, (h) Our method without the frequency-aware weighted loss. The number below each image represents the aesthetic score; a higher score means a more aesthetic image.…”
Section: Time Efficiencymentioning
confidence: 99%
“…The preprocessing stage consists of two parts: rescaling and obtaining optical flow maps of each eight consecutive video frames. The universal video format of today is 16:9 [26], which is an aspect ratio with a width of 16 units and a height of 9 units. In our study, each frame was extracted from the dataset videos and rescaled to 320 x 180.…”
Section: A Preprocessingmentioning
confidence: 99%