2019
DOI: 10.1109/tmm.2019.2895511
|View full text |Cite
|
Sign up to set email alerts
|

End-to-End Automatic Image Annotation Based on Deep CNN and Multi-Label Data Augmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
40
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 88 publications
(40 citation statements)
references
References 38 publications
0
40
0
Order By: Relevance
“…Many studies have been conducted on using photo contents to extract travel location information, which is based on color, texture, or shape representation [25][26][27]. Ke et al [25] proposed a method that transforms the photo annotation problem into a multi-label learning problem. Kuang et al [26] estimated visual information from photos by associated tags in a local region.…”
Section: Using Extra Data To Address the Travel Location Cold Start Pmentioning
confidence: 99%
“…Many studies have been conducted on using photo contents to extract travel location information, which is based on color, texture, or shape representation [25][26][27]. Ke et al [25] proposed a method that transforms the photo annotation problem into a multi-label learning problem. Kuang et al [26] estimated visual information from photos by associated tags in a local region.…”
Section: Using Extra Data To Address the Travel Location Cold Start Pmentioning
confidence: 99%
“…Based on initial labels extracted from the CNNs and initial labels of possibly user-defined tags, the event categories and final annotations of the images were estimated through a refinement process based on the expectation-maximization (EM) algorithm. Ke et al [ 19 ] developed an end-to-end automatic image annotation model based on a deep convolutional neural network (E2E-DCNN) and multilabel data augmentation. A deep CNN structure was adopted for adaptive feature learning, in which the cross-entropy loss functions were first used to construct an end-to-end annotation structure for training, and Wasserstein generative adversarial networks were used for multilabel data augmentation.…”
Section: Related Workmentioning
confidence: 99%
“…Three benchmark dataset which are used widely are selected for evaluating the proposed work. Corel5K [43], Espgame [47], and Iaprtc12 [45]. Corel5K contains 5000 images which classify 4000 images as train image and 1000 images as test images.…”
Section: A Datasetsmentioning
confidence: 99%