2019 IEEE International Conference on Multimedia and Expo (ICME) 2019
DOI: 10.1109/icme.2019.00256
|View full text |Cite
|
Sign up to set email alerts
|

Large-Scale Datasets for Going Deeper in Image Understanding

Abstract: Significant progress has been achieved in Computer Vision by leveraging large-scale image datasets. However, large-scale datasets for complex Computer Vision tasks beyond classification are still limited. This paper proposed a large-scale dataset named AIC (AI Challenger) with three sub-datasets, human keypoint detection (HKD), large-scale attribute dataset (LAD) and image Chinese captioning (ICC). In this dataset, we annotate class labels (LAD), keypoint coordinate (HKD), bounding box (HKD and LAD), attribute… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
64
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 86 publications
(64 citation statements)
references
References 62 publications
0
64
0
Order By: Relevance
“…Compared to the SimpleBaseline [124] with the same input size, our small and big networks receive 1.2 and 1.8 improvements, respectively. With the additional data from AI Challenger [121] for training, our single big network can obtain an AP of 77.0.…”
Section: Methodsmentioning
confidence: 99%
“…Compared to the SimpleBaseline [124] with the same input size, our small and big networks receive 1.2 and 1.8 improvements, respectively. With the additional data from AI Challenger [121] for training, our single big network can obtain an AP of 77.0.…”
Section: Methodsmentioning
confidence: 99%
“…Compared to the SimpleBaseline [72] with the same input size, our small and big networks receive 1.2 and 1.8 improvements, respectively. With additional data from AI Challenger [70] for training, our single big network can obtain an AP of 77.0.…”
Section: Coco Keypoint Detectionmentioning
confidence: 99%
“…To evaluate the performance of multi-person pose estimation algorithms, several public benchmarks were established, such as MSCOCO [19], MPII [2] and AI Challenger [30]. In these benchmarks, the images are usually collected from daily life where crowded scenes appear less Figure 1.…”
Section: Introductionmentioning
confidence: 99%