2008
DOI: 10.1109/tpami.2008.128
|View full text |Cite
|
Sign up to set email alerts
|

80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

Abstract: Abstract-

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

15
1,062
0
8

Year Published

2011
2011
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 1,724 publications
(1,085 citation statements)
references
References 36 publications
15
1,062
0
8
Order By: Relevance
“…of Arxiv General Relativity) has 5.242 nodes and 28.980 edges whereas the largest graph (Amazon product copurchasing network from June 1) has 403, 394 nodes and 3, 387, 388 edges. To demonstrate the principal suitability of LS-DCUR for processing very large, dense matrices, we consider a data set of 80 million Google TM images [29]. Viewing each image (represented as a vector of gist descriptors) as a column, this results in a gigantic matrix containing 3072 · 10 7 elements.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…of Arxiv General Relativity) has 5.242 nodes and 28.980 edges whereas the largest graph (Amazon product copurchasing network from June 1) has 403, 394 nodes and 3, 387, 388 edges. To demonstrate the principal suitability of LS-DCUR for processing very large, dense matrices, we consider a data set of 80 million Google TM images [29]. Viewing each image (represented as a vector of gist descriptors) as a column, this results in a gigantic matrix containing 3072 · 10 7 elements.…”
Section: Methodsmentioning
confidence: 99%
“…When dealing with today's large-scale data sets, many data mining practitioners therefore often abandon deterministic approaches and resort to randomized approaches. However, huge data such as collections of online books at Amazon TM , image repositories at Flickr TM or Google TM , or personal health records [16,19,23,24,29] are becoming ever more common and thus pose a challenge to research on interpretable matrix factorization.…”
Section: Introductionmentioning
confidence: 99%
“…Li et al [25] presented a topdown saliency approach to incorporate low-level features and the objectness measure via label propagation. Generally, such task-driven methods are useful especially for object recognition [7], but they require knowledge learning that increases the complexity of saliency detection in general.…”
Section: A Top-down Saliency Detectionmentioning
confidence: 99%
“…From the perspective of information processing, those saliency algorithms can be broadly categorized as either top-down or bottom-up approaches. Top-down approaches [7]- [10] are goal-directed and usually adopt supervised learning with a specific class. Most of the saliency detection methods are based on bottomup visual attention mechanisms [11]- [15], [17], [18], [21], which are independent of the knowledge of the content in the image and utilize various low level features, such as intensity, color and orientation.…”
Section: Introductionmentioning
confidence: 99%
“…is generated from the TinyImages collection [24], which provides 80 million color images of size 32 × 32. For each image, we first convert it to grayscale, compute its two-dimensional DCT (Discrete Cosine Transform), and then keep only the top 2% largest coefficients in magnitude.…”
Section: Numerical Experimentsmentioning
confidence: 99%