2008 IEEE Conference on Computer Vision and Pattern Recognition 2008
DOI: 10.1109/cvpr.2008.4587621
|View full text |Cite
|
Sign up to set email alerts
|

Filtering Internet image search results towards keyword based category recognition

Abstract: In this work we aim to capitalize on the availability of Internet image search engines to automatically create image training sets from user provided queries. This problem is particularly difficult due to the low precision of image search results. Unlike many existing dataset gathering approaches, we do not assume a category model based on a small subset of the noisy data or an ad-hoc validation set. Instead we use a nonparametric measure of strangeness [8] in the space of holistic image representations, and p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0

Year Published

2009
2009
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(25 citation statements)
references
References 15 publications
0
25
0
Order By: Relevance
“…Unfortunately, the search results are still unsatisfactory because of the relatively low precision ratio and the existence of large number of junk images [1,2,3,4]. One big reason for this drawback is due to the fact that Google Images simplifies the image search problem as a text-based problem: the images are not indexed by their appearance but by text keywords which are extracted from the context of the images.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Unfortunately, the search results are still unsatisfactory because of the relatively low precision ratio and the existence of large number of junk images [1,2,3,4]. One big reason for this drawback is due to the fact that Google Images simplifies the image search problem as a text-based problem: the images are not indexed by their appearance but by text keywords which are extracted from the context of the images.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, many real world web applications, such as photo-sharing websites, may only be able to provide noisy tags for image annotation which could further misguide keyword-based image search engines such as Google Images. Therefore, there is an urgent need to develop new algorithms for filtering out junk images from Google Images [1,2,3,4].…”
Section: Introductionmentioning
confidence: 99%
“…In [6], the TSI-pLSA method is presented for image categorization based on a visual vocabulary, where the performance heavily relies on the quality of the training data. Wnuk et al [7] propose a nonparametric measure of strangeness based on visual characteristics of images. It neglects the role of textual features in capturing image semantics.…”
Section: Related Workmentioning
confidence: 99%
“…However, it would be reluctant or difficult for many users to provide the query image or the query image is difficult to be given. And some applications also need to harvest images for one query from the web [20,24].…”
Section: Introductionmentioning
confidence: 99%