2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2010
DOI: 10.1109/cvpr.2010.5539985
|View full text |Cite
|
Sign up to set email alerts
|

Finding meaning on YouTube: Tag recommendation and category discovery

Abstract: We present a system that automatically recommends tags for YouTube videos solely based on their audiovisual content. We also propose a novel framework for unsupervised discovery of video categories that exploits knowledge mined from the World-Wide Web text documents/searches. First, video content to tag association is learned by training classifiers that map audiovisual content-based features from millions of videos on YouTube.com to existing uploadersupplied tags for these videos. When a new video is uploaded… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
48
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 74 publications
(48 citation statements)
references
References 19 publications
0
48
0
Order By: Relevance
“…The idea of automatic category discovery in this work shares some spirits with [24]. In [24], a set of entity categories are discovered by mining web pages and search queries.…”
Section: Related Workmentioning
confidence: 99%
“…The idea of automatic category discovery in this work shares some spirits with [24]. In [24], a set of entity categories are discovered by mining web pages and search queries.…”
Section: Related Workmentioning
confidence: 99%
“…64) and resize the frame to 10%. Resizing the frame and keeping only a small percentage helps to limit the computations, and does not affect the results, as the color information is not highly affected by image quality [5] [11]. Figure 1 present the original image and the quantized and resized copy.…”
Section: A Dominant Colormentioning
confidence: 99%
“…Having acquired information for each frame, it is relatively easy to translate it at the shot level, by employing the average face ratio from all the frames of the shot. In order to avoid having the shot results affected by a possibly very large (or a very small) BB, we first sort the vector that contains all the BB sizes for all the frames, and then select the median value, as in [1] [11]. Having extracted the face-frame ratios, we could use this information to extract the shot-type as in [7].…”
Section: Other Extracted Statisticsmentioning
confidence: 99%
“…Clustering methods are often subject to other systems, for example to reduce the possibility of recommender systems (e.g. Tag-recommender on Youtube videos [10]); for example clustering of large high-dimensional gene expression data sets has widespread application in -omics [11]. Unfortunately, the underlying structure of these natural data sets is often fuzzy, and the computational identification of data clusters generally requires (human) expert knowledge about cluster number and geometry.…”
Section: A Few Examples Of Visualization and Machine Learning Integramentioning
confidence: 99%