Names and faces in the news

Berg, Tamara L.; Berg, Alexander C.; Edwards, Jaety; Maire, Michael; White, Ryan M.; Teh, Yee-Whye; Learned-Miller, Eric; Forsyth, David

doi:10.1109/cvpr.2004.1315253

Cited by 284 publications

(310 citation statements)

References 19 publications

Citation statements by type: Supporting, Mentioning (296), Contrasting, Unclassified


“…no text). The difference in the difficulty is apparent by comparing the examples in [6] with those used for evaluation in §3. For example, in [6] the face image size is restricted to be at least 86 × 86 pixels, whilst a significant number of faces we use are of lower resolution.…”

Section: Previous Work (mentioning)

confidence: 99%

“…Fitzgibbon and Zisserman [14] investigated face clustering in feature films, though without explicitly using facial features for registration. Berg et al [6] consider the problem of clustering detected frontal faces extracted from web news pages. In a similar manner to us, affine registration with an underlying SVM-based facial feature detector is used for face rectification.…”

Section: Previous Work (mentioning)

confidence: 99%

“…A related approach was described in [6]; alternative methods include pictorial structures [13], shape+appearance cascaded classifiers [15] and the method of Cristinacce et al [10]. We represent each facial feature, i.e.…”

Section: Facial Feature Detection (mentioning)

confidence: 99%


On Film Character Retrieval in Feature-Length Films

Arandjelović, Zisserman (2006), Interactive Video


“…Sentence semantics only provides ambiguous and implicit labels. This resembles another line of work that learns structured output from image captions (Berg et al 2004;Gupta and Davis 2008;Luo et al 2009;Jamieson et al 2010a, b;Plummer et al 2015;Mao et al 2016), treating the input as a parallel image-text dataset. However, all of these methods, except Gupta and Davis (2008) and Jamieson et al (2010a, b) use pretrained object models learned from other datasets.…”

Section: Related Work (mentioning)

confidence: 99%

Sentence Directed Video Object Codiscovery

Yu, Siskind (2017), Int J Comput Vis

Video object codiscovery can leverage the weak semantic constraint implied by sentences that describe the video content. Our codiscovery method, like other object codetection techniques, does not employ any pretrained object models or detectors. Unlike most prior work that focuses on codetecting large objects which are usually salient both in size and appearance, our method can discover small or medium sized objects as well as ones that may be occluded for part of the video. More importantly, our method can codiscover multiple object instances of different classes within a single video clip. Although the semantic information employed is usually simple and weak, it can greatly boost performance by constraining the hypothesized object locations. Experiments show promising results on three datasets: an average IoU score of 0.423 on a new dataset with 15 object


“…Everingham et al [26,27] addressed the problem of automatically labeling faces of characters in TV or film materials with their names. Similar to the "Faces in the News" labeling in [16], where detected frontal faces in news images are tagged with names appearing in the news story text, they proposed to combine visual cues (face and cloth) and textual cues (subtitle and transcript) for assigning names. Regarding face processing [3], face detections in each frame are linked to derive face tracks, and each face is represented by local appearance descriptors computed around 13 facial features.…”

Section: Face Retrieval In Video (mentioning)

confidence: 99%

Face Recognition and Retrieval in Video

Shan (2010), Video Search and Mining

Abstract. Automatic face recognition has long been established as one of the most active research areas in computer vision. Face recognition in unconstrained environments remains challenging for most practical applications. In contrast to traditional still-image based approaches, recently the research focus has shifted towards video-based approaches. Video data provides rich and redundant information, which can be exploited to resolve the inherent ambiguities of image-based recognition like sensitivity to low resolution, pose variations and occlusion, leading to more accurate and robust recognition. Face recognition has also been considered in the content-based video retrieval setup, for example, character-based video search. In this chapter, we review existing research on face recognition and retrieval in video. The relevant techniques are comprehensively surveyed and discussed.
