2016 IEEE Winter Conference on Applications of Computer Vision (WACV) 2016
DOI: 10.1109/wacv.2016.7477560
|View full text |Cite
|
Sign up to set email alerts
|

Naming TV characters by watching and analyzing dialogs

Abstract: Person identification in TV series has been a popular research topic over the last decade. In this area, most approaches either use manually annotated data or extract character supervision from a combination of subtitles and transcripts. However, both approaches have key drawbacks that hinder application of these methods at a large scale -manual annotation is expensive and transcripts are often hard to obtain. We investigate the topic of automatically labeling all character appearances in TV series using infor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
29
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
4

Relationship

2
6

Authors

Journals

citations
Cited by 31 publications
(29 citation statements)
references
References 28 publications
0
29
0
Order By: Relevance
“…Face track clustering/identification methods have also used additional cues such as clothing appearance [33], speech [22], voice models [19], context [46], gender [49], name mentions (first, second, and third person references) in subtitles [18], weak labels using transcripts/subtitles [7], [2], and joint action and actor labeling [16] using transcripts.…”
Section: Related Workmentioning
confidence: 99%
“…Face track clustering/identification methods have also used additional cues such as clothing appearance [33], speech [22], voice models [19], context [46], gender [49], name mentions (first, second, and third person references) in subtitles [18], weak labels using transcripts/subtitles [7], [2], and joint action and actor labeling [16] using transcripts.…”
Section: Related Workmentioning
confidence: 99%
“…Verification losses. Next, we analyze LDML, contrastive, and triplet losses ( Table 5 rows [10][11][12][13][14][15][16][17][18]. While these losses are often used to perform clustering, they are not designed for it [47].…”
Section: #Chmentioning
confidence: 99%
“…Characters are often studied by analyzing face tracks (sequences of temporally related detections) in videos. A significant part of this analysis is identification -labeling face tracks with their names, and typically employs supervision from web images [1,29], transcripts [3,9], or even dialogs [7,15]. We are interested in an equally popular alternative -clustering face tracks based on identity.…”
Section: Introductionmentioning
confidence: 99%
“…Weak Supervision for Actor Labelling The topic of labeling actors in movies and TV-series has been tackled mostly using weak supervision from subtitle and transcript files [1,8,9,10,11,12,13], building on the seminal work of Everingham et al [1,8]. The transcript tells us what each character says in the show, but is not aligned with the video.…”
Section: Related Workmentioning
confidence: 99%