2010 International Workshop on Content Based Multimedia Indexing (CBMI) 2010
DOI: 10.1109/cbmi.2010.5529900
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting speaker segmentations for automatic role detection. An application to broadcast news documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 12 publications
0
4
0
Order By: Relevance
“…It outperforms the results in [6], which also used TIMIT synthesized data, based on algorithms in [5]. These results are still very competitive, compared to other algorithms using real world conversations [7], [13].…”
Section: Discussionmentioning
confidence: 79%
“…It outperforms the results in [6], which also used TIMIT synthesized data, based on algorithms in [5]. These results are still very competitive, compared to other algorithms using real world conversations [7], [13].…”
Section: Discussionmentioning
confidence: 79%
“…AVPD can be used in many different kinds of applications by both professionals and the general public. One of the most interesting applications of people diarization to video documents is the detection of major casts and their roles, for example, the anchor persons in TV news or principal characters in movies [6,10,11,14,18,19]. Their occurrences provide good indices for organizing and presenting video content.…”
Section: Introductionmentioning
confidence: 99%
“…The most commonly used are the Gaussian mixture models and the hidden Markov models. 10,11,14,26,37,40 Also widely used are the support vector machines, 11,14,38,39,41 the artificial neural networks, 10 the k-nearest neighbor algorithm, 14,38 the decision trees, 10,38 the genetic algorithms, 2 the fuzzy logic 42 and boosting techniques. 41,43 Related architectures incorporate fusion frameworks among recognition models 28,44 and combination of model-based and distance based algorithms.…”
Section: Introductionmentioning
confidence: 99%
“…[1][2][3][4][5][6][7][8][9] Concerning the audio data, the automatic analysis of the audio signals can offer the users useful information. In the case of broadcast news, automatic processing is related to tasks such as sound recognition, 10,11 speaker recognition, 12 anchor detection, 13 role detection, [14][15][16] story boundary detection, 2,17,18 summary construction from anchor talking, 9,19 channel's quality detection, 20 sound event detection, 21,22 non-linguistic humanproduced sounds detection, 5,6,[23][24][25] audio type segmentation in sport games, 4,26,27 highlight scene extraction from sports games, 3 violence scene detection, 28 music characteristics classification, 29,30 jingle detection, 1 commercial block detection, 8 voice activity detection, 31 language recognition, 32 emotion recognition 33 and speech recognition. 34 Sound recognition is the cornerstone of analysis as typically precedes the other stages.…”
Section: Introductionmentioning
confidence: 99%