Intelligent Data Engineering and Automated Learning - IDEAL 2007
DOI: 10.1007/978-3-540-77226-2_63

Segmentation and Annotation of Audiovisual Recordings Based on Automated Speech Recognition

Cited by 20 publications (7 citation statements)
References 11 publications
“…Due to the fact that the slides carried most of the information, Repp et al synchronized the imperfect transcript from the speech recognition engine automatically with the slide streams in post-processing [19]. Most approaches use out-of-the-box speech recognition engines which, for example, extract key phrases from spoken content [7].…”
Section: Related Work (mentioning)
confidence: 99%
“…So each LO has a duration of approximately 1.5 minutes. The synchronization between the power point slides and the erroneous transcript in a post-processing process is explored in [19] for the cases where no log file exists with time-stamps for each slide transition.…”
Section: Second Test: LO With the Slides (mentioning)
confidence: 99%
“…Chen et al [3] attempted to automatically synchronize presentation slides with the speaker video. Repp et al [10] proposed the segmentation and annotation of audiovisual recordings based on automated speech recognition. Recently, Bhatt et al [1] and Che et al [2] attempted to automatically determine the temporal segmentation and annotation for lecture videos.…”
Section: Related Work (mentioning)
confidence: 99%
“…Due to the fact that the PowerPoint slides carried most of the information, Repp et al synchronized the imperfect transcript from the speech recognition engine automatically with the slide streams in postprocessing [14].…”
Section: Related Work (mentioning)
confidence: 99%
“…deleting stop-words and stemming of the words - the stems are stored in a database. This part of our system has already been described in [13,14].…”
Section: Identification Of Relevant Abstract (mentioning)
confidence: 99%