2021
DOI: 10.48550/arxiv.2110.07096
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Identifying Introductions in Podcast Episodes from Automatically Generated Transcripts

Abstract: As the volume of long-form spoken-word content such as podcasts explodes, many platforms desire to present short, meaningful, and logically coherent segments extracted from the full content. Such segments can be consumed by users to sample content before diving in, as well as used by the platform to promote and recommend content. However, little published work is focused on the segmentation of spoken-word content, where the errors (noise) in transcripts generated by automatic speech recognition (ASR) services … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 26 publications
(27 reference statements)
0
2
0
Order By: Relevance
“…Midei and Mandic (2019) provide a podcast dataset for research in this domain and propose an LSTM and Universal Sentence Encoder (Cer et al, 2018) based sequence labeling model. Jing et al (2021) identify introductions in podcast transcripts using BERT (Devlin et al, 2019). Solbiati et al (2021) propose an unsupervised technique for meeting transcript segmentation.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Midei and Mandic (2019) provide a podcast dataset for research in this domain and propose an LSTM and Universal Sentence Encoder (Cer et al, 2018) based sequence labeling model. Jing et al (2021) identify introductions in podcast transcripts using BERT (Devlin et al, 2019). Solbiati et al (2021) propose an unsupervised technique for meeting transcript segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…However, only few works have addressed segmentation of transcripts (Midei and Mandic, 2019;Jing et al, 2021;Gruenstein et al, 2008). As shown in Figure 1, transcripts consist of a mix of short sentences, utterances, interjections and long form document style answers.…”
Section: Introductionmentioning
confidence: 99%