Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology 2013
DOI: 10.1145/2501988.2501993
|View full text |Cite
|
Sign up to set email alerts
|

Content-based tools for editing audio stories

Abstract: Audio stories are an engaging form of communication that combine speech and music into compelling narratives. Existing audio editing tools force story producers to manipulate speech and music tracks via tedious, low-level waveform editing. In contrast, we present a set of tools that analyze the audio content of the speech and music and thereby allow producers to work at much higher level. Our tools address several challenges in creating audio stories, including (1) navigating and editing speech, (2) selecting … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
70
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 65 publications
(70 citation statements)
references
References 25 publications
0
70
0
Order By: Relevance
“…Instead, we align the transcripts to speech audio, producing much greater timing precision and accuracy. Transcripts are force-aligned to speech-only audio for each episode using dynamic programming and an American-English language model, as in [24]. Alignment results in word and phoneme boundaries, as shown in Fig 2(a).…”
Section: Character Data Collectionmentioning
confidence: 99%
“…Instead, we align the transcripts to speech audio, producing much greater timing precision and accuracy. Transcripts are force-aligned to speech-only audio for each episode using dynamic programming and an American-English language model, as in [24]. Alignment results in word and phoneme boundaries, as shown in Fig 2(a).…”
Section: Character Data Collectionmentioning
confidence: 99%
“…Rubin et al [22] focus on the process of editing speech and adding music to audio stories. Their system offers a highlevel editing workflow and requires a user-in-the-loop to refine speech and music edits.…”
Section: Related Workmentioning
confidence: 99%
“…Wenner et al [34] and Rubin et al [22] present methods for music retargeting to find optimal paths through music that match constraints of videos or audio stories. Our work builds on these algorithms and describes new constraints to match emotions of the speech and music, add pauses with flexible length, bound music segment lengths, and create scores from multiple music tracks.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations