Handbook of Multimedia for Digital Entertainment and Arts 2009
DOI: 10.1007/978-0-387-89024-1_17
|View full text |Cite
|
Sign up to set email alerts
|

Automated Music Video Generation Using Multi-level Feature-based Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(10 citation statements)
references
References 7 publications
0
10
0
Order By: Relevance
“…Wang et al also focused on the structure of a music clip and proposed to generate a music sports video by selecting the best video clip for each semantic music segment such as Intro, Verse, and Chorus considering their semantic content and tempo using certain user-defined rules [14]. Yoon et al proposed to conserve the intent of the video-maker by segmenting video and music clips into segments of near-uniform flow and selecting the best video segment for each music segment based on their novelty, velocity and brightness and by using timewarping [15]. While these work all tried to synthesize video clips to a music clip, Mulhem et al have proposed a method for synthesizing several music clips to a sequence of video clips by selecting the best music clip for a video clip based on heuristically determined rules about the aesthetic compatibility between video and music clips, more concretely, how the dynamic, motion, and pitch of video and music clips coincide with each other [16].…”
Section: Related Workmentioning
confidence: 99%
“…Wang et al also focused on the structure of a music clip and proposed to generate a music sports video by selecting the best video clip for each semantic music segment such as Intro, Verse, and Chorus considering their semantic content and tempo using certain user-defined rules [14]. Yoon et al proposed to conserve the intent of the video-maker by segmenting video and music clips into segments of near-uniform flow and selecting the best video segment for each music segment based on their novelty, velocity and brightness and by using timewarping [15]. While these work all tried to synthesize video clips to a music clip, Mulhem et al have proposed a method for synthesizing several music clips to a sequence of video clips by selecting the best music clip for a video clip based on heuristically determined rules about the aesthetic compatibility between video and music clips, more concretely, how the dynamic, motion, and pitch of video and music clips coincide with each other [16].…”
Section: Related Workmentioning
confidence: 99%
“…They usually extract suitable features from the video to be matched with features from the audio. In [2] and [3] an offline alignment is performed, which is not applicable to our problem due to the non-linear computational costs and the need to have complete sequences for the alignment.…”
Section: Related Workmentioning
confidence: 99%
“…The user can therefore, instead of just listening to his favourite music, watch the corresponding music video at the same time. Indeed, many services nowadays allow for music video streaming, for example Youtube 1 3 ; or music streaming, such as Spotify 4 , Pandora 5 or GrooveShark 6 . Due to bandwidth limitations the audio in the streamed videos is usually of lower quality than when streaming audio alone, and neither is comparable to locally stored high quality audio files or music on audio CDs.…”
Section: Introductionmentioning
confidence: 99%
“…However, there is no bias for a linear progression through both recordings, which is desirable in this case. The work by Yoon, Lee and Byun [4] is an example of aligning music with arbitrary videos by comparing multi-level features.…”
Section: Audio To Video Matchingmentioning
confidence: 99%
“…In addition, users can now pay monthly subscription fees to stream music and videos over their internet connection using online services such as Spotify 3 , Rhapsody 4 or Last.fm 5 .…”
Section: Introductionmentioning
confidence: 99%