A comparison of molecular approaches for generating sparse and structured multiresolution representations of audio and music signals

Sturm, Bob L.; Shynk, J.J.; McLeran, Aaron; Roads, Curtis; Daudet, Laurent

doi:10.1121/1.2935490

Cited by 2 publications

(1 citation statement)

References 11 publications

“…We have designed an algorithm that builds molecular representations from atomic ones [Sturm et al., 2008c; Sturm et al., 2008d; Sturm, 2009]. Each molecule is a group of atoms that act together to represent a high-level feature.…”

Section: Higher-level Representations Through Molecules

Citation type: mentioning

confidence: 99%

Analysis, Visualization, and Transformation of Audio Signals Using Dictionary-based Methods

Sturm, Roads, McLeran, et al., 2009

Journal of New Music Research

Self Cite

In this article we provide an overview of dictionary-based methods (DBMs), also called sparse approximation, and review recent work in the application of such methods to signals, in particular audio and music signals. As Fourier analysis is to additive synthesis, DBMs can be seen as the analytical counterpart to granular synthesis, since a signal is rebuilt by a linear combination of heterogeneous atoms selected from a user-defined dictionary. We demonstrate how DBMs provide novel means for analyzing, visualizing, and transforming audio signals by creating multiresolution and parametric descriptions of their contents.

Recursive nearest neighbor search in a sparse and multiscale domain for comparing audio signals

Sturm, Daudet, 2011

Signal Processing

Self Cite

We investigate recursive nearest neighbor search in a sparse domain at the scale of audio signals. Essentially, to approximate the cosine distance between the signals we make pairwise comparisons between the elements of localized sparse models built from large and redundant multiscale dictionaries of time-frequency atoms. Theoretically, error bounds on these approximations provide efficient means for quickly reducing the search space to the nearest neighborhood of a given data point, but we demonstrate here that the best bound defined thus far, which involves a probabilistic assumption, does not provide a practical approach for comparing audio signals with respect to this distance measure. Our experiments show, however, that regardless of these non-discriminative bounds, we only need to make a few atom pair comparisons to reveal, e.g., the origin of an excerpted signal, or melodies with similar time-frequency structures.
