Paris Smaragdis scite author profile

Paris Smaragdis

5 Publications

1,644 Citation Statements Received

22 Citation Statements Given


Affiliations

University of Illinois Urbana-Champaign, Adobe Systems (United States), Mitsubishi Electric (United States)

Publications


Non-negative matrix factorization for polyphonic music transcription


In this paper we present a methodology for analyzing polyphonic musical passages composed of notes that exhibit a harmonically fixed spectral profile (such as piano notes). Taking advantage of this unique note structure, we can model the audio content of the musical passage by a linear basis transform and use non-negative matrix decomposition methods to estimate the spectral profile and the temporal information of every note. This approach results in a very simple and compact system that is not knowledge-based, but rather learns notes by observation.

WASPAA 2003
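The decomposition described in this abstract can be illustrated with a minimal sketch. It assumes a magnitude spectrogram `V` (frequency bins by time frames) and uses the standard multiplicative updates for the Euclidean cost; the abstract does not say which cost function or update rule the paper uses, so that choice, the function name `nmf`, and the toy data are all assumptions made for illustration. Columns of `W` play the role of per-note spectral profiles and rows of `H` the role of their activations over time.

```python
import numpy as np

def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Factor a non-negative matrix V (freq x time) as W @ H.

    Uses multiplicative updates for the Euclidean cost (an assumed choice).
    W: (freq x rank) spectral profiles, H: (rank x time) activations.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, rank)) + eps
    H = rng.random((rank, n_frames)) + eps
    for _ in range(n_iter):
        # Each update keeps the factors non-negative by construction.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Hypothetical toy "spectrogram": two notes with fixed spectral profiles.
note_a = np.array([1.0, 0.0, 0.5, 0.0, 0.25, 0.0])
note_b = np.array([0.0, 1.0, 0.0, 0.5, 0.0, 0.25])
activations = np.array([[1, 1, 0, 0, 1, 1],
                        [0, 0, 1, 1, 1, 1]], dtype=float)
V = np.outer(note_a, activations[0]) + np.outer(note_b, activations[1])

W, H = nmf(V, rank=2)
error = np.linalg.norm(V - W @ H)
print("reconstruction error:", error)
```

In this toy setting the rank equals the number of distinct notes; with real recordings the rank is a parameter to be chosen, and the learned components come back in arbitrary order and scale.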


Blind separation of convolved mixtures in the frequency domain

1998


Convolutive Speech Bases and Their Application to Supervised Speech Separation

Smaragdis

2007

IEEE Trans. Audio Speech Lang. Process.

338

293


Deep learning for monaural speech separation

et al. 2014


Monaural source separation is useful for many real-world applications, though it is a challenging problem. In this paper, we study deep learning for monaural speech separation. We propose the joint optimization of the deep learning models (deep neural networks and recurrent neural networks) with an extra masking layer, which enforces a reconstruction constraint. Moreover, we explore a discriminative training criterion for the neural networks to further enhance the separation performance. We evaluate our approaches using the TIMIT speech corpus for a monaural speech separation task. Our proposed models achieve about 3.8~4.9 dB SIR gain compared to NMF models, while maintaining better SDRs and SARs.
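The reconstruction constraint mentioned in this abstract can be illustrated with a small sketch of a soft time-frequency mask. The abstract does not give the layer's formula, so the ratio-mask form below, the function name `soft_mask_layer`, and the example arrays are assumptions for illustration only. The point it shows is that when two source estimates are turned into complementary masks and applied to the mixture magnitude, the two outputs sum back to the mixture by construction.

```python
import numpy as np

def soft_mask_layer(y1_hat, y2_hat, mixture_mag, eps=1e-8):
    """Turn two raw source estimates into masked estimates of the mixture.

    Assumed ratio-mask form: mask1 = |y1| / (|y1| + |y2|), mask2 = 1 - mask1.
    The outputs satisfy s1 + s2 == mixture_mag.
    """
    a1 = np.abs(y1_hat)
    a2 = np.abs(y2_hat)
    mask1 = a1 / (a1 + a2 + eps)
    s1 = mask1 * mixture_mag
    s2 = (1.0 - mask1) * mixture_mag
    return s1, s2

# Hypothetical values standing in for network outputs and a mixture spectrogram.
rng = np.random.default_rng(0)
mixture_mag = rng.random((4, 5))   # freq bins x time frames
y1_hat = rng.random((4, 5))        # raw estimate for source 1
y2_hat = rng.random((4, 5))        # raw estimate for source 2

s1, s2 = soft_mask_layer(y1_hat, y2_hat, mixture_mag)
print("max reconstruction gap:", np.abs(s1 + s2 - mixture_mag).max())
```

Because the mask is a differentiable function of the raw estimates, a layer of this kind can in principle sit at the end of a network and be trained jointly with it, which is how the abstract describes the approach.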


Singing-voice separation from monaural recordings using robust principal component analysis

et al. 2012


