Wen-Hsing Lai scite author profile

Cardiovascular diseases have high morbidity and remain the leading cause of mortality. In the past two decades, the development of intelligent auscultation systems has attracted tremendous effort from the signal processing and machine learning communities. We propose a novel framework based on wavelet representations and deep recurrent neural networks for recognising three categories of heart sounds, i.e., normal, mild, and severe. The Heart Sounds Shenzhen corpus (n = 170) is used to validate the proposed method. The experimental results demonstrate the efficacy of the proposed method in a rigorous subject-independent scenario, reaching an unweighted average recall of 43.0 % (chance level: 33.3 %).
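The unweighted average recall (UAR) quoted in this abstract is the mean of the per-class recalls, which is why the chance level for three classes is 33.3 % regardless of class sizes. The following is a minimal sketch of that metric, not the authors' evaluation code; the function name and the label lists are hypothetical and are not taken from the Heart Sounds Shenzhen corpus.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; every class counts equally, whatever its size."""
    total = defaultdict(int)    # number of true samples per class
    correct = defaultdict(int)  # number of correctly predicted samples per class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical, imbalanced labels (illustration only).
y_true = ["normal"] * 6 + ["mild"] * 3 + ["severe"] * 1
# A classifier that always predicts the majority class.
always_normal = ["normal"] * 10

accuracy = sum(t == p for t, p in zip(y_true, always_normal)) / len(y_true)
uar = unweighted_average_recall(y_true, always_normal)
print(f"accuracy = {accuracy:.3f}, UAR = {uar:.3f}")
```

In this made-up case the majority-class predictor looks reasonable by plain accuracy but sits exactly at the three-class chance level by UAR, which is the reason UAR is preferred when classes are imbalanced.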


RPCA-DRNN technique for monaural singing voice separation

Lai

Wang

2022

J AUDIO SPEECH MUSIC PROC.


In this study, we propose a methodology for separating a singing voice from musical accompaniment in a monaural musical mixture. The proposed method uses robust principal component analysis (RPCA), followed by postprocessing with a median filter, morphological operations, and a high-pass filter, to decompose the mixture. Subsequently, a deep recurrent neural network, comprising two jointly optimized parallel-stacked recurrent neural networks (sRNNs) with mask layers and trained with limited data and computation, is applied to the decomposed components. This stage refines the estimated singing voice and background music by correcting misclassified or residual singing and background music left by the initial separation. Experimental results on the MIR-1K, ccMixter, and MUSDB18 datasets, together with a comparison against ten existing techniques, indicate that the proposed method achieves competitive performance in monaural singing voice separation. On MUSDB18, the proposed method reaches separation quality comparable to that of the other state-of-the-art technique while using less training data and lower computational cost.


Analysis of syllable duration models for Mandarin speech

Lai

Chen

2002


In this paper, the multiplicative syllable duration model proposed previously for Mandarin speech is extended in several respects. First, the three basic Tone 3 patterns (i.e., full tone, half tone, and sandhi tone) are properly considered by using three different companding factors (CFs) to separate their effects. Second, the CFs of the model are analyzed in detail. Third, the syllable duration modeling method is applied to an automatically segmented, 500-speaker, telephone-speech database. Fourth, a comparative study is carried out in which an additive syllable duration model is constructed in parallel.
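The difference between a multiplicative and an additive duration model can be illustrated with a small sketch. This is only a generic illustration under stated assumptions, not the model from the paper: the function names, the base duration, and all factor and offset values are hypothetical, and the paper's actual affecting factors and estimation procedure are not reproduced here.

```python
import math


def multiplicative_duration(base_ms, factors):
    """Scale a base duration by the product of per-factor scaling terms."""
    return base_ms * math.prod(factors)


def additive_duration(base_ms, offsets_ms):
    """Shift a base duration by the sum of per-factor offsets."""
    return base_ms + sum(offsets_ms)


# Hypothetical values for illustration only.
base_ms = 200.0
factors = [1.10, 0.90]       # e.g. one factor lengthens, one shortens
offsets_ms = [20.0, -20.0]   # additive counterparts, also made up

mult = multiplicative_duration(base_ms, factors)
add = additive_duration(base_ms, offsets_ms)
print(f"multiplicative: {mult:.1f} ms, additive: {add:.1f} ms")

# In the log domain the multiplicative model becomes a sum of terms.
log_sum = math.log(base_ms) + sum(math.log(f) for f in factors)
print(f"exp(log-domain sum): {math.exp(log_sum):.1f} ms")
```

One reason to compare the two forms is that a multiplicative factor changes a long syllable by more milliseconds than a short one, whereas an additive offset changes both by the same amount.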



The Impact of Cloud Computing Technology on Legal Infrastructure within Internet—Focusing on the Protection of Information Privacy

Deep Wavelets for Heart Sound Classification
