Yih-Liang Shen scite author profile

Yih-Liang Shen

5 Publications

17 Citation Statements Received

37 Citation Statements Given


Affiliations

National Yang Ming Chiao Tung University

Publications


Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition

Shen, Huang, Wang, et al. 2019

Conventional deep neural network (DNN)-based speech enhancement (SE) approaches aim to minimize the mean square error (MSE) between the enhanced speech and a clean reference. An MSE-optimized model may not directly improve the performance of an automatic speech recognition (ASR) system. If the target is to minimize the recognition error, the recognition results should be used to design the objective function for optimizing the SE model. However, an ASR system consists of multiple units, such as acoustic and language models, so its structure is usually complex and not differentiable. In this study, we propose using a reinforcement learning algorithm to optimize the SE model based on the recognition results. We evaluated the proposed SE system on the Mandarin Chinese broadcast news corpus (MATBN). Experimental results demonstrate that the proposed method can effectively improve the ASR results, with notable error rate reductions of 12.40% and 19.23% at signal-to-noise ratios of 0 dB and 5 dB, respectively.

Index Terms: reinforcement learning, automatic speech recognition, speech enhancement, deep neural network, character error rate


Forecasting electricity market prices: a neural network based approach

Hsieh et al.

Perceptual Characteristics Based Multi-objective Model for Speech Enhancement

Peng, Chan, Shen, et al. 2022

Active acoustic scene monitoring through spectro-temporal modulation filtering for intruder detection

Cheong, Shen, Chi 2022

An indoor acoustic scene monitoring system using a periodic impulse signal was previously developed. Compared with the impulse signal, the chirp signal is more robust against environmental noise because of its specific spectro-temporal structure, which makes the chirp sound easy to detect with a spectro-temporal modulation filtering mechanism. In this paper, we demonstrate a system that applies a two-dimensional spectro-temporal filtering mechanism to a Fourier spectrogram to measure the total energy of the reverberations of the chirp signal as the representation of the acoustic scene. The system compares the reverberation energy difference between consecutive chirps with a predefined threshold to automatically detect changes in the acoustic scene. Simulations were conducted in real living rooms with various types of background noise. Test results demonstrated that the proposed system is much more effective than previously developed systems at detecting acoustic scene changes caused by an intruder silently walking in the rooms. In the biggest test room (18 × 9.8 × 2.5 m³) with heavy background noise, the proposed system can still yield a correct identification rate higher than 80% for an intruder walking 7 m from the microphone, without producing any false alarms.
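The change-detection rule described in this abstract (comparing the reverberation energy difference between consecutive chirps with a predefined threshold) can be illustrated with a minimal sketch. It assumes the per-chirp reverberation energies have already been computed by the filtering stage, which is not shown. The function name `detect_scene_changes`, the use of an absolute difference, and the sample values are hypothetical and are not taken from the paper.

```python
from typing import List


def detect_scene_changes(energies: List[float], threshold: float) -> List[int]:
    """Return the indices of chirps whose reverberation energy differs from
    the previous chirp's energy by more than `threshold`.

    Assumption: the comparison uses the absolute difference; the abstract
    does not state whether the difference is signed.
    """
    changes = []
    for i in range(1, len(energies)):
        if abs(energies[i] - energies[i - 1]) > threshold:
            changes.append(i)
    return changes


# Hypothetical per-chirp reverberation energies (arbitrary units).
energies = [10.0, 10.1, 9.9, 12.5, 12.4]
print(detect_scene_changes(energies, threshold=1.0))  # prints [3]
```

In this sketch only the jump between the third and fourth chirps exceeds the threshold, so a single scene change is flagged at index 3.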


Plastic multi-resolution auditory model based neural network for speech enhancement

Lai, Shen, et al. 2017

