Jinying Sun scite author profile

A Novel Weighted Dynamic Time Warping for Light Weight Speaker-Dependent Speech Recognition in Noisy and Bad Recording Conditions

Lightweight speaker-dependent (SD) automatic speech recognition (ASR) is a promising solution to two problems: the risk of disclosing personal information, and the difficulty of obtaining training material for many seldom-used English words and (often non-English) names. The dynamic time warping (DTW) algorithm is the state of the art for small-footprint SD ASR applications, which have limited storage space and a small vocabulary. In our previous work, we developed two fast and accurate DTW variations for clean speech data. However, speech recognition in adverse conditions remains a big challenge. To improve recognition accuracy in noisy and bad recording conditions, such as a recording volume that is too high or too low, we introduce a novel weighted DTW method. This method defines a feature index for each time frame of the training data and then applies it in the core DTW process to tune the final alignment score. In extensive experiments on one representative SD dataset of three speakers' recordings, our method achieves better accuracy than DTW: a 0.5% relative reduction of error rate (RRER) on clean speech data and a 7.5% RRER on noisy and badly recorded speech data. To the best of our knowledge, our new weighted DTW is the first weighted DTW method specially designed for speech data in noisy and bad recording conditions.
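The abstract does not give the weighting formula, so the following is only a minimal sketch of the general idea, under stated assumptions: each training (template) frame carries one non-negative weight, and that weight scales the local frame distance inside the standard DTW recursion. The names `weighted_dtw` and `rrer` are hypothetical, the Euclidean local distance is an assumption, and `rrer` uses the usual definition of a relative error-rate reduction rather than anything confirmed by the paper.

```python
import math


def weighted_dtw(template, weights, query):
    """Return a weighted DTW alignment cost between template and query.

    template, query: lists of feature vectors (lists of floats).
    weights: one non-negative weight per template frame. This is an
    assumed stand-in for the per-frame index the abstract mentions.
    """
    n, m = len(template), len(query)
    if len(weights) != n:
        raise ValueError("need exactly one weight per template frame")

    # cost[i][j] = best cumulative cost aligning template[:i] with query[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Local distance, scaled by the weight of the template frame.
            local = weights[i - 1] * math.dist(template[i - 1], query[j - 1])
            # Classic DTW step pattern: insertion, deletion, or match.
            cost[i][j] = local + min(
                cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1]
            )
    return cost[n][m]


def rrer(baseline_error, new_error):
    """Relative reduction of error rate, as a fraction of the baseline."""
    return (baseline_error - new_error) / baseline_error
```

With all weights set to 1.0 the function reduces to plain DTW, so the weights act purely as a per-frame tuning knob on the alignment score. Lowering the weight of a frame reduces its influence on the final score, which is one plausible way to discount frames that are considered unreliable.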


Numerical and theoretical study of the crosstalk in gain clamped semiconductor optical amplifiers

Sun

Morthier

Baets

1997

IEEE J. Select. Topics Quantum Electron.


A rate equation model of a gain-clamped semiconductor optical amplifier (GCSOA) is presented. Both a time-domain and a small-signal analysis of those rate equations are used to investigate the crosstalk between different signal channels. It is shown that the crosstalk of GCSOAs depends strongly on the bit rate of the amplified signals and is lower at both very high and low bit rates. This crosstalk is proportional to the input power and, approximately, to the amplification.


Performance improvement of automatic speech recognition systems via multiple language models produced by sentence-based clustering

Podder

Shaban

Sun

et al.


Grammar-based speech recognition systems exhibit performance degradation as their vocabulary sizes increase. Data clustering is expected to mitigate this problem. We introduce an approach to data clustering for automatic speech recognition systems using a Kohonen self-organizing map. The clustering results are then used to build a language model for each cluster with the CMU-Cambridge toolkit. The approach was implemented as a prototype for a large-vocabulary continuous speech recognition system, and a performance improvement of about 8% was achieved compared with the language model and dictionary provided by Sphinx3. In this paper we present the experimental results along with discussion, analysis, and potential future directions.
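The clustering step can be illustrated with a small self-organizing map. This is a rough sketch under assumptions the abstract does not confirm: sentences are taken to be already encoded as fixed-length numeric vectors (for example, bag-of-words counts), the map is one-dimensional, and the learning rate and neighborhood radius decay linearly. The names `best_unit` and `train_som` are hypothetical, and building the per-cluster language models is not shown.

```python
import math
import random


def best_unit(units, x):
    """Return the index of the map unit closest to vector x."""
    return min(range(len(units)), key=lambda k: math.dist(units[k], x))


def train_som(data, n_units, epochs=50, lr0=0.5, seed=0):
    """Train a 1-D Kohonen map on data and return the unit vectors.

    data: list of equal-length feature vectors (lists of floats).
    n_units: number of map units, i.e. the number of clusters.
    """
    rng = random.Random(seed)
    dim = len(data[0])
    # Initialize each unit as a copy of a randomly chosen sample.
    units = [list(rng.choice(data)) for _ in range(n_units)]
    radius0 = max(n_units / 2.0, 1.0)

    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                       # decaying learning rate
        radius = max(radius0 * (1.0 - frac), 0.5)     # shrinking neighborhood
        for x in rng.sample(data, len(data)):         # shuffled pass over data
            bmu = best_unit(units, x)
            for k in range(n_units):
                # Gaussian neighborhood around the best-matching unit.
                h = math.exp(-((k - bmu) ** 2) / (2.0 * radius ** 2))
                for d in range(dim):
                    units[k][d] += lr * h * (x[d] - units[k][d])
    return units
```

After training, `best_unit(units, sentence_vector)` gives the cluster index for a sentence. Under the approach described above, the sentences assigned to each index would then form the training text for that cluster's language model.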



Jinying Sun

A novel optical decision circuit based on a Mach-Zehnder or Michelson interferometer and gain-clamped semiconductor optical amplifiers

A novel optical decision circuit based on a Mach-Zehnder interferometer and gain-clamped semiconductor optical amplifiers

A Novel Weighted Dynamic Time Warping for Light Weight Speaker-Dependent Speech Recognition in Noisy and Bad Recording Conditions

Numerical and theoretical study of the crosstalk in gain clamped semiconductor optical amplifiers

Performance improvement of automatic speech recognition systems via multiple language models produced by sentence-based clustering
