Jinzi Qi scite author profile

Jinzi Qi

4 Publications

4 Citation Statements Received

16 Citation Statements Given


Affiliations

KU Leuven, Harbin Institute of Technology

Publications


Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-Encoders

hamme

2021


Objective speech disorder classification for speakers with communication difficulty is desirable for diagnosis and administering therapy. Given the current state of speech technology, neural networks are an obvious choice for this application, but neural network model training is hampered by a lack of labeled disordered speech data. In this research, we apply an extended version of Factorized Hierarchical Variational Autoencoders (FHVAE) for representation learning on disordered speech. The FHVAE model extracts both content-related and sequence-related latent variables from speech data, and we use the extracted variables to explore how disorder type information is represented in the latent variables. For better classification performance, the latent variables are aggregated at the word and sentence level. We show that an extension of the FHVAE model achieves better disentanglement of the content-related and sequence-related representations, but both representations are still required for best results on disorder type classification.
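The abstract does not say how the latent variables are aggregated. As a minimal sketch only, assuming simple mean pooling over index spans (the function name `aggregate_latents` and the `(start, end)` span format are hypothetical, not taken from the paper), word-level and sentence-level aggregation could look like this:

```python
def aggregate_latents(latents, spans):
    """Mean-pool latent vectors over each (start, end) span; end is exclusive.

    Assumption: mean pooling stands in for the paper's aggregation step,
    which the abstract does not specify.
    """
    pooled = []
    for start, end in spans:
        chunk = latents[start:end]
        if not chunk:
            raise ValueError(f"empty span ({start}, {end})")
        dim = len(chunk[0])
        # Average each latent dimension across the vectors in this span.
        pooled.append([sum(vec[k] for vec in chunk) / len(chunk) for k in range(dim)])
    return pooled


# Three toy 2-dimensional latent vectors for one utterance.
latents = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

# Hypothetical word boundaries: the first word covers vectors 0-1, the second covers vector 2.
word_level = aggregate_latents(latents, [(0, 2), (2, 3)])

# Sentence level: one span over the whole utterance.
sentence_level = aggregate_latents(latents, [(0, len(latents))])
```

The same function covers both levels because a sentence is treated as one span over all vectors; a real system would take word boundaries from an alignment.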


Adaptive semisupervised feature selection without graph construction for very-high-resolution remote sensing images

Chen

et al. 2016

J. Appl. Remote Sens


Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-encoders

Qi, hamme

2021

Preprint


Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation

Qi, hamme

2023


In dysarthric speech recognition, data scarcity and the vast diversity between dysarthric speakers pose significant challenges. While finetuning has been a popular solution, it can lead to overfitting and low parameter efficiency. Adapter modules offer a better solution, with their small size and easy applicability. Additionally, Adapter Fusion can facilitate knowledge transfer from multiple learned adapters, but may employ more parameters. In this work, we apply Adapter Fusion for target speaker adaptation and speech recognition, achieving acceptable accuracy with significantly fewer speaker-specific trainable parameters than classical finetuning methods. We further improve the parameter efficiency of the fusion layer by reducing the size of query and key layers and using Householder transformation to reparameterize the value linear layer. Our proposed fusion layer achieves comparable recognition results to the original method with only one third of the parameters.
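The abstract does not describe how the value layer is reparameterized. As a sketch of the general idea only, a Householder reflection H = I - 2vvᵀ/(vᵀv) defines a d×d orthogonal matrix from a single d-dimensional vector v, so it needs d parameters rather than d². The helper names below are hypothetical and are not the authors' implementation:

```python
def householder_matrix(v):
    """Build H = I - 2 v v^T / (v^T v) as a list of rows."""
    norm_sq = sum(x * x for x in v)
    if norm_sq == 0.0:
        raise ValueError("v must be non-zero")
    d = len(v)
    return [
        [(1.0 if i == j else 0.0) - 2.0 * v[i] * v[j] / norm_sq for j in range(d)]
        for i in range(d)
    ]


def apply_householder(v, x):
    """Compute H x without forming H: x - 2 v (v . x) / (v . v)."""
    norm_sq = sum(c * c for c in v)
    if norm_sq == 0.0:
        raise ValueError("v must be non-zero")
    scale = 2.0 * sum(a * b for a, b in zip(v, x)) / norm_sq
    return [xi - scale * vi for xi, vi in zip(x, v)]


def matmul(a, b):
    """Plain nested-list matrix product, used here only to check H H = I."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


v = [1.0, 2.0, 2.0]           # 3 parameters define a 3x3 orthogonal matrix
H = householder_matrix(v)
HH = matmul(H, H)             # a reflection is its own inverse, so this is I
reflected = apply_householder(v, v)  # reflecting v itself gives -v
```

Applying the reflection through `apply_householder` costs O(d) per vector and never stores the full matrix, which is where the parameter saving comes from in this simplified setting.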

