Huang Xie scite author profile

Huang Xie

5 Publications

50 Citation Statements Received

71 Citation Statements Given


Affiliations

Tampere University, Jiangsu University of Science and Technology

Publications


Zero-Shot Audio Classification Based On Class Label Embeddings

Xie

Virtanen

2019


This paper proposes a zero-shot learning approach for audio classification that relies on textual information about class labels and requires no audio samples from the target classes. We propose an audio classification system built on a bilinear model, which takes audio feature embeddings and semantic class label embeddings as input and measures the compatibility between an audio feature embedding and a class label embedding. We use VGGish to extract audio feature embeddings from audio recordings. We treat textual labels as semantic side information about audio classes and use Word2Vec to generate class label embeddings. Results on the ESC-50 dataset show that the proposed system can perform zero-shot audio classification with a small training dataset. It achieves accuracy (26% on average) better than random guessing (10%) on each audio category. In particular, it reaches up to 39.7% for the category of natural audio classes.
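The bilinear compatibility idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `compatibility` and `predict`, the two-dimensional toy vectors, and the hand-set weight matrix are all assumptions made for illustration. In the described system the audio vector would come from VGGish, the label vectors from Word2Vec, and the weight matrix would be learned from the training classes.

```python
from typing import Dict, Sequence

Vector = Sequence[float]
Matrix = Sequence[Sequence[float]]


def compatibility(audio: Vector, weights: Matrix, label: Vector) -> float:
    """Bilinear score audio^T * W * label (hypothetical helper)."""
    return sum(
        a * w * l
        for a, row in zip(audio, weights)
        for w, l in zip(row, label)
    )


def predict(audio: Vector, weights: Matrix, labels: Dict[str, Vector]) -> str:
    """Return the class whose label embedding is most compatible with the audio."""
    return max(labels, key=lambda name: compatibility(audio, weights, labels[name]))


# Toy stand-ins (assumed values, not real VGGish or Word2Vec embeddings).
audio_embedding = [1.0, 0.0]
weight_matrix = [[1.0, 0.0], [0.0, 1.0]]  # hand-set here; learned in practice
label_embeddings = {"dog": [0.9, 0.1], "rain": [0.1, 0.9]}

print(predict(audio_embedding, weight_matrix, label_embeddings))
```

Because the score depends only on the label embedding, a class that had no audio in training can still be ranked at test time as long as its textual label can be embedded.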


Zero-Shot Audio Classification Via Semantic Embeddings

Xie

Virtanen

2021

IEEE/ACM Trans. Audio Speech Lang. Process.


Effect of Tribofilm Induced by Nanoparticle Addition on Wear Behavior of Titanium-Matrix Composite

et al. 2021


Zero-Shot Audio Classification via Semantic Embeddings

Xie

Virtanen

2020

Preprint


Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases

Xie

Räsänen

Drossos

et al. 2022

