D S Pavan Kumar scite author profile

D S Pavan Kumar

3 Publications

4 Citation Statements Received

61 Citation Statements Given

Publications

Implementing Fusion Techniques for the Classification of Paralinguistic Information

Vlasenko, Sebastian, Kumar, et al. 2018

This work tests several classification techniques and acoustic features and combines them using late fusion to classify paralinguistic information for the ComParE 2018 challenge. We use Multiple Linear Regression (MLR) with Ordinary Least Squares (OLS) analysis to select the most informative features for the Self-Assessed Affect (SSA) sub-challenge. We also propose raw-waveform convolutional neural networks (CNNs) for three paralinguistic sub-challenges. By using a combined evaluation split to estimate the codebook, we obtain a better representation for the Bag-of-Audio-Words approach. We preprocess the speech into vocalized segments to improve classification performance. To fuse our leading classification techniques, we apply a weighted late fusion approach to their confidence scores. To estimate the optimal fusion weight, we use two mismatched evaluation phases in which the training and development sets are exchanged. Weighted late fusion performs better on the development sets than the baseline techniques. Raw-waveform techniques perform comparably to the baseline.
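The weighted late fusion of confidence scores mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the two-system setup, the grid of candidate weights, and the use of unweighted average recall as the selection metric are all assumptions made for illustration, and the weight is tuned on a single held-out set rather than with the exchanged training and development phases the abstract describes.

```python
import numpy as np

def late_fusion(scores_a, scores_b, weight):
    """Blend two systems' confidence scores; `weight` is the share given to system A."""
    return weight * scores_a + (1.0 - weight) * scores_b

def unweighted_average_recall(y_true, y_pred, n_classes):
    """Mean of per-class recalls (assumed metric; the abstract does not name one)."""
    recalls = []
    for c in range(n_classes):
        mask = y_true == c
        if mask.any():
            recalls.append((y_pred[mask] == c).mean())
    return float(np.mean(recalls))

def best_fusion_weight(scores_a, scores_b, y_true, grid=None):
    """Grid-search the fusion weight on held-out data (hypothetical helper)."""
    if grid is None:
        grid = np.linspace(0.0, 1.0, 11)
    n_classes = scores_a.shape[1]
    best_w, best_uar = 0.0, -1.0
    for w in grid:
        pred = late_fusion(scores_a, scores_b, w).argmax(axis=1)
        uar = unweighted_average_recall(y_true, pred, n_classes)
        if uar > best_uar:  # keep the first weight that reaches the best score
            best_w, best_uar = float(w), uar
    return best_w, best_uar

# Toy held-out data: each system alone misclassifies one of four samples.
y_true = np.array([0, 1, 0, 1])
scores_a = np.array([[0.9, 0.1], [0.2, 0.8], [0.4, 0.6], [0.3, 0.7]])
scores_b = np.array([[0.6, 0.4], [0.6, 0.4], [0.9, 0.1], [0.1, 0.9]])
best_w, best_uar = best_fusion_weight(scores_a, scores_b, y_true)
```

On this toy data the two systems make different errors, so an intermediate weight can recover both mistakes, which is the usual motivation for fusing scores instead of picking a single system.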

Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech

Sebastian, Kumar, Kumar, et al. 2018

This paper presents a raw-waveform neural network and uses it along with a denoising network for clustering in weakly-supervised learning scenarios under extreme noise conditions. Specifically, we consider language-independent Automatic Gender Recognition (AGR) across a set of varied noise conditions and Signal-to-Noise Ratios (SNRs). We formulate the denoising problem as a source separation task and train the system using a discriminative criterion in order to enhance output SNRs. A denoising Recurrent Neural Network (RNN) is first trained on a small subset (roughly one-fifth) of the data to learn a speech-specific mask. The denoised speech signal is then fed directly as input to a raw-waveform convolutional neural network (CNN) trained on denoised speech. We evaluate the standalone performance of the denoiser in terms of various signal-to-noise measures and discuss its contribution towards robust AGR. The combined pipeline achieves absolute improvements of 11.06% and 13.33% over the i-vector SVM baseline system for the 0 dB and -5 dB SNR conditions, respectively. We further analyse the information captured by the first CNN layer in both noisy and denoised speech.
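For readers unfamiliar with the SNR conditions quoted in the abstract, the following is a minimal sketch of how an SNR in dB is computed and how noise can be scaled so that a mixture reaches a target SNR such as 0 dB or -5 dB. The function names and the synthetic signals are assumptions for illustration; the abstract does not describe how the authors prepared their noisy data.

```python
import numpy as np

def snr_db(signal, noise):
    """SNR in decibels: 10 * log10 of the ratio of mean signal power to mean noise power."""
    return 10.0 * np.log10(np.mean(signal ** 2) / np.mean(noise ** 2))

def mix_at_snr(speech, noise, target_db):
    """Scale `noise` so that speech + noise has the requested SNR (hypothetical helper)."""
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    scale = np.sqrt(speech_power / (noise_power * 10.0 ** (target_db / 10.0)))
    scaled_noise = scale * noise
    return speech + scaled_noise, scaled_noise

# Synthetic stand-ins for a speech signal and a noise recording.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 16000, endpoint=False)
speech = np.sin(2.0 * np.pi * 220.0 * t)
noise = rng.normal(size=t.shape)

mixture, scaled_noise = mix_at_snr(speech, noise, target_db=-5.0)
achieved = snr_db(speech, scaled_noise)
```

At -5 dB the noise carries roughly three times the power of the speech, which gives a sense of how severe the conditions in the abstract are.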

Implementation of Speech Recognition in Web Application for Sub Continental Language

Kumar, Sachan, Kumar 2014

IJETT
