The performance of Automatic Speech Recognition (ASR) depends strongly on both the quality and the quantity of the training corpus. Data scarcity, combined with the high acoustic variability of children's speech, degrades the performance of ASR systems. Punjabi is a tonal, low-resource language, and little Punjabi children's speech data is available, which leads to poor performance in Punjabi children's speech recognition. To overcome these limited-data conditions, this paper evaluates two corpora from different domains to test their feasibility for improving ASR performance. We implemented Tacotron as a speech synthesis system for the Punjabi language. The speech synthesized by Tacotron was merged with the available speech corpus and tested on a Punjabi children's ASR system using Mel Frequency Cepstral Coefficient (MFCC) + pitch feature extraction and Deep Neural Network (DNN) acoustic modeling. The merged corpus reduced the Word Error Rate (WER) of the ASR system, yielding a Relative Improvement (RI) of 9-12%.
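The relative-improvement figure quoted above follows from the standard definition of RI over absolute WERs. A minimal sketch (the WER values here are hypothetical placeholders, not the paper's actual numbers):

```python
def relative_improvement(wer_baseline, wer_new):
    """Relative WER improvement (%) of a new system over a baseline."""
    return 100.0 * (wer_baseline - wer_new) / wer_baseline

# Hypothetical WERs for illustration only (not the paper's figures):
# a drop from 20.0% to 18.0% WER is a 10% relative improvement.
print(round(relative_improvement(20.0, 18.0), 1))  # -> 10.0
```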
Speech recognition has been an active field of research over the last few decades, since it enables better human-computer interaction. However, automatic speech recognition (ASR) systems for many native languages remain underdeveloped. Punjabi ASR is still in its infancy: most research has been conducted on adult speech, and far less work has addressed Punjabi children's ASR. This research aimed to build a prosodic feature-based children's speech recognition system using discriminative modeling techniques. A corpus of Punjabi children's speech poses various runtime challenges, such as acoustic variation across speakers' ages. To address these issues, out-of-domain data augmentation was implemented using a Tacotron-based text-to-speech synthesizer. Prosodic features were extracted from the Punjabi children's speech corpus, and selected prosodic features were coupled with Mel Frequency Cepstral Coefficient (MFCC) features before being submitted to the ASR framework. The system modeling process investigated several approaches, including Maximum Mutual Information (MMI), boosted Maximum Mutual Information (bMMI), and feature-space Maximum Mutual Information (fMMI). Out-of-domain data augmentation was performed to enlarge the corpus; prosodic features were then extracted from the extended corpus, and experiments were conducted on both individual and integrated prosodic-based acoustic features. The fMMI technique exhibited a 20% to 25% relative improvement in word error rate compared with the MMI and bMMI techniques. Performance was further enhanced by the augmented dataset and hybrid front-end features (MFCC + POV + F0 + voice quality), with a relative improvement of 13% over the earlier baseline system.
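The prosodic front end described above rests on frame-level F0 and voicing estimates that are then stacked alongside MFCCs. A minimal sketch of one common approach, autocorrelation-based pitch estimation, where the normalized autocorrelation peak height also serves as a simple probability-of-voicing (POV) proxy (the function name and synthetic test signal are illustrative, not the paper's implementation):

```python
import numpy as np

def autocorr_f0(frame, sr, fmin=60.0, fmax=400.0):
    """Estimate F0 of one frame via autocorrelation peak picking.

    Returns (f0_hz, voicing), where voicing is the normalized
    autocorrelation peak height, a crude POV-style measure.
    """
    frame = frame - frame.mean()
    # Keep non-negative lags of the full autocorrelation.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    if ac[0] <= 0:  # silent frame
        return 0.0, 0.0
    ac = ac / ac[0]                 # normalize so lag 0 equals 1
    lo = int(sr / fmax)             # shortest plausible pitch period
    hi = int(sr / fmin)             # longest plausible pitch period
    lag = lo + np.argmax(ac[lo:hi])
    return sr / lag, float(ac[lag])

# Synthetic 220 Hz tone as a stand-in for a voiced speech frame.
sr = 16000
t = np.arange(sr) / sr
y = np.sin(2 * np.pi * 220.0 * t)

f0, pov = autocorr_f0(y[:1024], sr)
```

In a full front end, per-frame (F0, POV) values computed this way would be concatenated frame-by-frame with the MFCC matrix (truncating both streams to the shorter length) to form the hybrid feature vectors.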