Mohammad Zeineldeen scite author profile

Mohammad Zeineldeen

5 Publications

24 Citation Statements Received

101 Citation Statements Given

Affiliations

RWTH Aachen University, FH Aachen

Publications

Investigating Methods to Improve Language Model Integration for Attention-Based Encoder-Decoder ASR Models

Zeineldeen, Glushko, Michel, et al. 2021

We study a streamable attention-based encoder-decoder model in which either the decoder, or both the encoder and decoder, operate on pre-defined, fixed-size windows called chunks. A special end-of-chunk (EOC) symbol advances from one chunk to the next chunk, effectively replacing the conventional end-of-sequence symbol. This modification, while minor, situates our model as equivalent to a transducer model that operates on chunks instead of frames, where EOC corresponds to the blank symbol. We further explore the remaining differences between a standard transducer and our model. Additionally, we examine relevant aspects such as long-form speech generalization, beam size, and length normalization. Through experiments on Librispeech and TED-LIUM-v2, and by concatenating consecutive sequences for long-form trials, we find that our streamable model maintains competitive performance compared to the non-streamable variant and generalizes very well to long-form speech.

Improving the Training Recipe for a Robust Conformer-based Hybrid Model

Zeineldeen, Xu, Lüscher, et al. 2022

Conformer-Based Hybrid ASR System For Switchboard Dataset

Zeineldeen, Xu, Lüscher, et al. 2022

Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition

Zhou, Zeineldeen, Zheng, et al. 2021

Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures

Rossenbach, Zeineldeen, Hilmes, et al. 2021

Preprint

Recent publications on automatic speech recognition (ASR) have a strong focus on attention encoder-decoder (AED) architectures, which work well for large datasets but tend to overfit when applied in low-resource scenarios. One solution to tackle this issue is to generate synthetic data with a trained text-to-speech (TTS) system if additional text is available. This was successfully applied in many publications with AED systems. We present a novel approach of silence correction in the data pre-processing for TTS systems which increases the robustness when training on corpora targeted for ASR applications. In this work we not only show the successful application of synthetic data for AED systems, but also test the same method on a highly optimized state-of-the-art Hybrid ASR system and a competitive monophone-based system using connectionist temporal classification (CTC). We show that for the latter systems the addition of synthetic data only has a minor effect, but they still outperform the AED systems by a large margin on LibriSpeech-100h. We achieve a final word error rate of 3.3%/10.0% with a Hybrid system on the clean/noisy test sets, surpassing any previous state-of-the-art systems that do not include unlabeled audio data.
