Hossein Hadian scite author profile

We present our work on end-to-end training of acoustic models using the lattice-free maximum mutual information (LF-MMI) objective function in the context of hidden Markov models. By end-to-end training, we mean flat-start training of a single DNN in one stage without using any previously trained models, forced alignments, or building state-tying decision trees. We use full biphones to enable context-dependent modeling without trees, and show that our end-to-end LF-MMI approach can achieve comparable results to regular LF-MMI on well-known large vocabulary tasks. We also compare with other end-to-end methods such as CTC in character-based and lexicon-free settings and show 5 to 25 percent relative reduction in word error rates on different large vocabulary tasks while using significantly smaller models.

show abstract

Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI

Manohar

Hadian

Povey

et al. 2018

View full text Add to dashboard Cite

Investigation of transfer learning for ASR using LF-MMI trained neural networks

Ghahremani

Manohar

Hadian

et al. 2017

View full text Add to dashboard Cite

Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR

Hadian

Sameti

Povey

et al. 2018

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hossein Hadian

A Time-Restricted Self-Attention Layer for ASR

End-to-end Speech Recognition Using Lattice-free MMI

Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI

Investigation of transfer learning for ASR using LF-MMI trained neural networks

Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR

Contact Info

Product

Resources

About