2022
DOI: 10.48550/arxiv.2201.09692
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Improving Factored Hybrid HMM Acoustic Modeling without State Tying

Abstract: In this work, we show that a factored hybrid hidden Markov model (FH-HMM) which is defined without any phonetic state-tying outperforms a state-of-the-art hybrid HMM. The factored hybrid HMM provides a link to transducer models in the way it models phonetic (label) context while preserving the strict separation of acoustic and language model of the hybrid HMM approach. Furthermore, we show that the factored hybrid model can be trained from scratch without using phonetic state-tying in any of the training steps… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

1
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 30 publications
1
1
0
Order By: Relevance
“…We develop a multi-head decoding method, which yields the best results. We find that framelevel training is still useful, but on the other hand, expert pronunciation-lexicons and tree-clustering for state-tying do not appear necessary, echoing other recent work [14], [15], and potentially simplifying the HMM / DNN-system.…”
supporting
confidence: 67%
See 1 more Smart Citation
“…We develop a multi-head decoding method, which yields the best results. We find that framelevel training is still useful, but on the other hand, expert pronunciation-lexicons and tree-clustering for state-tying do not appear necessary, echoing other recent work [14], [15], and potentially simplifying the HMM / DNN-system.…”
supporting
confidence: 67%
“…Additionally, we find that the simple pruning Flat Start LF-MMI outperforms the treeclustering state-tied LF-MMI in our test on Finnish Parliament Train20. This is surprising, but seems to suggest that the treeclustering does not always yield better performance in HMM acoustic modeling, which is also suggested in other recent work [15] and would simplify the HMM / DNN-system further. The investigation of this phenomenon is out of scope for this work.…”
Section: A Hybrid Hidden Markov Model / Deep Neural Network Systemsmentioning
confidence: 51%