2023
DOI: 10.1111/coin.12602
|View full text |Cite
|
Sign up to set email alerts
|

Using LSTM neural networks for cross‐lingual phonetic speech segmentation with an iterative correction procedure

Zdeněk Hanzlíček,
Jindřich Matoušek,
Jakub Vít

Abstract: This article describes experiments on speech segmentation using long short‐term memory recurrent neural networks. The main part of the paper deals with multi‐lingual and cross‐lingual segmentation, that is, it is performed on a language different from the one on which the model was trained. The experimental data involves large Czech, English, German, and Russian speech corpora designated for speech synthesis. For optimal multi‐lingual modeling, a compact phonetic alphabet was proposed by sharing and clustering… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 95 publications
0
0
0
Order By: Relevance