2021
DOI: 10.1016/j.cmpb.2020.105814
|View full text |Cite
|
Sign up to set email alerts
|

Deep-learning-based segmentation of the vocal tract and articulators in real-time magnetic resonance images of speech

Abstract: Highlights A method to segment the vocal tract and articulators in MR images was developed. Median accuracy: Dice coefficient of 0.92; general Hausdorff distance of 5mm. Developed to facilitate quantitative analysis of the vocal tract and articulators. Intended for use in clinical and non-clinical studies of speech. A novel clinically relevant segmentation accuracy metric was also developed.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 14 publications
(20 citation statements)
references
References 28 publications
0
20
0
Order By: Relevance
“…The proposed framework includes a deep-learning-based method to estimate segmentations of the following six anatomical features in the image pair: the head, soft palate, jaw, tongue, vocal tract and tooth space. This method is described in [56] and consists of two steps. First, segmentations of the six anatomical features in the image pair are estimated using a pre-trained CNN.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…The proposed framework includes a deep-learning-based method to estimate segmentations of the following six anatomical features in the image pair: the head, soft palate, jaw, tongue, vocal tract and tooth space. This method is described in [56] and consists of two steps. First, segmentations of the six anatomical features in the image pair are estimated using a pre-trained CNN.…”
Section: Methodsmentioning
confidence: 99%
“…Second, a connected-component-based post-processing step is performed to remove anatomically impossible regions from the segmentations. For full information about the segmentation method, the reader is referred to [56] .…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Furthermore, those approaches which are available have not yet been demonstrated to generalise beyond the individual datasets for which they were developed. These methodologies typically involve automated or machine learning processes which are trained and tested against a narrow range of data, typically composed of a small number of speakers scanned at a single imaging centre [12][13][14][15][16][17][18]. The development of these techniques has sampled disproportionally from a single image repository [6].…”
Section: Introductionmentioning
confidence: 99%