ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683153
|View full text |Cite
|
Sign up to set email alerts
|

An Improved Air Tissue Boundary Segmentation Technique for Real Time Magnetic Resonance Imaging Video Using Segnet

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 19 publications
(14 citation statements)
references
References 22 publications
0
14
0
Order By: Relevance
“…The acoustic feature based method CDNN mfcc performs better than the articulatory feature based methods such as CDNNarti and CDNNiarti. Although we use the segnet model [31] for ATB prediction due to its best performance so far in the literature, the predicted upper and lower vocal tract boundaries are not as accurate as the ground truth ATBs. Thus in VTTP generation, the distance between the upper and lower vocal tract boundaries may not always be accurate enough to follow the vowel and consonant variation which could affect the final speech rate estimation.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The acoustic feature based method CDNN mfcc performs better than the articulatory feature based methods such as CDNNarti and CDNNiarti. Although we use the segnet model [31] for ATB prediction due to its best performance so far in the literature, the predicted upper and lower vocal tract boundaries are not as accurate as the ground truth ATBs. Thus in VTTP generation, the distance between the upper and lower vocal tract boundaries may not always be accurate enough to follow the vowel and consonant variation which could affect the final speech rate estimation.…”
Section: Methodsmentioning
confidence: 99%
“…The 30 distances are considered starting from the lips to the glottis. The upper and lower vocal tract boundary shapes (red and green curves in Figure 3) are obtained using the air-tissue boundary (ATB) segmentation technique proposed in [31]. The ATB segmentation approach uses a convolutional encoder-decoder network (Seg-Net).…”
Section: Mfcc Feature Extractionmentioning
confidence: 99%
“…To enable quantitative analysis of the information provided by these images, it is necessary to segment the anatomical features of interest, such as the vocal tract and articulators [1] , [2] , [3] , [4] , [5] . To avoid the time-consuming and expensive process of manual segmentation, several methods have been developed to perform this task semi or fully automatically [17] , [18] , [19] , [20] , [21] , [22] , [23] , [24] , [25] . One of these methods segmented the entire vocal tract [25] , while the others only labelled pixels at air-tissue boundaries and therefore created a partial contour for each articulator [17] , [18] , [19] , [20] , [21] , [22] , [23] , [24] .…”
Section: Introductionmentioning
confidence: 99%
“…SegNet also has many applications in the field of medical image segmentation . Moreover, its application in semantic segmentation is relatively mature . PSPNet is rarely used in medical image field .…”
Section: Introductionmentioning
confidence: 99%
“…[28][29][30][31] Moreover, its application in semantic segmentation is relatively mature. [32][33][34][35] PSPNet is rarely used in medical image field. 28 However, it is currently used in semantic segmentation and remote sensing image segmentation, [36][37][38] and can also be used for image classification.…”
Section: Introductionmentioning
confidence: 99%