2022
DOI: 10.1016/j.jvoice.2022.07.007
|View full text |Cite
|
Sign up to set email alerts
|

Diagnosis of Early Glottic Cancer Using Laryngeal Image and Voice Based on Ensemble Learning of Convolutional Neural Network Classifiers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(13 citation statements)
references
References 21 publications
0
13
0
Order By: Relevance
“…The features of the MFCCs of healthy and laryngeal cancer speech are quite distinct in table 2, so there is no performance improvement in applying the CNN after image conversion. These results demonstrate a significant improvement in accuracy by more than 0.25 compared to prior studies 12,13 . This improvement is attributed to the effect of segmenting the audio signals into fine-grained MFCC data, which enhances data augmentation while preserving key features.…”
Section: Healthy Vs Laryngeal Cancermentioning
confidence: 45%
See 1 more Smart Citation
“…The features of the MFCCs of healthy and laryngeal cancer speech are quite distinct in table 2, so there is no performance improvement in applying the CNN after image conversion. These results demonstrate a significant improvement in accuracy by more than 0.25 compared to prior studies 12,13 . This improvement is attributed to the effect of segmenting the audio signals into fine-grained MFCC data, which enhances data augmentation while preserving key features.…”
Section: Healthy Vs Laryngeal Cancermentioning
confidence: 45%
“…Kim et al 12 distinguished laryngeal cancer and healthy controls with 0.85 accuracy in one-dimensional convolutional neural network (1D-CNN) using voice data. Kwon et al 13 acquired 0.95 of accuracy in the fusion of the laryngeal image and voice data by ensemble learning of two CNN models for the same task, but the accuracy was limited to 0.71, when classifying with only voice data. These voice-based AI laryngeal cancer diagnosis models have the limitation of simply classifying laryngeal cancer and healthy condition.…”
Section: Introductionmentioning
confidence: 99%
“…Several ML methods using vocal recordings perform binary classification to distinguish voices from patients with laryngeal cancer from those with healthy voices or benign voice disorders with accuracy ranging from 85.2% to 98% [81][82][83][84][85][86][87][88]89 ], which is derived from a transformation of the audio signal and provides a compact representation of the spectral properties of a sound. Others algorithms rely on acoustic features (jitter, shimmer, and harmonic features) [81,88] and glottal air-flow parameters [84,86].…”
Section: Machine Learning Models Utilizing Voice and Speech To Screen...mentioning
confidence: 99%
“…Several ML methods using vocal recordings perform binary classification to distinguish voices from patients with laryngeal cancer from those with healthy voices or benign voice disorders with accuracy ranging from 85.2% to 98% [81–88,89 ▪ ]. The most common feature extracted is the Mel-frequency cepstral coefficient (MFCC) [81–83,85,89 ▪ ], which is derived from a transformation of the audio signal and provides a compact representation of the spectral properties of a sound.…”
Section: Machine Learning Models Utilizing Voice and Speech To Screen...mentioning
confidence: 99%
See 1 more Smart Citation