2022
DOI: 10.3390/cancers14102366
|View full text |Cite
|
Sign up to set email alerts
|

Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery

Abstract: Laryngeal carcinoma is the most common malignant tumor of the upper respiratory tract. Total laryngectomy provides complete and permanent detachment of the upper and lower airways that causes the loss of voice, leading to a patient’s inability to verbally communicate in the postoperative period. This paper aims to exploit modern areas of deep learning research to objectively classify, extract and measure the substitution voicing after laryngeal oncosurgery from the audio signal. We propose using well-known con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 77 publications
0
8
0
Order By: Relevance
“…AVQI was used to assess general vocal quality, and ABI was used to assess breathiness, both of which were found to be relevant. Other researchers wanted to use cutting-edge deep learning research to objectively categorize, extract, and assess substitute voicing following laryngeal oncosurgery from audio signals [35]. Their technique had the highest true-positive rate of all of the cutting-edge approaches examined, reaching an acceptable overall accuracy and demonstrating the practical usage of voice quality devices.…”
Section: State Of the Art Reviewmentioning
confidence: 99%
“…AVQI was used to assess general vocal quality, and ABI was used to assess breathiness, both of which were found to be relevant. Other researchers wanted to use cutting-edge deep learning research to objectively categorize, extract, and assess substitute voicing following laryngeal oncosurgery from audio signals [35]. Their technique had the highest true-positive rate of all of the cutting-edge approaches examined, reaching an acceptable overall accuracy and demonstrating the practical usage of voice quality devices.…”
Section: State Of the Art Reviewmentioning
confidence: 99%
“…Objective evaluation is more challenging in patients who have undergone extended cordectomy, or partial or total laryngectomy as they utilize substitution voicing, where voice is generated by a single oscillating vocal fold against remaining laryngeal and pharyngeal structures, alaryngeal (esophageal or tracheoesophageal) processes, vocal prostheses, or electrolarynx. Maskeliunas et al [56 ▪▪ ,57] used DL to objectively categorize, extract, and assess substitution voicing from speech audio signals. Mel-frequency spectrograms (visual representation of the frequency spectrum of an acoustic signal) were used as input to a deep neural network architecture with an overall accuracy of 89.5%.…”
Section: Machine Learning Applications To Assess Voice and Swallowing...mentioning
confidence: 99%
“…Furthermore, morphological characteristics supplied by CT are insufficient to evaluate the biological characteristics of the primary tumor. At present, radiomics overcomes the insufficiency of the above traditional imaging techniques, widely used in clinical diagnosis, treatment, and prognosis ( 15 , 16 ). The qualitative and quantitative assessment of lesion characteristics and intratumoral spatial heterogeneity can contribute to improving the non-invasive preoperative diagnosis accuracy of HNSCC.…”
Section: Introductionmentioning
confidence: 99%