2018 IEEE International Conference on Big Data (Big Data) 2018
DOI: 10.1109/bigdata.2018.8622012
|View full text |Cite
|
Sign up to set email alerts
|

Parameterization of Sequence of MFCCs for DNN-based voice disorder detection

Abstract: In this article a DNN-based system for detection of three common voice disorders (vocal nodules, polyps and cysts; laryngeal neoplasm; unilateral vocal paralysis) is presented. The input to the algorithm is (at least 3-second long) audio recording of sustained vowel sound /a:/. The algorithm was developed as part of the "2018 FEMH Voice Data Challenge" organized by Far Eastern Memorial Hospital and obtained score value (defined in the challenge specification) of 77.44. This was the second best result before fi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 10 publications
0
9
0
Order By: Relevance
“…Currently, any research team can use it for their study purposes. Moreover, our method is ready to be released to the market and could be used, e.g., with StethoMe [41] device for remote diagnosis and educational purposes.…”
Section: Discussionmentioning
confidence: 99%
“…Currently, any research team can use it for their study purposes. Moreover, our method is ready to be released to the market and could be used, e.g., with StethoMe [41] device for remote diagnosis and educational purposes.…”
Section: Discussionmentioning
confidence: 99%
“…where k is the number of frames and n is the number of samples by which the window is shifted in order to yield the ith frame then taking the DFT of the resulting signal [21].…”
Section: ) Framing and Windowingmentioning
confidence: 99%
“…These energies are also known as the Mel spectrum and can be used for calculating the first 13 coefficients using DCT. A popular formula to convert f in hertz into ݂ is given in (2) [20,21]:…”
Section: ) Mel-spectrummentioning
confidence: 99%
“…The recent works applied unsupervised domain adaptation to address the hardware variation [19] . Various acoustic features, including cepstral features [20] , [21] , vocal jitter [22] , and entropy [23] were also investigated in the literature. The IEEE Big Data conference held an international competition in Seattle 2018, called FEMH-Challenge, in which voice pathology detection systems from different research groups worldwide are evaluated empirically on the same dataset, which was published by Far Eastern Memorial Hospital (FEMH), Taiwan [24] .…”
Section: Introductionmentioning
confidence: 99%