2017
DOI: 10.1016/j.procs.2017.05.092
|View full text |Cite
|
Sign up to set email alerts
|

Vocal signal analysis in patients affected by Multiple Sclerosis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 20 publications
(12 citation statements)
references
References 19 publications
0
12
0
Order By: Relevance
“…Some of those are used for assessment of voice quality [ 20 ]. Specifically, we took into account classical features such as duration of the file in seconds, fundamental frequency F0 and its standard deviation, Subharmonics and Harmonics information, Jitter [ 21 ], Shimmer [ 22 ], Formants information, Intensity, Speech Rate, Signal to Noise Ratio, Long-Term average spectrum (LTAS) [ 23 ], Root Mean Square energy (RMS), and some spectral shape information. In particular, the list of the 24 features extracted is: Duration in seconds : it is the length of the voice file in seconds; Mean F0 : it is the mean of the Fundamental frequency F0; STD F0 : it is the standard deviation of the Fundamental frequency F0; Subharmonics to Harmonic ratio (SHR) : amplitude ratio between subharmonics and harmonics according to [ 24 ]; Subharmonics pitch : the fundamental frequency estimate introduced in [ 24 ] for impaired speakers; Local jitter : parameter of frequency variation from cycle to cycle; Absolute jitter (accessed on 10 May 2021) : it is the average absolute difference between consecutive periods in seconds; RAP jitter (accessed on 10 May 2021) : Relative Average Perturbation, the average absolute difference between a period and the average of it and its two neighbours, divided by the average period; Local shimmer : amplitude variation of the sound wave [ 25 ]; F1 mean : it is the first formant (accessed on 10 May 2021); F2 mean : it is the second formant; F3 mean : it is the third formant; F4 mean : it is the fourth formant; Formant dispersion : it is the difference between F4 and F1 divided by 3; Mean intensity (accessed on 10 May 2021) : the mean (in dB) of the intensity values of the frames within a specified time domain; Speech rate : number of syllables divided by file duration in seconds; Signal to Noise ratio (SNR) ; LTAS (accessed on 10 May 2021) : it is the mean of logarithmic power spectral density as a function of frequency, computed over the entire domain frequency (from 0 Hz to 5000 Hz); LTAS slope ; LTAS standard deviation ; RMS energy : root-mean-square of energy; Spectrum centre of gravity (SCG) (accessed on 10 May 2021) : it is the average of frequency over the entire spectrum, weighted by the power spectrum; Spectrum standard deviation (accessed on 10 May 2021) : it is the variance of the frequencies in the spectrum; Band Energy difference (accessed on 10 May 2021) : it is the ratio between the average power over low (between 0 Hz and 500 Hz) and high (between 500 Hz and 4000 Hz) frequency bins in decibel scale; …”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Some of those are used for assessment of voice quality [ 20 ]. Specifically, we took into account classical features such as duration of the file in seconds, fundamental frequency F0 and its standard deviation, Subharmonics and Harmonics information, Jitter [ 21 ], Shimmer [ 22 ], Formants information, Intensity, Speech Rate, Signal to Noise Ratio, Long-Term average spectrum (LTAS) [ 23 ], Root Mean Square energy (RMS), and some spectral shape information. In particular, the list of the 24 features extracted is: Duration in seconds : it is the length of the voice file in seconds; Mean F0 : it is the mean of the Fundamental frequency F0; STD F0 : it is the standard deviation of the Fundamental frequency F0; Subharmonics to Harmonic ratio (SHR) : amplitude ratio between subharmonics and harmonics according to [ 24 ]; Subharmonics pitch : the fundamental frequency estimate introduced in [ 24 ] for impaired speakers; Local jitter : parameter of frequency variation from cycle to cycle; Absolute jitter (accessed on 10 May 2021) : it is the average absolute difference between consecutive periods in seconds; RAP jitter (accessed on 10 May 2021) : Relative Average Perturbation, the average absolute difference between a period and the average of it and its two neighbours, divided by the average period; Local shimmer : amplitude variation of the sound wave [ 25 ]; F1 mean : it is the first formant (accessed on 10 May 2021); F2 mean : it is the second formant; F3 mean : it is the third formant; F4 mean : it is the fourth formant; Formant dispersion : it is the difference between F4 and F1 divided by 3; Mean intensity (accessed on 10 May 2021) : the mean (in dB) of the intensity values of the frames within a specified time domain; Speech rate : number of syllables divided by file duration in seconds; Signal to Noise ratio (SNR) ; LTAS (accessed on 10 May 2021) : it is the mean of logarithmic power spectral density as a function of frequency, computed over the entire domain frequency (from 0 Hz to 5000 Hz); LTAS slope ; LTAS standard deviation ; RMS energy : root-mean-square of energy; Spectrum centre of gravity (SCG) (accessed on 10 May 2021) : it is the average of frequency over the entire spectrum, weighted by the power spectrum; Spectrum standard deviation (accessed on 10 May 2021) : it is the variance of the frequencies in the spectrum; Band Energy difference (accessed on 10 May 2021) : it is the ratio between the average power over low (between 0 Hz and 500 Hz) and high (between 500 Hz and 4000 Hz) frequency bins in decibel scale; …”
Section: Methodsmentioning
confidence: 99%
“…Some of those are used for assessment of voice quality [20]. Specifically, we took into account classical features such as duration of the file in seconds, fundamental frequency F0 and its standard deviation, Subharmonics and Harmonics information, Jitter [21], Shimmer [22], Formants information, Intensity, Speech Rate, Signal to Noise Ratio, Long-Term average spectrum (LTAS) [23], Root Mean Square energy (RMS), and some spectral shape information. In particular, the list of the 24 features extracted is:…”
Section: Second Experimentsmentioning
confidence: 99%
“…Speech abnormalities can be studied by analyzing several parameters obtained from vocal signal processing with the aim to describe the voice in a quantitative way and to identify patterns [37]. Two types of methods have been used to extract the parameters needed to furnish a more detailed analysis for the evaluation and identification of possible correlations between vocal signals and MS disease.…”
Section: Methodsmentioning
confidence: 99%
“…Studies showed a statistically significant correlation between dysphonic symptoms and MS, and the odds for having MS were 2.2 times higher if dysphonic symptoms were present with high jitter and shimmer values as well as high soft phonation index (an indicator of vocal fold adduction; high values correlate with incomplete adduction of the vocal fold [41]) values [42,43]. The objective acoustic analysis of speech seems to be more sensitive for discrimination between affected patients and healthy controls (90% accuracy) than experienced raters (35% accuracy) are, and thus could be used as a biomarker for diagnosis and the monitoring of disease progression [13,35,44,45].…”
Section: Brainstemmentioning
confidence: 99%