2021
DOI: 10.1109/access.2020.3048076

A Simple Speech Production System Based on Formant Estimation of a Tongue Articulatory System Using Human Tongue Orientation

Cited by 5 publications (4 citation statements)
References 36 publications
“…Frequency related features: Pitch (F0): Vocal cords vibration frequency (the fundamental frequency) that exists only in voiced speech (e.g., vowels). Voiced Vocalization (VV) was defined as a vocalization where most of its frames (≥ 60%) [10] were voiced (voicing threshold 0.45). Formants: The resonant frequencies of the vocal tract that shape vowel sounds [36]. The first two formants (F1 and F2) relate to tongue position (vertical and horizontal) and influence vowel quality.…”
Section: Methods; citation type: mentioning
confidence: 99%
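
The Voiced Vocalization rule quoted above (at least 60% of frames voiced, voicing threshold 0.45) can be illustrated with a minimal sketch. It assumes each frame already carries a voicing score between 0 and 1 produced by some pitch tracker, and that a frame counts as voiced when its score is at or above the threshold; the function name, the input format, and the inclusive comparison are illustrative assumptions, not details taken from the citing paper.

```python
from typing import Sequence


def is_voiced_vocalization(
    frame_voicing: Sequence[float],
    voicing_threshold: float = 0.45,   # threshold quoted in the citation statement
    min_voiced_fraction: float = 0.60,  # ">= 60%" of frames, as quoted
) -> bool:
    """Return True if enough frames of a vocalization are voiced.

    frame_voicing holds one hypothetical voicing score per frame (0..1).
    Treating "score >= threshold" as voiced is an assumption of this sketch.
    """
    n_frames = len(frame_voicing)
    if n_frames == 0:
        # An empty vocalization has no voiced frames.
        return False
    n_voiced = sum(1 for score in frame_voicing if score >= voicing_threshold)
    return n_voiced / n_frames >= min_voiced_fraction


if __name__ == "__main__":
    # Four of five frames reach the threshold, so this counts as voiced.
    print(is_voiced_vocalization([0.9, 0.8, 0.5, 0.6, 0.2]))
    # Only two of five frames reach the threshold, so this does not.
    print(is_voiced_vocalization([0.9, 0.1, 0.2, 0.3, 0.5]))
```

Keeping both thresholds as parameters makes it easy to see how the voiced/unvoiced label shifts if a different tracker or a stricter cutoff is used.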
“…Formants: The resonant frequencies of the vocal tract that shape vowel sounds [40]. The first two formants (F1 and F2) relate to tongue position (vertical and horizontal) and influence vowel quality.…”
Section: Frequency Related Features; citation type: mentioning
confidence: 99%
“…Of late, algorithms have been developed to facilitate straightforward communication through synthesized speech for those with total or partial loss of speech. They include electrolarynx [6], sign to speech converter [7], text to speech synthesis [8], silent sound technology [9], vocal cord vibration switches [10], articulatory speech synthesizers [11], brain implants [12], breath to speech [13], and tongue articulatory systems [14]. They are based on inputs captured through hand gestures, text data, lip movements, vocalizations, visual features, brain signals, exhales, and tongue movements, respectively.…”
Section: Introduction; citation type: mentioning
confidence: 99%
“…They are based on inputs captured through hand gestures, text data, lip movements, vocalizations, visual features, brain signals, exhales, and tongue movements respectively. Such models can be useful to interpret speech for tracheostomized patients who have undergone larynx surgery, those who are speech-disabled due to accidents or voice disorders, medical rehabilitation, and robotics [6][7][8][9][10][11][12][13][14]. However, such techniques are used to synthesize the speech, whose voice is chosen either of Google voice or robotic voice or a universal or generated voice database, where the speaker does not sound natural.…”
Section: Introduction; citation type: mentioning
confidence: 99%