2018
DOI: 10.1121/1.5039750
|View full text |Cite
|
Sign up to set email alerts
|

Reconstruction of articulatory movements during neutral speech from those during whispered speech

Abstract: A transformation function (TF) that reconstructs neutral speech articulatory trajectories (NATs) from whispered speech articulatory trajectories (WATs) is investigated, such that the dynamic time warped (DTW) distance between the transformed whispered and the original neutral articulatory movements is minimized. Three candidate TFs are considered: an affine function with a diagonal matrix ( A) which reconstructs one NAT from the corresponding WAT, an affine function with a full matrix ( A) and a deep neural ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 31 publications
0
2
0
Order By: Relevance
“…Data obtained through these sensor positions allow to estimate variations in lip aperture or lip protrusion that are phonetically relevant (e.g., production of bilabial stops as compared to fricatives, or between rounded and unrounded vowels). In some cases, such as when a study focuses on lip movements specifically, more lip sensors are attached, namely at the right and/or left lip corners (e.g., Meenakshi & Ghosh, 2018;Rong et al, 2012;Cler, Lee, Mittelman, Stepp, & Bohland, 2017).…”
Section: Use and Positioningmentioning
confidence: 99%
“…Data obtained through these sensor positions allow to estimate variations in lip aperture or lip protrusion that are phonetically relevant (e.g., production of bilabial stops as compared to fricatives, or between rounded and unrounded vowels). In some cases, such as when a study focuses on lip movements specifically, more lip sensors are attached, namely at the right and/or left lip corners (e.g., Meenakshi & Ghosh, 2018;Rong et al, 2012;Cler, Lee, Mittelman, Stepp, & Bohland, 2017).…”
Section: Use and Positioningmentioning
confidence: 99%
“…Motivated by the methodology proposed in [29], we consider a set of M training utterances for N, F and S speaking rates, each comprising articulatory movement data from K articulators. Let R stand for either F or S rate movements.…”
Section: Learning the Transformation Functionsmentioning
confidence: 99%