Speech Prosody 2018 2018
DOI: 10.21437/speechprosody.2018-159
|View full text |Cite
|
Sign up to set email alerts
|

Speech, Prosody, and Machines: Nine Challenges for Prosody Research

Abstract: Speech technology is becoming commonplace. Traditional telephony based interactive voice systems have been joined by virtual assistants and navigation systems to create a broad ecosystem of voice enabled technologies. Prosody is an essential component to human communication, but machines still lag in their ability to understand information communicated prosodically and to produce human-like intonation. This paper poses nine challenges designed to effectively and more thoroughly integrate prosody into current s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 52 publications
0
6
0
Order By: Relevance
“…Nevertheless, building a dataset with large available labeled samples is costly, time-consuming, and laborious work in the automatic personality perception field, which restricts various methods. Therefore, previous studies have used handcrafted features for the DNN input [44].…”
Section: Feature Extractionmentioning
confidence: 99%
“…Nevertheless, building a dataset with large available labeled samples is costly, time-consuming, and laborious work in the automatic personality perception field, which restricts various methods. Therefore, previous studies have used handcrafted features for the DNN input [44].…”
Section: Feature Extractionmentioning
confidence: 99%
“…One remaining challenge with speech-based personality perception is the limitation in the dataset, which has been discussed frequently in academic conferences for a decade [42]. Some reasons behind these limitations are: 1) most of them are not public, 2) some of them are not prosodically annotated, 3) labeling is an expensive process, 4) training annotators is a difficult trend, and 5) at least one psychologist must be employed to supervise the annotating process.…”
Section: Datasetmentioning
confidence: 99%
“…Because the prosodic content of speech must be preserved during transformations. So, those transformations must be examined to ensure if the speaker personality differences maintained [42].…”
Section: B Data Augmentationmentioning
confidence: 99%
“…Segmentation can also put less demands on working memory which later benefits the reanalysis process. Syntactic parsing is closely related to prosodic phrasing in the way that readers' ability to phrase words while applying some prosodic contour or intonation to the sentence, even in silent reading (Breen, 2014;Fodor, 2002;Frazier & Gibson, 2015), develops along with the automaticity of syntactic parsing. Reading aloud for beginner and intermediate learners to prime the correct segmentation from prosody can help learners see the pattern easier and develop their own parsing ability.…”
Section: Putting Things Together: Implications For L2 Reading Pedagogymentioning
confidence: 99%