2009
DOI: 10.1016/j.specom.2008.11.004
|View full text |Cite
|
Sign up to set email alerts
|

Foreign accent conversion in computer assisted pronunciation training

Abstract: Learners of a second language practice their pronunciation by listening to and imitating utterances from native speakers. Recent research has shown that choosing a well-matched native speaker to imitate can have a positive impact on pronunciation training. Here we propose a voicetransformation technique that can be used to generate the (arguably) ideal voice to imitate: the own voice of the learner with a native accent. Our work extends previous research, which suggests that providing learners with prosodicall… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

3
104
1

Year Published

2010
2010
2023
2023

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 117 publications
(108 citation statements)
references
References 47 publications
3
104
1
Order By: Relevance
“…To develop this technique, we need a deep understanding of how to effectively factorize speech acoustics into its individual components such as linguistic, non-linguistic, and para-linguistic information using various technologies, such as speech analysis, speech synthesis, acoustic modeling, and machine learning. Moreover, VC has great potential to develop various applications not only for flexible control of speaker identity of synthetic speech in textto-speech (TTS) [1] but also as a speaking aid for vocally handicapped people such as dysarthric patients [2] and laryngectomees [3], as a voice changer to flexibly generate various types of emotional [4] and expressive speech [5], for vocal effects to produce more varieties of singing voices [6,7], for enhanced mobile speech communication using wideband speech [8] and silent speech [9], accent conversion for computer assisted language learning [10], and so on. Therefore, it is worthwhile to study this technique for both scientific purposes and industrial applications.…”
Section: Introductionmentioning
confidence: 99%
“…To develop this technique, we need a deep understanding of how to effectively factorize speech acoustics into its individual components such as linguistic, non-linguistic, and para-linguistic information using various technologies, such as speech analysis, speech synthesis, acoustic modeling, and machine learning. Moreover, VC has great potential to develop various applications not only for flexible control of speaker identity of synthetic speech in textto-speech (TTS) [1] but also as a speaking aid for vocally handicapped people such as dysarthric patients [2] and laryngectomees [3], as a voice changer to flexibly generate various types of emotional [4] and expressive speech [5], for vocal effects to produce more varieties of singing voices [6,7], for enhanced mobile speech communication using wideband speech [8] and silent speech [9], accent conversion for computer assisted language learning [10], and so on. Therefore, it is worthwhile to study this technique for both scientific purposes and industrial applications.…”
Section: Introductionmentioning
confidence: 99%
“…Finally, Felps, Bortfeld & Osuna (2009) stated that foreign accent can be a deviation from the expected acoustic and prosodic norms of language.…”
Section: Background Of the Studymentioning
confidence: 99%
“…Accent rating-the degree of foreign accent or type of accent of a talker-is a perceptual evaluation task that is relevant to a variety of different tasks within speech technology, e.g., in computer assisted language learning [1,2], for accent conversion [3,4], for accent identification [5,6], to reduce the impact of non-native accents on word error rates in ASR [7,8], and in the context of adverse listening conditions [9]. The study presented here was conducted in the context of an EU project which aimed for personalized speech-to-speech translation such that a user's spoken input in one language was used to produce spoken output in another language, while continuing to sound like the user's voice [10].…”
Section: Introductionmentioning
confidence: 99%