Prosodic tools for language learning

Delmonte, Rodolfo

doi:10.1007/s10772-010-9065-1

Cited by 13 publications

(8 citation statements)

References 31 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…ASR incorporating neural networks, probabilistic classifiers and decision making schemes, such as GMM, HMM and SVM, is one of the most effective technologies for CAPT design. Yet, some discrete precautions should be taken to apply ASR in a controlled CAPT environment [38]. A case in point is the unsuitability of ASR for automatic evaluation of learner input.…”

Section: Literature Reviewmentioning

confidence: 99%

Speech Processing for Language Learning: A Practical Approach to Computer-Assisted Pronunciation Teaching

et al. 2021

View full text Add to dashboard Cite

This article contributes to the discourse on how contemporary computer and information technology may help in improving foreign language learning not only by supporting better and more flexible workflow and digitizing study materials but also through creating completely new use cases made possible by technological improvements in signal processing algorithms. We discuss an approach and propose a holistic solution to teaching the phonological phenomena which are crucial for correct pronunciation, such as the phonemes; the energy and duration of syllables and pauses, which construct the phrasal rhythm; and the tone movement within an utterance, i.e., the phrasal intonation. The working prototype of StudyIntonation Computer-Assisted Pronunciation Training (CAPT) system is a tool for mobile devices, which offers a set of tasks based on a “listen and repeat” approach and gives the audio-visual feedback in real time. The present work summarizes the efforts taken to enrich the current version of this CAPT tool with two new functions: the phonetic transcription and rhythmic patterns of model and learner speech. Both are designed on a base of a third-party automatic speech recognition (ASR) library Kaldi, which was incorporated inside StudyIntonation signal processing software core. We also examine the scope of automatic speech recognition applicability within the CAPT system workflow and evaluate the Levenstein distance between the transcription made by human experts and that obtained automatically in our code. We developed an algorithm of rhythm reconstruction using acoustic and language ASR models. It is also shown that even having sufficiently correct production of phonemes, the learners do not produce a correct phrasal rhythm and intonation, and therefore, the joint training of sounds, rhythm and intonation within a single learning environment is beneficial. To mitigate the recording imperfections voice activity detection (VAD) is applied to all the speech records processed. The try-outs showed that StudyIntonation can create transcriptions and process rhythmic patterns, but some specific problems with connected speech transcription were detected. The learners feedback in the sense of pronunciation assessment was also updated and a conventional mechanism based on dynamic time warping (DTW) was combined with cross-recurrence quantification analysis (CRQA) approach, which resulted in a better discriminating ability. The CRQA metrics combined with those of DTW were shown to add to the accuracy of learner performance estimation. The major implications for computer-assisted English pronunciation teaching are discussed.

show abstract

Section: Literature Reviewmentioning

confidence: 99%

Speech Processing for Language Learning: A Practical Approach to Computer-Assisted Pronunciation Teaching

et al. 2021

View full text Add to dashboard Cite

show abstract

“…Computer-Assisted Prosody Teaching (CAPT) tools integrated with various speech processing technologies make it possible to obtain pitch plots so that to provide a visual representation of the speech. From a number of research works, we learn that, within L2 learning activities, training enhanced by such a visualization of pitch contours has a positive effect on learner's pronunciation (e. g., [9,10]). In particular, the authors of [11] conducted an interesting study, where they used speech analysis software (Praat) to present a visual display of the Chinese native speaker's pitch curves for learners, then asked learners to record themselves repeating the same words and compare their pitch contours with those of the native speaker.…”

Section: Capt Tools For Language Pronunciation Trainingmentioning

confidence: 99%

Adopting StudyIntonation CAPT Tools to Tonal Languages Through the Example of Vietnamese

et al. 2021

View full text Add to dashboard Cite

In tonal languages, tones are associated with both and phonological and lexical domains. Accurate tone articulation is required in order to convey the correct meaning. Learning tones at both word and phrase levels is often challenging for L2 learners with non-tonal language background, because of possible subtle difference between the close tones. In this paper, we discuss an adoption of StudyIntonation CAPT tools to the case of Vietnamese language being a good example of register tonal language with a complex system of tones comprising such features as tone pitch, its length, contour melody, intensity and phonation. The particular focus of this contribution is to assess the adoption of StudyIntonation course toolkit and its pitch processing and visualization algorithms in order to evaluate how the combined use of audio and visual perception mechanisms supported by StudyIntonation may help learners to improve the accuracy of their pronunciation and intonation with respect to tonal languages.

show abstract

“…i.e. there is no need to refer to G. What (30) shows is that as N grows, the probability of getting previously unseen samples decreases linearly. Furthermore, the probability of the new sample to be equal to θ k is equal to (α 0 + N) −1 N k .…”

Section: Infinite Mixture Models and The Dirichlet Processesmentioning

confidence: 99%

Speech and Language Technologies

Ipšić¹

2011

View full text Add to dashboard Cite

show abstract

Prosodic tools for language learning

Cited by 13 publications

References 31 publications

Speech Processing for Language Learning: A Practical Approach to Computer-Assisted Pronunciation Teaching

Speech Processing for Language Learning: A Practical Approach to Computer-Assisted Pronunciation Teaching

Adopting StudyIntonation CAPT Tools to Tonal Languages Through the Example of Vietnamese

Speech and Language Technologies

Contact Info

Product

Resources

About