2023
DOI: 10.1126/sciadv.adh0478
|View full text |Cite
|
Sign up to set email alerts
|

Decoding and synthesizing tonal language speech from brain activity

Abstract: Recent studies have shown that the feasibility of speech brain-computer interfaces (BCIs) as a clinically valid treatment in helping nontonal language patients with communication disorders restore their speech ability. However, tonal language speech BCI is challenging because additional precise control of laryngeal movements to produce lexical tones is required. Thus, the model should emphasize the features from the tonal-related cortex. Here, we designed a modularized multistream neural network that directly … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
13
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 14 publications
(14 citation statements)
references
References 35 publications
1
13
0
Order By: Relevance
“…Tone, a crucial aspect of Mandarin syllables, differentiates both lexical and grammatical meanings through variations in pitch carried by the finals 21,28 . We visualized the pitch changes over time for the four Mandarin tones (Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Tone, a crucial aspect of Mandarin syllables, differentiates both lexical and grammatical meanings through variations in pitch carried by the finals 21,28 . We visualized the pitch changes over time for the four Mandarin tones (Fig.…”
Section: Resultsmentioning
confidence: 99%
“…For example, all the patient articulated the tone of "不" in "不计得失" with a pitch trajectory more similar to tone 2 rather than tone 4 in single syllable form or in other phrases such as "不得不", due to the rule of tone sandhi (Fig 4a Similarly, given only monosyllabic information, the tone decoder would perform suboptimal. To show this, we adopted a baseline monosyllable decoder model previously used in Liu et al 15 . The monosyllable decoder only took in the neural activity aligned to the current syllable utterance and did not consider contextual syllables.…”
Section: Robustness Of the Speech Decoder Under Tonal Variance In Nat...mentioning
confidence: 99%
“…The distinct feature of these languages is the use of pitch to distinguish lexical and grammatical meaning. While prior research has investigated decoding stereotypical instances of lexical tones from neural activity for monosyllabic speech 15 , decoding continuous tonal sentences is still a challenging issue. Unlike the relatively stable acoustic cues in canonical forms, natural speech introduces substantial variability in tone components.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, any adjustments to acoustic predictions must be applied in real time, for example, through parallel phonetic or prosodic predictions. In the latter case, it would be conceivable to apply real-time pitch shifting to create artificial tones [64] or intonation patterns [46] that carry additional meaning or emotion. Alternatively, the continuous estimation of intermediate articulatory features, fed into a physical model of the vocal tract, would likely generate more natural-sounding speech outputs [50].…”
Section: Minimal Feedback Delay Will Improve Sense Of Agencymentioning
confidence: 99%