2019
DOI: 10.1016/j.csl.2018.07.001
|View full text |Cite
|
Sign up to set email alerts
|

Prosodic boundary detection using syntactic and acoustic information

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 15 publications
0
5
0
1
Order By: Relevance
“…(KLIMKOV et. al., 2017); (1) Classificador Random Forest, usado para detectar os limites prosódicos reais usando um pequeno conjunto de recursos acústicos (KOCHAROV et. al., 2019); (1) SSML -Speech Synthesis Markup Language, que atribui uma variedade de tags de prosódia SSML com base na estrutura de temática de cada frase (DOMINGUEZ et.…”
Section: Discussionunclassified
“…(KLIMKOV et. al., 2017); (1) Classificador Random Forest, usado para detectar os limites prosódicos reais usando um pequeno conjunto de recursos acústicos (KOCHAROV et. al., 2019); (1) SSML -Speech Synthesis Markup Language, que atribui uma variedade de tags de prosódia SSML com base na estrutura de temática de cada frase (DOMINGUEZ et.…”
Section: Discussionunclassified
“…Many years of experience in collecting, processing and analysing speech material have enabled us to create speech corpora of all kinds that can serve as the basis for a wide range of fundamental and applied research. The fully annotated large corpus of read speech CORPRES laid the foundation for a lot of research projects including automatic prosodic boundary detection (Kocharov et al, 2019a), research on vowel reduction (Kocharov et al, 2019b) and phrase-final lengthening (Kachkovskaia et al, 2013), melodic declination (Kocharov et al, 2015), the melody of post-nucleus and others.…”
Section: Speech Corpora In Phonetic Researchmentioning
confidence: 99%
“…Systems for audio-based prosodic boundary detection have traditionally utilized combinations of different features such as the duration of pauses and syllables, F 0 range and resets, intensity, or pitch movement [4,9,11,14,15]. Rather than use such handcrafted combinations of features, however, we chose to employ learned representations from raw audio data.…”
Section: Model For Audio-based Detectionmentioning
confidence: 99%
“…There are two different scenarios for the automatic detection of prosodic boundaries: detection solely from text, most often for the purposes of speech synthesis [7,13,18,20,24], or detection from spoken utterances as a form of audio annotation. In the latter case, some approaches have been based solely on acoustic information (though sometimes with word or syllable boundaries derived from text transcripts) [11,[14][15][16], while others have combined both lexical and acoustic information [4,8,9].…”
Section: Introductionmentioning
confidence: 99%