2014
DOI: 10.1121/1.4836055
|View full text |Cite
|
Sign up to set email alerts
|

Detection of the closure-burst transitions of stops and affricates in continuous speech using the plosion index

Abstract: Automatic and accurate detection of the closure-burst transition events of stops and affricates serves many applications in speech processing. A temporal measure named the plosion index is proposed to detect such events, which are characterized by an abrupt increase in energy. Using the maxima of the pitch-synchronous normalized cross correlation as an additional temporal feature, a rule-based algorithm is designed that aims at selecting only those events associated with the closure-burst transitions of stops … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 35 publications
(16 citation statements)
references
References 29 publications
0
16
0
Order By: Relevance
“…For detecting the burst onsets of stops we adopt the solution proposed in our earlier work. 10 In this paper, we address the latter problem of detection of the voice onsets.…”
Section: Problem Formulationmentioning
confidence: 99%
See 3 more Smart Citations
“…For detecting the burst onsets of stops we adopt the solution proposed in our earlier work. 10 In this paper, we address the latter problem of detection of the voice onsets.…”
Section: Problem Formulationmentioning
confidence: 99%
“…In our earlier work, 10 the very first instant within a stop burst where the feature plosion index (PI) exceeds a threshold was taken to be the representative closure burst transition (CBT) for that stop. This may correspond to the beginning of the prefrication interval.…”
Section: Reference Instants For the Measurement Of Votmentioning
confidence: 99%
See 2 more Smart Citations
“…Because only the voiced regions are of interest, a voiced/unvoiced (V/UV) classification scheme based on maximum normalized cross correlation is used to retain only the voiced regions, as in Ref. 18. The ILPR is obtained by inverse filtering three successive pitch periods, retaining only the middle period of the output and repeating the process by shifting the analysis frame by one pitch period till the entire voiced speech segment is traversed.…”
Section: Pitch Synchronous Discrete Cosine Transform and The Number Omentioning
confidence: 99%