2013
DOI: 10.1121/1.4795143
|View full text |Cite
|
Sign up to set email alerts
|

Autocorrelation-based features for speech representation

Abstract: This study investigates autocorrelation-based features as a potential basis for phonetic and syllabic distinctions. The work comes out of a theory of auditory signal processing based on central monaural autocorrelation and binaural crosscorrelation representations. Correlation-based features are used to predict monaural and binaural perceptual attributes that are important for the architectural acoustic design of concert halls: pitch, timbre, loudness, duration, reverberation-related coloration, sound directio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 9 publications
0
3
0
Order By: Relevance
“…The maximum relation is given by +1, the minimum relation is given by −1, and the absence of any relation is represented by 0. For example, the correlation at some value of lag is less than 1 but greater than 0 depending on the extent of similarity [64]. Therefore, the correlation at zero lag will always be 1 since the signal is repeated undelayed.…”
Section: ) Time Domain Featuresmentioning
confidence: 99%
“…The maximum relation is given by +1, the minimum relation is given by −1, and the absence of any relation is represented by 0. For example, the correlation at some value of lag is less than 1 but greater than 0 depending on the extent of similarity [64]. Therefore, the correlation at zero lag will always be 1 since the signal is repeated undelayed.…”
Section: ) Time Domain Featuresmentioning
confidence: 99%
“…The volume of a sound is one of the most straightforward features of the human auditory system. It has been used in acoustic scene classification [ 95 ], speech and music classification [ 96 ], and speech segmentation.…”
Section: Volumementioning
confidence: 99%
“…Temporal centriod(TC) is the center of gravity of the signal's energy on the time axis, in seconds(s), which reflects the area where the signal's main energy is concentrated [31], and is an important parameter for describing high-transient shock signals. The formula is as follows:…”
Section: ) Time-domain Objective Characteristic Parametersmentioning
confidence: 99%