Advanced Techniques in Computing Sciences and Software Engineering 2009
DOI: 10.1007/978-90-481-3660-5_47
|View full text |Cite
|
Sign up to set email alerts
|

Voiced/Unvoiced Decision for Speech Signals Based on Zero-Crossing Rate and Energy

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
66
0
4

Year Published

2011
2011
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 120 publications
(72 citation statements)
references
References 6 publications
2
66
0
4
Order By: Relevance
“…if frame t is speech (8) where t p is the previous noise frame and β is the forgetting factor of value 0 < β < 1.…”
Section: Likelihood Ratio Measurementioning
confidence: 99%
See 1 more Smart Citation
“…if frame t is speech (8) where t p is the previous noise frame and β is the forgetting factor of value 0 < β < 1.…”
Section: Likelihood Ratio Measurementioning
confidence: 99%
“…Many researchers have studied different methods to develop an efficient VAD and most of them are heuristics using different speech parameters, such as, energy [5], [6], [7], zero crossing rate [2], [8], cepstral [9], LPC [10], etc. However, the algorithms based on speech features with heuristic rules have difficulty in coping with real world noises at low SNR conditions.…”
Section: Introductionmentioning
confidence: 99%
“…Indicative examples of time-domain estimators include the zero-crossing-rate (ZCR) [3][4], the measurement of energy level [3] [4], the peak-to-valley difference (PVD) [2] and the autocorrelation (ACORR) [5]. The measurement of the energy level and the ACORR methods again rely on an adaptive threshold for background noises, and tend to fail when the magnitude of the noises approaches or exceeds that of the voiced sounds, even when separated in time.…”
Section: Motivation and Related Workmentioning
confidence: 99%
“…If the number of zero crossings is more in a given signal, then the signal is changing rapidly and accordingly the signal may contain high frequency information which is termed as unvoiced speech. On the other hand, if the number of zero crossing is less, then the signal is changing slowly and accordingly the signal may contain low frequency information which is termed as voiced speech [17]. That's why the Zero Crossing Rate can gives information about the frequency content of the signal, which can be considered as a good indicator about the speaker itself.…”
Section: B Short Time Zero Crossing Rate (Stzcr)mentioning
confidence: 99%