2023
DOI: 10.1109/access.2023.3240181
|View full text |Cite
|
Sign up to set email alerts
|

A Pitch Estimation Algorithm for Speech in Complex Noise Environments Based on the Radon Transform

Abstract: The pitch period as an essential feature is used in various speech-related works. Most actual projects collect speech signals in complex noise environments. Thus, the noise resistance of the algorithm for accurate pitch estimation has become more critical than ever. However, many state-of-the-art algorithms fail to obtain good results when dealing with noisy speech files at a low signal-to-noise ratio (SNR) value. This study presents a new noise-resistant pitch estimation algorithm based on the Radon transform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 39 publications
0
2
0
Order By: Relevance
“…In recent years, numerous strategies have been developed to enhance the effectiveness of overcoming and mitigating the impacts of background noise. [27] employs the Radon transform and proposes an innovative approach of pitch estimation for speech in challenging noise environments, integrating the Viterbi algorithm to smooth pitch patterns and mitigating the impact of formants using both logarithmic and power functions. [28] relies on introducing a practical connection between the fundamental frequency (F 0 ) and the instantaneous frequency (F i ).…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, numerous strategies have been developed to enhance the effectiveness of overcoming and mitigating the impacts of background noise. [27] employs the Radon transform and proposes an innovative approach of pitch estimation for speech in challenging noise environments, integrating the Viterbi algorithm to smooth pitch patterns and mitigating the impact of formants using both logarithmic and power functions. [28] relies on introducing a practical connection between the fundamental frequency (F 0 ) and the instantaneous frequency (F i ).…”
Section: Introductionmentioning
confidence: 99%
“…The key step is obtaining the required geometric information through feature extraction and matching by searching for image pairs with overlapping scenes in 2D image sequences obtained from different views. Methods such as orthogonal polynomials (OPs) [10], [11], Krawtchouk moments [12], discrete tchebichef polynomials [13], [14] and Hahn polynomials [15], [16], [17] can extract image features. According to the obtained geometric information and environmental parameters, the disparity (target depth) is obtained for the 3D reconstruction [18], [19].…”
mentioning
confidence: 99%