2018
DOI: 10.3844/jcssp.2018.485.490
|View full text |Cite
|
Sign up to set email alerts
|

Speech Segmentation Using Dynamic Windows and Thresholds for Arabic and English Languages

Abstract: Segmentation of audio data such as human speech (splitting each word in separate audio file-.WAV file) has been a major concern when working with multimedia such as recordings from radio or TV. The main focus of the segmentation of boundaries of spoken language has been on using energy and zero crossing thresholds for endpoint detection. Errors in endpoint detection are still a main cause of low accuracy of segmentation systems. The goal of this research is to develop an efficient algorithm in order to segment… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…But, both Copyright © 2019 MECS I.J. Intelligent Systems and Applications, 2019, 5, [9][10][11][12][13][14][15][16][17] splitting and assimilation are performed once only, not iterative, so that the splitting cannot split a segment into more than two segments and the assimilation cannot combine more than two segments into a bigger one [9]. A splitting procedure functions to reduce deletion and increase accuracy while the assimilation functions to reduce insertion [3].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…But, both Copyright © 2019 MECS I.J. Intelligent Systems and Applications, 2019, 5, [9][10][11][12][13][14][15][16][17] splitting and assimilation are performed once only, not iterative, so that the splitting cannot split a segment into more than two segments and the assimilation cannot combine more than two segments into a bigger one [9]. A splitting procedure functions to reduce deletion and increase accuracy while the assimilation functions to reduce insertion [3].…”
Section: Related Workmentioning
confidence: 99%
“…There are many methods to generate adaptive thresholds, such as Bayesian Network (BN) [10], Deep Neural Network (DNN) [11], Simulated Annealing (SA) [9], Dynamic Window (DW) [12], and Hidden Markov Model (HMM) [13]. Since the parameter tuning can be seen as an optimization problem, an evolutionary computation such as GA can be applied.…”
Section: Related Workmentioning
confidence: 99%