2022
DOI: 10.1155/2022/7994191
|View full text |Cite
|
Sign up to set email alerts
|

Audio Segmentation Techniques and Applications Based on Deep Learning

Abstract: Audio processing has become an inseparable part of modern applications in domains ranging from health care to speech-controlled devices. In automated audio segmentation, deep learning plays a vital role. In this article, we are discussing audio segmentation based on deep learning. Audio segmentation divides the digital audio signal into a sequence of segments or frames and then classifies these into various classes such as speech recognition, music, or noise. Segmentation plays an important role in audio signa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
0
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 49 publications
0
0
0
Order By: Relevance
“…Machine learning models have also been applied in music classification tasks, where models like CNNs and SVMs are utilized to categorize music into genres, moods, or other attributes based on audio features extracted from the music [11][12][13][14][15][16][17][18]. Furthermore, models such as Deep Belief Networks (DBNs) and Recurrent Neural Networks (RNNs) have been employed in music transcription, where the goal is to convert audio signals into musical notation [19][20][21][22][23][24].…”
Section: Descriptions Of Machine Learning In Musicmentioning
confidence: 99%
“…Machine learning models have also been applied in music classification tasks, where models like CNNs and SVMs are utilized to categorize music into genres, moods, or other attributes based on audio features extracted from the music [11][12][13][14][15][16][17][18]. Furthermore, models such as Deep Belief Networks (DBNs) and Recurrent Neural Networks (RNNs) have been employed in music transcription, where the goal is to convert audio signals into musical notation [19][20][21][22][23][24].…”
Section: Descriptions Of Machine Learning In Musicmentioning
confidence: 99%
“…Audio segmentation is a technique that divides audio signals into a sequence of segments or frames, and each part contains audio information from a speech [36][37][38]. In this study, we present the voice activity detection (VAD) method for segmentation.…”
Section: Segmentationmentioning
confidence: 99%
“…Signal segmentation involves breaking down a continuous speech signal into smaller, more manageable frames. The following are the goals of segmentation: to maintain the speech as a wide-sense stationary (WSS) process for several feature extraction approaches [20], optimize feature extraction and machine learning procedures [27], and identify and isolate the individual speech sounds or words in a continuous stream of speech [28]. Stationarity implies that the statistical properties of the signal remain constant over time.…”
Section: Adaptive Segmentationmentioning
confidence: 99%