2021
DOI: 10.13164/mendel.2021.2.044
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid Deep Learning Model for Singing Voice Separation

Abstract: Monaural source separation is a challenging issue due to the fact that there is only a single channel available; however, there is an unlimited range of possible solutions. In this paper, a monaural source separation model based hybrid deep learning model, which consists of convolution neural network (CNN), dense neural network (DNN) and recurrent neural network (RNN), will be presented. A trial and error method will be used to optimize the number of layers in the proposed model. Moreover, the effects of the l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…Recently, the deep learning approaches has been widely used to expedite Covid-19 detection and to improve accuracy in biomedical research [19]. Deep learning demonstrated a robust performance in many applications such as, medical image detection [2], data classification, image segmentation and speech recognition [8], etc. Chest radiographs are used to diagnose patients with Covid-19 infections, as the virus primarily affects the lungs.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, the deep learning approaches has been widely used to expedite Covid-19 detection and to improve accuracy in biomedical research [19]. Deep learning demonstrated a robust performance in many applications such as, medical image detection [2], data classification, image segmentation and speech recognition [8], etc. Chest radiographs are used to diagnose patients with Covid-19 infections, as the virus primarily affects the lungs.…”
Section: Related Workmentioning
confidence: 99%
“…Early works for speech separation solely utilized the audio signal [3][4][5][6][7]. The audio only speech separation problem is inherently challenging due to its ambiguity, making it difficult to achieve satisfactory outcomes without additional information, for example, prior knowledge or certain microphone configuration.…”
Section: Introductionmentioning
confidence: 99%
“…The process of distinguishing particular audio sources from a mixture of audio signals using visual indicators as further information is known as audio-visual source separation. This method differs from conventional ones that exclusively rely on the audio stream for source separation [1][2][3][4][5]. In other words, it is a technique that utilizes both auditory and visual information to separate individual sound sources from a mixed audio signal.…”
Section: Introductionmentioning
confidence: 99%