2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2017
DOI: 10.1109/waspaa.2017.8170010
|View full text |Cite
|
Sign up to set email alerts
|

Broadband doa estimation using convolutional neural networks trained with noise signals

Abstract: A convolution neural network (CNN) based classification method for broadband DOA estimation is proposed, where the phase component of the short-time Fourier transform coefficients of the received microphone signals are directly fed into the CNN and the features required for DOA estimation are learned during training. Since only the phase component of the input is used, the CNN can be trained with synthesized noise signals, thereby making the preparation of the training data set easier compared to using speech … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
198
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 228 publications
(211 citation statements)
references
References 11 publications
1
198
0
Order By: Relevance
“…We refer to this spectrum as the reverberated spectrum identifier. The reverberated speech t R c is corrupted by late reverberation which is known to have a detrimental effect on single-speaker DOA estimation [17,22]. Therefore, we also consider the spectrum |T E | of the signal containing only the direct component and the early reflections of the target signal as an identifier and call it the early spectrum identifier.…”
Section: Spectrum-based Identifiersmentioning
confidence: 99%
See 2 more Smart Citations
“…We refer to this spectrum as the reverberated spectrum identifier. The reverberated speech t R c is corrupted by late reverberation which is known to have a detrimental effect on single-speaker DOA estimation [17,22]. Therefore, we also consider the spectrum |T E | of the signal containing only the direct component and the early reflections of the target signal as an identifier and call it the early spectrum identifier.…”
Section: Spectrum-based Identifiersmentioning
confidence: 99%
“…To improve the robustness of DOA estimation, deep neural networks (DNNs) have been proposed to learn a mapping between signal features and a discretized DOA space [17][18][19][20][21]. Various features such as phasemaps [17,18] and GCC-PHAT [21] have been used as inputs.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In this paper, we want to exploit the capabilities of both QNNs and Ambisonics to analyze 3D sounds, and in particular we focus on the localization and detection of 3D sound events. Both tasks have been widely investigated recently by using convolutional neural networks (CNNs) [19][20][21][22][23][24][25]. They are also considered as a joint task in [26] for 3D sounds, but considering each microphone signal as a separate real-valued signal.…”
Section: Introductionmentioning
confidence: 99%
“…Other neural network structures, such as CNNs, are neural networks designed to process data in the form of multiple arrays (such as images with three colours channels) and contain convolutional and pooling layers [5]. CNNs have been used to estimate the DOA for speech separation in [12] and trained using synthesized noise signals, but recorded with a four-microphones array. In this paper, we present a system that is able to perform source localization and source separation.…”
Section: Introductionmentioning
confidence: 99%