ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9746624
|View full text |Cite
|
Sign up to set email alerts
|

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 25 publications
0
10
0
Order By: Relevance
“…Input Feature Target Multi-source Frame-wise Variable-array (vs. Single-source) (vs. Chunk-wise) (vs. Fixed-array ) [9] 2021 Mag + Phase Spatial spectrum regression [10] 2021 Intensity vector Multi-class location classification [11] 2021 Mag + Phase DP-RTF regression (2-channel) [12] 2021 SRP-PHAT Spectrogram Location regression [13] 2022 SRP-PHAT Spectrogram Location regression [14] 2022 STFT Coefficients Spatial spectrum regression [15] 2022 Mag + IPD Multi-track spatial spectrum regression [16] 2022 Mag + Phase Mixed DP-IPD regression [17] 2022 GCC-PHAT + Array Geometry Location classification (constant-channel) [18] 2023 MFCC and Mel features Multi-class location classification [19] 2023 SRP-PHAT Spectrogram Location regression [20] 2023 STFT Coefficients DP-IPD regression Proposed -STFT Coefficients Multi-track DP-IPD regression array or uses a fixed microphone array. In [17], by also taking as input the microphone array geometry along the localization feature to the network, the network can perform SSL for variable arrays.…”
Section: Methods Yearmentioning
confidence: 99%
See 3 more Smart Citations
“…Input Feature Target Multi-source Frame-wise Variable-array (vs. Single-source) (vs. Chunk-wise) (vs. Fixed-array ) [9] 2021 Mag + Phase Spatial spectrum regression [10] 2021 Intensity vector Multi-class location classification [11] 2021 Mag + Phase DP-RTF regression (2-channel) [12] 2021 SRP-PHAT Spectrogram Location regression [13] 2022 SRP-PHAT Spectrogram Location regression [14] 2022 STFT Coefficients Spatial spectrum regression [15] 2022 Mag + IPD Multi-track spatial spectrum regression [16] 2022 Mag + Phase Mixed DP-IPD regression [17] 2022 GCC-PHAT + Array Geometry Location classification (constant-channel) [18] 2023 MFCC and Mel features Multi-class location classification [19] 2023 SRP-PHAT Spectrogram Location regression [20] 2023 STFT Coefficients DP-IPD regression Proposed -STFT Coefficients Multi-track DP-IPD regression array or uses a fixed microphone array. In [17], by also taking as input the microphone array geometry along the localization feature to the network, the network can perform SSL for variable arrays.…”
Section: Methods Yearmentioning
confidence: 99%
“…Various network architectures have been adopted for SSL, among which convolutional neural networks (CNN) [9], [13], [14], [19] and convolutional recurrent neural Networks [11], [16], [18] (CRNN) are the most commonly used networks. These networks are all designed to process all the frequencies together.…”
Section: Related Work a Deep Learning Based Sound Source Localizationmentioning
confidence: 99%
See 2 more Smart Citations
“…Recent work also explored distributed microphone arrays 1 . However, they did not satisfy the above goals: they were evaluated in simulated or strongly constrained environments [22][23][24][25] , required exact microphone positions [26][27][28][29] , used wired setups to achieve synchronization 26,30,31 , localized only 1-2 speakers [31][32][33][34][35] , or assumed a priori knowledge about the number of speakers [36][37][38] .…”
Section: Speech Separation and 2d Localizationmentioning
confidence: 99%