Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-2687
|View full text |Cite
|
Sign up to set email alerts
|

GEV Beamforming Supported by DOA-Based Masks Generated on Pairs of Microphones

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

4
4

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…In future work, additional functionalities will be added to ODAS, including new algorithms that rely on deep learning based methods, as machine learning has became a powerful tool when combined with digital signal processing for sound source localization Chakrabarty and Habets (2019) , speech enhancement Valin (2018) ; Valin et al (2020) and sound source classification Ford et al (2019) ; Grondin et al (2019) . Additional beamforming methods could also be implemented, including Minimum Variance Distortionless Response (MVDR) beamformer Habets et al (2009) , and generalized eigenvalue (GEV) beamforming Heymann et al (2015) ; Grondin et al (2020) , as these approaches are particularly suited for preprocessing before automatic speech recognition. ODAS would also benefit from ego-noise suppression algorithms to mitigate the impact of motor noise while doing sound source localization, tracking and separation.…”
Section: Discussionmentioning
confidence: 99%
“…In future work, additional functionalities will be added to ODAS, including new algorithms that rely on deep learning based methods, as machine learning has became a powerful tool when combined with digital signal processing for sound source localization Chakrabarty and Habets (2019) , speech enhancement Valin (2018) ; Valin et al (2020) and sound source classification Ford et al (2019) ; Grondin et al (2019) . Additional beamforming methods could also be implemented, including Minimum Variance Distortionless Response (MVDR) beamformer Habets et al (2009) , and generalized eigenvalue (GEV) beamforming Heymann et al (2015) ; Grondin et al (2020) , as these approaches are particularly suited for preprocessing before automatic speech recognition. ODAS would also benefit from ego-noise suppression algorithms to mitigate the impact of motor noise while doing sound source localization, tracking and separation.…”
Section: Discussionmentioning
confidence: 99%
“…In future work, additional functionalities will be added to ODAS, including new algorithms that rely on deep learning based methods, as machine learning has became a powerful tool when combined with digital signal processing for sound source localization [33], speech enhancement [34], [35] and sound source classification [36], [37]. Additional beamforming methods could also be implemented, including Minimum Variance Distortionless Response (MVDR) beamformer [38], and generalized eigenvalue (GEV) beamforming [39], [40], as these approaches are particularly suited for preprocessing before automatic speech recognition.…”
Section: Discussionmentioning
confidence: 99%
“…We denote the SCMs for the target speech, the interfering noise and the resulting mixture as R XX [k] ∈ C M ×M , respectively, where M stands for the number of microphones. The SCMs for speech and noise can be obtained using time-frequency masks [56,103,104].…”
Section: A31 Multi-microphone Signal Processingmentioning
confidence: 99%

SpeechBrain: A General-Purpose Speech Toolkit

Ravanelli,
Parcollet,
Plantinga
et al. 2021
Preprint
Self Cite