2013 IEEE Workshop on Automatic Speech Recognition and Understanding 2013
DOI: 10.1109/asru.2013.6707723
|View full text |Cite
|
Sign up to set email alerts
|

The second ‘CHiME’ speech separation and recognition challenge: An overview of challenge systems and outcomes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
73
0
2

Year Published

2014
2014
2021
2021

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 66 publications
(76 citation statements)
references
References 6 publications
1
73
0
2
Order By: Relevance
“…The first CHiME Challenge held in 2011 was the first concerted evaluation of ASR systems in a real-world domestic environment involving both reverberation and highly dynamic background noise made up of multiple sound source [50]. The second CHiME Challenge in 2013 was supported by the IEEE AASP, MLSP and SL Technical Committees [51]. The configuration considered by this Challenge was that of speech from a single target speaker being binaurally recorded in a domestic environment involving multisource background noise.…”
Section: Smart Home and Aalmentioning
confidence: 99%
“…The first CHiME Challenge held in 2011 was the first concerted evaluation of ASR systems in a real-world domestic environment involving both reverberation and highly dynamic background noise made up of multiple sound source [50]. The second CHiME Challenge in 2013 was supported by the IEEE AASP, MLSP and SL Technical Committees [51]. The configuration considered by this Challenge was that of speech from a single target speaker being binaurally recorded in a domestic environment involving multisource background noise.…”
Section: Smart Home and Aalmentioning
confidence: 99%
“…In comparing our results against those obtained by the actual participants of the CHiME Challenge [20], ours are among the top two. Note that the CHiME challenge participants employed strategies at the spatial signal, feature and model levels [21] Table 2. WER under different SNRs.…”
Section: Speech Recognitionmentioning
confidence: 99%
“…In the previous two-channel CHiME challenges Vincent et al, 2013a) target enhancement has been achieved using mixed strategies exploiting both spatial and spectral diversity. However, the CHiME-3 scenario, with 5-forward facing microphones, a relatively fixed speaker location and wide, open environments lends itself strongly to multichannel beamforming approaches.…”
Section: Target Enhancementmentioning
confidence: 99%
“…The system is based on the Kaldi DNN-system recipe for Track 2 of the 2nd CHiME challenge Vincent et al, 2013a). Feature vectors are constructed from concatenating 7 frames of 13 dimensional Mel-frequency cepstral coefficients (MFCCs) then compressing to 40 dimensions using LDA with one of 2500 tied tri-phone HMM states as the class.…”
Section: Speech Recognitionmentioning
confidence: 99%