2013
DOI: 10.1016/j.csl.2012.10.004
|View full text |Cite
|
Sign up to set email alerts
|

The PASCAL CHiME speech separation and recognition challenge

Abstract: Distant microphone speech recognition systems that operate with humanlike robustness remain a distant goal. The key difficulty is that operating in everyday listening conditions entails processing a speech signal that is reverberantly mixed into a noise background composed of multiple competing sound sources. This paper describes a recent speech recognition evaluation that was designed to bring together researchers from multiple communities in order to foster novel approaches to this problem. The task was to i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
129
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
3
3
3

Relationship

1
8

Authors

Journals

citations
Cited by 180 publications
(129 citation statements)
references
References 8 publications
0
129
0
Order By: Relevance
“…Regarding ASR, the variation of the word error rate (WER) as a function of the SNR was studied in several evaluation challenges, e.g., (Hirsch and Pearce, 2000;Barker et al, 2013). The adaptation of DNN acoustic models to specific acoustic conditions has been investigated, e.g., (Seltzer et al, 2013;Karanasou et al, 2014), however it has been evaluated in multi-condition settings rather than actual mismatched conditions.…”
Section: Introductionmentioning
confidence: 99%
“…Regarding ASR, the variation of the word error rate (WER) as a function of the SNR was studied in several evaluation challenges, e.g., (Hirsch and Pearce, 2000;Barker et al, 2013). The adaptation of DNN acoustic models to specific acoustic conditions has been investigated, e.g., (Seltzer et al, 2013;Karanasou et al, 2014), however it has been evaluated in multi-condition settings rather than actual mismatched conditions.…”
Section: Introductionmentioning
confidence: 99%
“…Development of such processing methods is especially important for the applications like "smart home" when microphones are located in several parts of the room and both location and orientation of speaker's head are unknown in advance. In order to promote the development of multi-microphone approaches in robust speech recognition, several international competitions like CHiME (Computational Hearing in Multisource Environments) Challenge [6][7][8][9] have been organized in recent years.…”
Section: State Of the Art In Study Areamentioning
confidence: 99%
“…This step was performed to simulate talking over a smartphone in di↵erent places. Di↵erent types of additive noise were then added, using the CHiME-2013 database [1] for simulating a living room environment (CHiME ), and data collected from the freesound platform 1 to simulate public transport (trains) environments. We collected, in total, 230 minutes of noise, to match the duration of the RECOLA database.…”
Section: Overview and Materialsmentioning
confidence: 99%