2015
DOI: 10.3389/fnins.2015.00309
|View full text |Cite
|
Sign up to set email alerts
|

Sound stream segregation: a neuromorphic approach to solve the “cocktail party problem” in real-time

Abstract: The human auditory system has the ability to segregate complex auditory scenes into a foreground component and a background, allowing us to listen to specific speech sounds from a mixture of sounds. Selective attention plays a crucial role in this process, colloquially known as the “cocktail party effect.” It has not been possible to build a machine that can emulate this human ability in real-time. Here, we have developed a framework for the implementation of a neuromorphic sound segregation algorithm in a Fie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 46 publications
0
15
0
Order By: Relevance
“…Computational models of auditory scene analysis vary in their fundamental goals; while some attempt to address the complexity that the auditory system faces when processing realistic sounds (such as speech; Nix and Hohmann, 2007; Elhilali and Shamma, 2008; Krishnan et al, 2014; Thakur et al, 2015) in natural environments, others (Wang and Chang, 2008; Boes et al, 2011; Mill et al, 2013; Barniv and Nelken, 2015; Rankin et al, 2015) are built in order to test the potential of some algorithm to simulate specific behavioral and/or neurophysiological experiments. For example, Wang and Chang (2008) measure the fitness of their model based on its ability to reproduce the fission and temporal coherence boundaries reported by van Noorden (1975).…”
Section: Modeling Auditory Scene Analysismentioning
confidence: 99%
See 3 more Smart Citations
“…Computational models of auditory scene analysis vary in their fundamental goals; while some attempt to address the complexity that the auditory system faces when processing realistic sounds (such as speech; Nix and Hohmann, 2007; Elhilali and Shamma, 2008; Krishnan et al, 2014; Thakur et al, 2015) in natural environments, others (Wang and Chang, 2008; Boes et al, 2011; Mill et al, 2013; Barniv and Nelken, 2015; Rankin et al, 2015) are built in order to test the potential of some algorithm to simulate specific behavioral and/or neurophysiological experiments. For example, Wang and Chang (2008) measure the fitness of their model based on its ability to reproduce the fission and temporal coherence boundaries reported by van Noorden (1975).…”
Section: Modeling Auditory Scene Analysismentioning
confidence: 99%
“…This representation is used in all of the models in this section, although the dimensions used can vary among them. The neuromorphic model of Thakur et al (2015) is a computationally simplified version of Krishnan et al's (2014) model aiming only to segregate foreground and background, but in real-time. It introduces a formulation of selective attention, which works as an a-priori defined mask on the stimulus representation, selecting a subset of the coincidence matrices for computation.…”
Section: Modeling Auditory Scene Analysismentioning
confidence: 99%
See 2 more Smart Citations
“…Stream segregation depends critically on the distinctiveness of spectro-temporal features of different speakers (Bronkhorst, 2015; Elhilali et al, 2009; Moore and Gockel, 2002; Shamma et al, 2011; Thakur et al, 2015). For example, it is more difficult to segregate two recordings spoken by the same voice vs. those spoken by different speakers (Brungart et al 2001; Ericson et al 2004), and segregation of temporally asynchronous utterances is easier than that of synchronous speech (Humes et al, 2017).…”
Section: Introductionmentioning
confidence: 99%