2022
DOI: 10.3389/fnins.2022.1004071
|View full text |Cite
|
Sign up to set email alerts
|

A biologically oriented algorithm for spatial sound segregation

Abstract: Listening in an acoustically cluttered scene remains a difficult task for both machines and hearing-impaired listeners. Normal-hearing listeners accomplish this task with relative ease by segregating the scene into its constituent sound sources, then selecting and attending to a target source. An assistive listening device that mimics the biological mechanisms underlying this behavior may provide an effective solution for those with difficulty listening in acoustically cluttered environments (e.g., a cocktail … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

3
0

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 62 publications
0
9
0
Order By: Relevance
“…29 STOI ranges between 0 and 1 and can be used to predict speech intelligibility when combined with an appropriate mapping function. The time-constant of the alpha function kernel (τ h ), and the scaling factor for DiffMask ( a ) (see Chou et al 14 for details) were chosen by iteratively trying a range of values, quantifying algorithm performance using STOI, then choosing parameter values that produced the highest average STOI. For RatioMask, the beta (ß) parameter value of 1.65 was selected using the genetic algorithm (GA) function in the MATLAB Global Optimization Toolbox with “fitness” defined as the average STOI value.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…29 STOI ranges between 0 and 1 and can be used to predict speech intelligibility when combined with an appropriate mapping function. The time-constant of the alpha function kernel (τ h ), and the scaling factor for DiffMask ( a ) (see Chou et al 14 for details) were chosen by iteratively trying a range of values, quantifying algorithm performance using STOI, then choosing parameter values that produced the highest average STOI. For RatioMask, the beta (ß) parameter value of 1.65 was selected using the genetic algorithm (GA) function in the MATLAB Global Optimization Toolbox with “fitness” defined as the average STOI value.…”
Section: Methodsmentioning
confidence: 99%
“…The subtractive term acts as a mechanism for sharpening the spatial tuning of output neurons as demonstrated in a previous publication. 14 The scale factor a was adjusted to normalize the mask to a maximum value of 1. In contrast to DiffMask, which employs a subtractive operation, RatioMask implements a multiplicative operation to sharpen the spatial tuning of output neurons in the presence of competing noise.…”
Section: Bossamentioning
confidence: 99%
See 1 more Smart Citation
“…CSA can be broken down into several distinct components: determining the spatial location of a target stimulus, segregating the target stimulus from the competing stimuli in the scene, and reconstructing the target stimulus from the mixture forming a perceptual object (Bizley, Maddox and Lee, 2016). Recently, we proposed a brain-inspired algorithm for auditory scene analysis based on a model of cortical neurons (Maddox et al, 2012; Dong, Colburn and Sen, 2016; Chou et al, 2022). This algorithm has the potential to be applied in assistive devices for CSA.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, we proposed a brain-inspired algorithm for auditory scene analysis based on a model of cortical neurons (Maddox et al, 2012 ; Dong et al, 2016 ; Chou et al, 2022 ). This algorithm has the potential to be applied in assistive devices for CSA.…”
Section: Introductionmentioning
confidence: 99%