2016
DOI: 10.3390/app6050143
|View full text |Cite
|
Sign up to set email alerts
|

A Review of Physical and Perceptual Feature Extraction Techniques for Speech, Music and Environmental Sounds

Abstract: Endowing machines with sensing capabilities similar to those of humans is a prevalent quest in engineering and computer science. In the pursuit of making computers sense their surroundings, a huge effort has been conducted to allow machines and computers to acquire, process, analyze and understand their environment in a human-like way. Focusing on the sense of hearing, the ability of computers to sense their acoustic environment as humans do goes by the name of machine hearing. To achieve this ambitious aim, t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
100
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 172 publications
(100 citation statements)
references
References 196 publications
(240 reference statements)
0
100
0
Order By: Relevance
“…The database was then used to evaluate a sound event classifier, which obtained better accuracies with foreground sound events than those perceived in the background. Hence, labelling the acoustic salience of audio events could permit more detailed sensitivity analyses of machine hearing approaches [33].…”
Section: Salience Of Environmental Acoustic Eventsmentioning
confidence: 99%
See 1 more Smart Citation
“…The database was then used to evaluate a sound event classifier, which obtained better accuracies with foreground sound events than those perceived in the background. Hence, labelling the acoustic salience of audio events could permit more detailed sensitivity analyses of machine hearing approaches [33].…”
Section: Salience Of Environmental Acoustic Eventsmentioning
confidence: 99%
“…This is of special importance when it comes to delimiting the boundaries of each sound event, i.e., determining the start and end points of each sound event in the mixed audio data [15,17] and the event/background ratio salience [24][25][26]. In the context of environmental sounds, it is important to highlight that acoustic events are usually disconnected from one another, which contrasts with speech or music where a strongly interconnected temporal structure of basic units is present (phonemes and notes, respectively) [33].…”
Section: Salience Of Environmental Acoustic Eventsmentioning
confidence: 99%
“…After extracting the features from the raw dataset, such features contain important information that is used by the learning algorithms for the activities discrimination. The most common methods of feature extraction work in time, frequency, and discrete domains [192]. Among time domain method, mean and standard deviation are the key approaches for almost all sensor types.…”
Section: ) Dimensionality Reductionmentioning
confidence: 99%
“…For an up-to-date review of feature extraction techniques in music we refer the reader to the study by Al ıas et al (2016).…”
Section: Feature Extractionmentioning
confidence: 99%