2018
DOI: 10.3390/app8101949
|View full text |Cite
|
Sign up to set email alerts
|

Domestic Cat Sound Classification Using Learned Features from Deep Neural Nets

Abstract: The domestic cat (Feliscatus) is one of the most attractive pets in the world, and it generates mysterious kinds of sound according to its mood and situation. In this paper, we deal with the automatic classification of cat sounds using machine learning. Machine learning approach for the classification requires class labeled data, so our work starts with building a small dataset named CatSound across 10 categories. Along with the original dataset, we increase the amount of data with various audio data augmentat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
53
0
3

Year Published

2019
2019
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 50 publications
(56 citation statements)
references
References 38 publications
0
53
0
3
Order By: Relevance
“… BIRDZ, the control and real-world audio dataset used in [24]. The real-world recordings were  CAT, the cat sound dataset was presented in [28,50]. In the following Table 1 we report the performance obtained by the four data augmentation protocols, comparing them with no augmentation (NoAUG) as baseline.…”
Section: Resultsmentioning
confidence: 99%
“… BIRDZ, the control and real-world audio dataset used in [24]. The real-world recordings were  CAT, the cat sound dataset was presented in [28,50]. In the following Table 1 we report the performance obtained by the four data augmentation protocols, comparing them with no augmentation (NoAUG) as baseline.…”
Section: Resultsmentioning
confidence: 99%
“…Besides, multi-scale feature extraction in frequency level is also important [56]. Intuitively, different frequency ranges dominate different acoustic events, which makes it hard to predict accurate results only with certain frequency range.…”
Section: Region Proposal Methodsmentioning
confidence: 99%
“…The goal of this study is to use a deep learning model to learn relevant distance-related features from the audio signals for classification. To accomplish other deep learning-based audio analysis goals, previous researchers [36,41] have represented the time domain audio signals by mel [33] spectrograms before training. The mel spectrogram is a time-frequency representation of an audio signal that compresses high-frequency components and focuses more on low-frequency components [36].…”
Section: Data Transformationmentioning
confidence: 99%