ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683785
|View full text |Cite
|
Sign up to set email alerts
|

Segmentation, Classification, and Visualization of Orca Calls Using Deep Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…Potentially, it is possible to translate that stream of voltage fluctuations into signs, which could be interpreted by humans using approaches from deep learning based natural language processing such as machine translation, or even from animal linguistics, where e.g. killer whale calls are identified, segmented, extracted and classified from ongoing continuous sound streams according to recurring feature patterns [87][88][89][90]. By that, we are convinced that our approach might further push the progress in neuroscience in order to extract meaningful information from continuous electrophysiological data streams.…”
Section: Discussionmentioning
confidence: 99%
“…Potentially, it is possible to translate that stream of voltage fluctuations into signs, which could be interpreted by humans using approaches from deep learning based natural language processing such as machine translation, or even from animal linguistics, where e.g. killer whale calls are identified, segmented, extracted and classified from ongoing continuous sound streams according to recurring feature patterns [87][88][89][90]. By that, we are convinced that our approach might further push the progress in neuroscience in order to extract meaningful information from continuous electrophysiological data streams.…”
Section: Discussionmentioning
confidence: 99%
“…Motivated by the need for better conservation tools, researchers have begun applying speech recognition algorithms to animal vocalizations [ 14 ]. Schröter et al constructed a new model to recognize different types of killer whale calls [ 15 ]. The accuracy can reach 97%, and successfully visualized the feature map.…”
Section: Introductionmentioning
confidence: 99%
“…In order to use these methods, the one-dimensional audio signal must first be converted to a two-dimensional image-like representation. The most common audio visualization is the spectrogram, which has been used as the input to a variety of audio classification tasks [16] , [41] , [42] , [43] , [44] , [45] , [46] . In this paper, we focus on the application of transfer learning to classify respiratory sound characteristics, building on our previous work [16] .…”
Section: Introductionmentioning
confidence: 99%