2021
DOI: 10.1371/journal.pone.0259140
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting deep neural network and long short-term memory method-ologies in bioacoustic classification of LPC-based features

Abstract: The research describes the recognition and classification of the acoustic characteristics of amphibians using deep learning of deep neural network (DNN) and long short-term memory (LSTM) for biological applications. First, original data is collected from 32 species of frogs and 3 species of toads commonly found in Taiwan. Secondly, two digital filtering algorithms, linear predictive coding (LPC) and Mel-frequency cepstral coefficient (MFCC), are respectively used to collect amphibian bioacoustic features and c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 52 publications
0
5
0
Order By: Relevance
“…𝑥 (𝑘) is the prediction sample, and 𝛼 is determined by 𝑒(𝑘) to minimize the mean square error (MSE). The equation is: Instead of using the MEMS microphone array, addressing these problems requires the use of appropriate filtering methods, and methods such as the Mel-scale frequency cepstral coefficient (MFCC) [17,18], Fast Fourier transform (FFT) [19,20], order-tracking technology [16] and wavelet transform [21] have been applied to machinery fault diagnosis. Linear prediction coefficients (LPC) have been applied to many modern speech processing systems for applications including coding, synthesis, analysis and recognition [22]; the initial model is constructed using historical data, and new data testing and verification can be used to predict the associated outcomes of audio signal data.…”
Section: Linear Predictive Coding Methodsmentioning
confidence: 99%
“…𝑥 (𝑘) is the prediction sample, and 𝛼 is determined by 𝑒(𝑘) to minimize the mean square error (MSE). The equation is: Instead of using the MEMS microphone array, addressing these problems requires the use of appropriate filtering methods, and methods such as the Mel-scale frequency cepstral coefficient (MFCC) [17,18], Fast Fourier transform (FFT) [19,20], order-tracking technology [16] and wavelet transform [21] have been applied to machinery fault diagnosis. Linear prediction coefficients (LPC) have been applied to many modern speech processing systems for applications including coding, synthesis, analysis and recognition [22]; the initial model is constructed using historical data, and new data testing and verification can be used to predict the associated outcomes of audio signal data.…”
Section: Linear Predictive Coding Methodsmentioning
confidence: 99%
“…The results showed that the method in [59] could accurately detect the lane-changing intention of a side car, so as to realize the advanced braking of the target vehicle when the side car changed lanes. The authors of [60] collected sound signals of vehicle tires in 12 different situations, and used SVM and MLP (multilayer perceptron) to classify and identify fault conditions. The main experimental procedure of [61] was divided into two parts.…”
Section: Svm (Support Vector Machine)mentioning
confidence: 99%
“…If the failure factor of the rotating element is not detected early, it can be a great threat to driving safety. As far as the field of acoustics research is concerned, the application of voiceprint recognition in automobile fault prediction is a very novel technology, but there are still many technical obstacles to overcome [60], such as complex sound field feature filtering, voiceprint data collection, digital filtering algorithms, and the selection of machine learning training models. Acoustic sensors are used to measure abnormal vehicle signals and thereby determine the abnormal characteristics of vehicle parts.…”
Section: Overview Of Vehicle Transmission Acoustic Signal Processingmentioning
confidence: 99%
See 1 more Smart Citation
“…Feature dimensionality reduction can enhance the performance of radiomics models [155]. Feature selection and dimensionality reduction serve a common purpose: addressing high dimensionality in data processing, where the number of features in a sample tends to increase linearly with the amount of data to be processed.…”
mentioning
confidence: 99%