2021
DOI: 10.1007/978-981-15-8411-4_88
|View full text |Cite
|
Sign up to set email alerts
|

Optimization of MFCC Algorithm for Embedded Voice System

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 3 publications
0
3
0
Order By: Relevance
“…The utilization of an artificial neural network (ANN), a long short-term memory (LSTM), and an XGBoost model is presented, each contributing uniquely to enhancing the accuracy and robustness of the classification framework. Long short-term memory (LSTM): LSTM, a type of recurrent neural network (RNN), excels in sequence-based classification tasks by retaining and utilizing information from past inputs 55 . Its specialized architecture with memory cells allows it to capture intricate dependencies within sequential data, making it a valuable choice for applications such as time series prediction and natural language processing.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The utilization of an artificial neural network (ANN), a long short-term memory (LSTM), and an XGBoost model is presented, each contributing uniquely to enhancing the accuracy and robustness of the classification framework. Long short-term memory (LSTM): LSTM, a type of recurrent neural network (RNN), excels in sequence-based classification tasks by retaining and utilizing information from past inputs 55 . Its specialized architecture with memory cells allows it to capture intricate dependencies within sequential data, making it a valuable choice for applications such as time series prediction and natural language processing.…”
Section: Methodsmentioning
confidence: 99%
“…Long short-term memory (LSTM): LSTM, a type of recurrent neural network (RNN), excels in sequence-based classification tasks by retaining and utilizing information from past inputs 55 . Its specialized architecture with memory cells allows it to capture intricate dependencies within sequential data, making it a valuable choice for applications such as time series prediction and natural language processing.…”
Section: Methodsmentioning
confidence: 99%
“…If the voice signal is smoother and more uniform, the parameters for extracting voice features will be better and the quality of voice processing will be better.Pre-emphasis: The first step before processing the voice signal is to pre-emphasize the voice signal. The speech signal is pre-emphasized because it is affected by oral and nose radiation and glottal excitation, and the high-frequency end of the average power spectrum is attenuated by 6 dB above 800 Hz (Shi and Zhen, 2020). The pre-emphasis process generally uses a 6 dB high-frequency boost pre-emphasis digital filter to boost the high-frequency part of the voice signal.…”
Section: Related Workmentioning
confidence: 99%