2022 International Conference on Electrical and Computing Technologies and Applications (ICECTA) 2022
DOI: 10.1109/icecta57148.2022.9990138
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid CNN-LSTM Speaker Identification Framework for Evaluating the Impact of Face Masks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 21 publications
0
5
0
Order By: Relevance
“…Furthermore, studies have explored the modeling of reverberation using comb filtering, which is utilized to simulate room acoustics and evaluate the performance of speech processing algorithms in reverbration conditions [ 10 ]. Traditional speaker identification systems rely on extracting features from speech signals and matching them with reference models [ 4 ]. MFCCs have been a common choice for feature extraction due to their effectiveness in clean speech conditions.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, studies have explored the modeling of reverberation using comb filtering, which is utilized to simulate room acoustics and evaluate the performance of speech processing algorithms in reverbration conditions [ 10 ]. Traditional speaker identification systems rely on extracting features from speech signals and matching them with reference models [ 4 ]. MFCCs have been a common choice for feature extraction due to their effectiveness in clean speech conditions.…”
Section: Related Workmentioning
confidence: 99%
“…SV finds applications in security contexts. Both SI and SV involve the creation of speaker models to be stored as references [ 4 , 5 ]. The process of SV is also referred to as speaker authentication, wherein the system either accepts or rejects the speaker’s identity claim.…”
Section: Introductionmentioning
confidence: 99%
“…In recent studies, a combination of CNN with variants of RNN has shown better performance in various areas. In the studies [40], [41] a hybrid network of CNN and LSTM models exhibited performance improvement in speaker verification and identification. Another study [42], employed a hybrid network of CNN and BiLSTM for language identification using spectrograms of speech and the results have shown improvement in existing works.…”
Section: Introductionmentioning
confidence: 99%
“…This process significantly influences the precision of facial recognition systems by [8] minimising redundant data, expediting machine learning, and facilitating the development of computer models. However, facial feature extraction within recognition systems faces contemporary challenges, including variations in lighting [9], occlusions [10], and facial expressions [11], such as wearing face masks [12]. Persistent challenges in the field of computer vision and forensic science include lower accuracy rates and the lack of a precise Deep Learning (DL)-based image feature extraction approach for suspect identification [13,14,15,16].…”
Section: Introductionmentioning
confidence: 99%