2019
DOI: 10.1504/ijbm.2019.096565

Bone- and air-conduction speech combination method for speaker recognition

Abstract: In this paper, first, we report speaker recognition performance using bone-conduction speech based on an i-vector-based speaker recognition system, which is the current state-of-the-art method. In addition, we propose three speaker recognition methods combining bone-conduction speech and air-conduction speech: a feature combination method, a speaker model combination method, and a similarity score combination method. To evaluate the proposed methods, we conducted speaker recognition experiments using a part of…

Cited by 7 publications (4 citation statements)
References 15 publications
“…In order to increase the performance of speaker recognition in adverse conditions such as noise, multimodality has been explored by numerous researchers. Alternate supplementary information such as lip reading [28], speech recorded with non-invasive sensors like throat microphone [29] and bone conduction microphone [30] have been shown to provide large gains in performance in adverse noisy conditions. Information about the speaker is present in all audio modes of speech, whether conducted through air, bone, or skin.…”
Section: Multimodal Systems For Speaker Modeling (mentioning)
confidence: 99%
“…A late integration with standard microphone signals resulted in improved performance of 95.8% accuracy. Other researchers such as [33][34][35][36] too have explored throat microphone, bone conduction microphone GEMS EGG, and non-audible murmur microphone signals' combination with standard speech for improving the speaker modeling. Linear features such as LPCC, MFCC, and i-vectors were used in all these works.…”
Section: Multimodal Systems For Speaker Modeling (mentioning)
confidence: 99%
“…In terms of modality for identification, the multimodal approach has been recently applied to SID problem to enhance the SID system's robustness, in which besides air-conducted speech, the complementary sources such as throat microphone [22], bone conduction microphone [23,24], microphone array [25,26], and video [26] are added to the SID system to further improve the system's accuracy.…”
Section: Related Work (mentioning)
confidence: 99%
“…Pitch detection for BC speech is discussed in [2]. In [3], BC speech was utilized with AC speech for speaker recognition. Speaker verification is also described in [4].…”
Section: Introduction (mentioning)
confidence: 99%