Hybrid Techniques for Identity Authentication

Zhao, Yao; Guo, Rui

doi:10.1109/icmtma.2019.00020

Cited by 4 publications

(1 citation statement)

References 8 publications


“…With the development of automation technology, identity technology is widely used in various applications, such as financial transaction identity verification [1], [2], security access control [3]-[5], human-computer interaction [6], [7]. At the beginning, identification mainly depends on the password authentication, then it has been developed to use biometric identification methods such as face recognition and fingerprint recognition.…”

Section: Introduction (mentioning)

confidence: 99%

Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition

Wang, Wang, et al. 2020

Preprint


Lip motion reflects behavior characteristics of speakers, and thus can be used as a new kind of biometrics in speaker recognition. In the literature, lots of works used two-dimensional (2D) lip images to recognize speaker in a text-dependent context. However, 2D lip easily suffers from various face orientations. To this end, in this work, we present a novel end-to-end 3D lip motion Network (3LMNet) by utilizing the sentence-level 3D lip motion (S3DLM) to recognize speakers in both the text-independent and text-dependent contexts. A new regional feedback module (RFM) is proposed to obtain attentions in different lip regions. Besides, prior knowledge of lip motion is investigated to complement RFM, where landmark-level and frame-level features are merged to form a better feature representation. Moreover, we present two methods, i.e., coordinate transformation and face posture correction to pre-process the LSD-AV dataset, which contains 68 speakers and 146 sentences per speaker. The evaluation results on this dataset demonstrate that our proposed 3LMNet is superior to the baseline models, i.e., LSTM, VGG-16 and ResNet-34, and outperforms the state-of-the-art using 2D lip image as well as the 3D face. The code of this work is released at https://github.com/wutong18/Three-Dimensional-Lip-Motion-Network-for-Text-Independent-Speaker-Recognition.
