2019
DOI: 10.1109/access.2019.2896453
|View full text |Cite
|
Sign up to set email alerts
|

Native Language Identification in Very Short Utterances Using Bidirectional Long Short-Term Memory Network

Abstract: Native language identification (NLI) is the task of identifying the first language of a user based on their speech or written text in a second language. In this paper, we propose the use of spectrogramand cochleagram-based features extracted from very short speech utterances (0.8 s on average) to infer the native language of an Urdu speaker. The bidirectional long short-term memory (BLSTM) neural networks are adopted for the classification of utterances among the native languages. A set of experiments is carri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 24 publications
(8 citation statements)
references
References 39 publications
0
8
0
Order By: Relevance
“…Table 1 presents a summary of the considered literature review. Initial quarrying experiments were executed in order to uncovered suitability of different deep learning architectures, hence; BLSTM was found to outperform all others [33], [34]. As a consequence in this paper, we propose using the BLSTM architecture for Arabic language identification.…”
Section: Literature Reviewmentioning
confidence: 95%
“…Table 1 presents a summary of the considered literature review. Initial quarrying experiments were executed in order to uncovered suitability of different deep learning architectures, hence; BLSTM was found to outperform all others [33], [34]. As a consequence in this paper, we propose using the BLSTM architecture for Arabic language identification.…”
Section: Literature Reviewmentioning
confidence: 95%
“…The modern endto-end language recognition models based on deep learning (DL) algorithms improves the performance by increasing the data set requirement and do not perform well for small data set. In duration-matched condition, DL algorithms like bidirectional long short term memory (BLSTM) and recurrent neural network (RNN) reported excellent performance for short duration language identification [23]. In duration mismatched condition, the performance of these algorithms degrade.…”
Section: Review Of Related Workmentioning
confidence: 99%
“…Prosody features also play a vital role in recognizing language from speech signal [1,2]. Prosodic features like pitch, energy, stress are different in tonal language compared to non-tonal languages [3,4]. Adeeba and Hussain [3] used BLSTM method to identify the Urdu language with short duration of speech signal.…”
Section: Literature Surveymentioning
confidence: 99%
“…Prosodic features like pitch, energy, stress are different in tonal language compared to non-tonal languages [3,4]. Adeeba and Hussain [3] used BLSTM method to identify the Urdu language with short duration of speech signal. The duration of speech signal is an average 8s.…”
Section: Literature Surveymentioning
confidence: 99%
See 1 more Smart Citation