2023
DOI: 10.3390/math11020435
|View full text |Cite
|
Sign up to set email alerts
|

Contextual Urdu Lemmatization Using Recurrent Neural Network Models

Abstract: In the field of natural language processing, machine translation is a colossally developing research area that helps humans communicate more effectively by bridging the linguistic gap. In machine translation, normalization and morphological analyses are the first and perhaps the most important modules for information retrieval (IR). To build a morphological analyzer, or to complete the normalization process, it is important to extract the correct root out of different words. Stemming and lemmatization are tech… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 14 publications
0
3
0
Order By: Relevance
“…(Plisson et al 2004;Stanković et al 2016;Nandathilaka, Ahangama, and Weerasuriya 2018)), machine learning-based (e.g. (Kestemont et al 2017;Freihat et al 2018;Manjavacas, Kadar, and Kestemont 2019;Akhmetov et al 2020;Karwatowski and Pietron 2022;Hafeez et al 2023)) or they employ a combination of these methods and represent hybrid models (e.g. (Ingason et al 2008;Sahala et al 2023)).…”
Section: Lemmatizationmentioning
confidence: 99%
See 1 more Smart Citation
“…(Plisson et al 2004;Stanković et al 2016;Nandathilaka, Ahangama, and Weerasuriya 2018)), machine learning-based (e.g. (Kestemont et al 2017;Freihat et al 2018;Manjavacas, Kadar, and Kestemont 2019;Akhmetov et al 2020;Karwatowski and Pietron 2022;Hafeez et al 2023)) or they employ a combination of these methods and represent hybrid models (e.g. (Ingason et al 2008;Sahala et al 2023)).…”
Section: Lemmatizationmentioning
confidence: 99%
“…random forest classification model (Akhmetov et al 2020) and maximum entropy classifier (Freihat et al 2018)), it seems that the researchers have lately been more oriented towards the deep learning-based approaches (e.g. (Kestemont et al 2017; Manjavacas, Kadar, and Kestemont 2019; Ezhilarasi and Maheswari 2021b; Karwatowski and Pietron 2022;Hafeez et al 2023)).…”
Section: Lemmatizationmentioning
confidence: 99%
“…They reported an accuracy of 86.5% for their lemmatizer. Another paper, [22], presented a lemmatization algorithm based on recurrent neural network models for the Urdu language to complete normalization and morphological processes effectively. The proposed model was trained and tested on two datasets, the Urdu Monolingual Corpus (UMC) and the Universal Dependencies Corpus of Urdu (UDU), and outperformed existing models in accuracy, precision, and recall, achieving F-scores of 0.96, 0.95, 0.95, and 0.95, respectively.…”
Section: Related Workmentioning
confidence: 99%