“…Nonetheless, challenges persist, particularly in representing all languages accurately within tokenizer frameworks, owing to variations in character sets and other linguistic factors. Commonly applied preprocessing techniques, and the studies that use them, include:

- Data cleaning: [35], [49], [53], [58], [62], [65], [67], [68], [74], [75], [76], [77], [80], [81], [82], [83], [88], [89], [92], [94], [95], [96], [100], [103], [104], [106], [107], [108], [109], [110], [111], [113], [114], [115], [116], [117]
- Stemming: [51], [67], [104], [108], …”
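To make the stemming step concrete, the following is a minimal illustrative sketch of naive suffix-stripping in Python. It is not Porter's algorithm or any specific stemmer from the cited studies; the function name and suffix list are assumptions chosen for illustration, and it shows why stemming is language-dependent (the suffixes are English-specific, echoing the tokenizer-coverage challenge above).

```python
def simple_stem(word: str) -> str:
    """Naive English suffix-stripping stemmer (illustrative only).

    Strips the first matching suffix, provided a stem of at least
    three characters remains; real stemmers (e.g. Porter) apply
    ordered, condition-guarded rewrite rules instead.
    """
    for suffix in ("ing", "ly", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


# Naive stripping conflates or mangles forms a real stemmer handles:
print(simple_stem("running"))  # "runn" (Porter would yield "run")
print(simple_stem("cats"))     # "cat"
```

A rule set like this works only for one language's morphology, which is one reason the preprocessing pipelines in the cited studies vary so widely across languages.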