2016
DOI: 10.14569/ijacsa.2016.070746
|View full text |Cite
|
Sign up to set email alerts
|

Investigating the Use of Machine Learning Algorithms in Detecting Gender of the Arabic Tweet Author

Abstract: Abstract-Twitter is one of the most popular social network sites on the Internet to share opinions and knowledge extensively. Many advertisers use these Tweets to collect some features and attributes of Tweeters to target specific groups of highly engaged people. Gender detection is a sub-field of sentiment analysis for extracting and predicting the gender of a Tweet author. In this paper, we aim to investigate the gender of Tweet authors using different classification mining techniques on Arabic language, suc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(6 citation statements)
references
References 7 publications
0
6
0
Order By: Relevance
“…In a subsequent study, Deitrick et al [21] showed that exploiting feature selection methods improves the results substantially on n-gram features. Several studies [5,6,37,39,42,47,59,61] also presented models for author gender detection of writers of the social media in different languages.…”
Section: Related Workmentioning
confidence: 99%
“…In a subsequent study, Deitrick et al [21] showed that exploiting feature selection methods improves the results substantially on n-gram features. Several studies [5,6,37,39,42,47,59,61] also presented models for author gender detection of writers of the social media in different languages.…”
Section: Related Workmentioning
confidence: 99%
“…There are several challenges that hinder the development of tools for Twitter data analytics in the Arabic language, the greatest being the complexity of the language itself. Research on Twitter data analytics in Arabic has begun to appear in recent years in various application domains (detecting authors' genders [12], detecting traffic related events [18,20,38], finding restaurants' reputations [13]) but the progress has been slow. Moreover, some works are available in Modern Standard Arabic (MSA), but in general (not specific to healthcare), the works on Arabic dialects are very limited in number and scope [10,14].…”
Section: Research Gapmentioning
confidence: 99%
“…Subsequently, the authors in [2] extend their work by experimenting with different machine learning algorithms, data subsets and feature selection methods, reporting accuracies up to 94%. The authors in [1] manually annotate tweets from Jordanian dialects with gender information. They show how the name of the author of the tweet can significantly improve the performance.…”
Section: Age Gender and Language Variety Identification In Arabicmentioning
confidence: 99%