2019
DOI: 10.5391/ijfis.2019.19.4.283
|View full text |Cite
|
Sign up to set email alerts
|

Identifying Personality Traits for Indonesian User from Twitter Dataset

Abstract: Social media allows the user to convey their actual self and share their life experiences through numerous ways. This behavior in turn reflects the user's personality. In this paper, we experiment to automatically predict user's personality based on Big Five Personality Trait on Twitter. Our focus is towards Indonesian user. Not only word n-gram, Twitter metadata is also used in a certain combination to determine the feature that will be used to predict the personality. Our research also attempts to find optim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 20 publications
0
6
0
Order By: Relevance
“…At the initiation stage, data collection was carried out to increase the amount of Twitter data that had been collected from previous studies [3,19,27]. Twitter data that has been collected manually will be annotated with the help of a psychological expert to define the personality of each Twitter user.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…At the initiation stage, data collection was carried out to increase the amount of Twitter data that had been collected from previous studies [3,19,27]. Twitter data that has been collected manually will be annotated with the help of a psychological expert to define the personality of each Twitter user.…”
Section: Methodsmentioning
confidence: 99%
“…This is caused by due to small number of dataset used in this study to capture much more contextual information in creating generalized model. Meanwhile, another study using Twitter dataset in Bahasa which were carried out by [3,19,31] used a different algorithm in building the personality prediction model. In feature extraction methods, the researcher was assessing the tendency user choice of words by using n-gram and LIWC.…”
Section: Related Workmentioning
confidence: 99%
“…The details for each phase can be seen in figure 1. For the initial phase, data collected from previous research [9,11,12] has been collected for Twitter dataset. The label for defining the Big Five traits has been annotated by physiological experts.…”
Section: Methodsmentioning
confidence: 99%
“…Their research uses SGD (Stochastic Gradient Descent), XGBoost, and super learner to produce a good ROC-AUC (Receiver Operating Characteristic and Area Under Curve) score. Research conducted by [12] also uses Twitter dataset with more data that consists of tweets from 508 users. This research use word-n-gram and Twitter metadata to process using Random Forest classifier to produce 0,744 f1 scores on average.…”
Section: Related Workmentioning
confidence: 99%
“…In the classification of the Big Five Personality, which was done by Jeremy [11], there is an addition of 4 feature extraction approaches. This research-based on metadata approaches such as the number of followers, following, tweets, favorites, retweets, mentions, quotes, replies, and hashtags.…”
Section: Previous Workmentioning
confidence: 99%