RANLP 2017 - Recent Advances in Natural Language Processing Meet Deep Learning 2017
DOI: 10.26615/978-954-452-049-6_058
|View full text |Cite
|
Sign up to set email alerts
|

Gender Prediction for Chinese Social Media Data

Abstract: Social media provides users a platform to publish messages and socialize with others, and microblogs have gained more users than ever in recent years. With such usage, user profiling is a popular task in computational linguistics and text mining. Different approaches have been used to predict users' gender, age, and other information, but most of this work has been done on English and other Western languages. The goal of this project is to predict the gender of users based on their posts on Weibo, a Chinese mi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 15 publications
(12 reference statements)
0
3
0
Order By: Relevance
“…Although the high accuracy reported in the study, the methodology would not be accurate to model text data written in more formal scenarios such as customer reviews, product surveys, opinion posts, and customer service chats, which have a different structure compared to the texts that can be found in social media data. In other study [11], the authors proposed a system to classify the gender of the persons who wrote 100,000 posts from Weibo (Chinese social network similar to Tweeter). The system was based on a Word2Vec model, which achieved an accuracy of 62.9%.…”
Section: Arxiv:210702759v1 [Cscl] 23 Jun 2021mentioning
confidence: 99%
See 1 more Smart Citation
“…Although the high accuracy reported in the study, the methodology would not be accurate to model text data written in more formal scenarios such as customer reviews, product surveys, opinion posts, and customer service chats, which have a different structure compared to the texts that can be found in social media data. In other study [11], the authors proposed a system to classify the gender of the persons who wrote 100,000 posts from Weibo (Chinese social network similar to Tweeter). The system was based on a Word2Vec model, which achieved an accuracy of 62.9%.…”
Section: Arxiv:210702759v1 [Cscl] 23 Jun 2021mentioning
confidence: 99%
“…There are some studies focused on gender classification using Deep Learning (DL) methods. However, when considering texts in Spanish, the number of studies is relatively small [9,11]. In [10], the authors proposed a methodology based on Bidirectional Gated Recurrent Units (GRUs) and an attention mechanism for gender classification in the PAN17 corpus.…”
Section: Arxiv:210702759v1 [Cscl] 23 Jun 2021mentioning
confidence: 99%
“…Existing approaches utilized statistical features [115] and seldom involved background knowledge along with social information. In [116], a dataset from Sina Weibo, which is a counterpart of the micro-blogging platform Twitter, in China, was used to assess their methodology for gender prediction. [117] exploits online behavioral and textual features and choice of vocabulary for each user.…”
Section: Gender Estimation Using Twittermentioning
confidence: 99%