Proceedings of the 26th ACM Conference on Hypertext &Amp; Social Media - HT '15 2015
DOI: 10.1145/2700171.2791031
|View full text |Cite
|
Sign up to set email alerts
|

An Interactive Method for Inferring Demographic Attributes in Twitter

Abstract: Twitter data offers an unprecedented opportunity to study demographic differences in public opinion across a virtually unlimited range of subjects. Whilst demographic attributes are often implied within user data, they are not always easily identified using computational methods. In this paper, we present a semi-automatic solution that combines automatic classification methods with a user interface designed to enable rapid resolution of ambiguous cases. TweetClass employs a two-step, interactive process to sup… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(12 citation statements)
references
References 16 publications
0
12
0
Order By: Relevance
“…Dredze et al [ 35 ] state that geo-specific data coupled with the public forum nature of social media (which encourages the sharing of detailed information) creates new public health capabilities not previously seen. Simultaneously, advances in demographic extraction techniques and computational linguistics have allowed for a deeper understanding of user demographics [ 37 , 38 ]. In these studies, Beretta and Burger connected age and gender to linguistic patterns (often word usage).…”
Section: Introductionmentioning
confidence: 99%
“…Dredze et al [ 35 ] state that geo-specific data coupled with the public forum nature of social media (which encourages the sharing of detailed information) creates new public health capabilities not previously seen. Simultaneously, advances in demographic extraction techniques and computational linguistics have allowed for a deeper understanding of user demographics [ 37 , 38 ]. In these studies, Beretta and Burger connected age and gender to linguistic patterns (often word usage).…”
Section: Introductionmentioning
confidence: 99%
“…As we discussed earlier, textual information is rich in content. It can leak users' privacy by allowing users' in the textual database to be re-identified [50] and leaking their private attribute information [15,43]. Our focus in this paper is to design an effective text anonymiztion technique for the data publisher to preserve users' privacy by preventing a potential adversary (i.e., malicious data consumer) from breaching privacy of users while maintaining the utility of their textual information for future tasks.…”
Section: Problem Statementmentioning
confidence: 99%
“…Today starting counting calories #myfitnesspal and juicing for dinner 2 This user may not be aware that the sensitive medical condition information can be easily inferred from this post-exposing symptoms of Diabetes. If intact users' textual data is available, a malicious data consumer (or any potential adversary) can easily infer lots of sensitive and private information from text that users' do not explicitly disclose such as vacation plans, medical conditions, age and location [15,31]. Another privacy issue arises when a malicious data consumer attempts to re-identify the identity of an individual in the database by investigating whether a targeted user's textual data is in the database or inferring which record is associated with it.…”
Section: Introductionmentioning
confidence: 99%
“…Many build learning models that are applicable across a variety of different user attributes (Chen et al, 2015a;Volkova et al, 2015;Beretta et al, 2015). Among the attributes, political preferences is a frequent area of research, again relying on features from user tweets, and making use of graph-based algorithms over their friends' attributes (Golbeck and Hansen, 2011;Conover et al, 2011;Wong et al, 2013;Cohen and Ruths, 2013;Volkova et al, 2014).…”
Section: Previous Workmentioning
confidence: 99%