2014
DOI: 10.1145/2651403
|View full text |Cite
|
Sign up to set email alerts
|

Integrating Social and Auxiliary Semantics for Multifaceted Topic Modeling in Twitter

Abstract: Microblogging platforms, such as Twitter, have already played an important role in recent cultural, social and political events. Discovering latent topics from social streams is therefore important for many downstream applications, such as clustering, classification or recommendation. However, traditional topic models that rely on the bag-of-words assumption are insufficient to uncover the rich semantics and temporal aspects of topics in Twitter. In particular, microblog content is often influenced by external… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
24
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(27 citation statements)
references
References 22 publications
0
24
0
Order By: Relevance
“…semantics, cooccurrences, representative terms) are retrieved and combined with the textual features. Criteria used to select textual tokens to be enriched are named entities (SAIF;HE;ALANI, 2012;ABEL et al, 2012a;GUCKELSBERGER;JANSSEN, 2015;VOSECKY et al, 2014), frequent or relevant terms measured using Term Frequency -Inverse Document Frequency (TF-IDF) (PACKER et al, 2012;PAULHEIM, 2013;VOSECKY et al, 2014), and location/time identification heuristics GUCKELSBERGER;JANSSEN, 2015;VOSECKY et al, 2014). For the contextual enrichment, different techniques and elements can be explored:…”
Section: List Of Figuresmentioning
confidence: 99%
See 3 more Smart Citations
“…semantics, cooccurrences, representative terms) are retrieved and combined with the textual features. Criteria used to select textual tokens to be enriched are named entities (SAIF;HE;ALANI, 2012;ABEL et al, 2012a;GUCKELSBERGER;JANSSEN, 2015;VOSECKY et al, 2014), frequent or relevant terms measured using Term Frequency -Inverse Document Frequency (TF-IDF) (PACKER et al, 2012;PAULHEIM, 2013;VOSECKY et al, 2014), and location/time identification heuristics GUCKELSBERGER;JANSSEN, 2015;VOSECKY et al, 2014). For the contextual enrichment, different techniques and elements can be explored:…”
Section: List Of Figuresmentioning
confidence: 99%
“…• External documents: context is provided by the content extracted from related web docu-ments, which can be identified through Uniform Resource Locators (URL) mentioned in the tweets (VOSECKY et al, 2014), or by selecting specific words in the messages (e.g. named entities, representative terms) to access, for example, related Wikipedia pages (GENC;SAKAMOTO;NICKERSON, 2011;ROSA et al, 2011).…”
Section: List Of Figuresmentioning
confidence: 99%
See 2 more Smart Citations
“…Twitter, Facebook, and microblogs [21], [22], [23] reveals the opinions of public and assessment of this social data [13], [14] is an emerging need in the applications like topics detection [1], [4], product promotion in business [4], political predictions [6], and health recommendations [2], [6]. Rapid urbanization is posing the number of public health-related problems, including accidents and injuries, healthcare disparities, and increasing disease burdens due to changes in lifestyle and nutrition, as well as increased environmental pollution.…”
Section: Introductionmentioning
confidence: 99%