2022
DOI: 10.1007/s10462-022-10254-w
|View full text |Cite
|
Sign up to set email alerts
|

Short text topic modelling approaches in the context of big data: taxonomy, survey, and analysis

Abstract: Social media platforms such as (Twitter, Facebook, and Weibo) are being increasingly embraced by individuals, groups, and organizations as a valuable source of information. This social media generated information comes in the form of tweets or posts, and normally characterized as short text, huge, sparse, and low density. Since many real-world applications need semantic interpretation of such short texts, research in Short Text Topic Modeling (STTM) has recently gained a lot of interest to reveal unique and co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
16
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 38 publications
(16 citation statements)
references
References 262 publications
0
16
0
Order By: Relevance
“…On the other hand, topic extraction, or topic modeling, is defned as the technique used to infer conceptual topics hidden in a set of documents, or corpus [22,24], where there is no explicit taxonomic scheme to project onto a corpus or when such projection (or labeling) is costly [25]. In another defnition, topic modeling is an automated process to defne the "latent thematic structure" of a corpus, summarizing the texts into topics or categorizing them into labels [26].…”
Section: Methodsmentioning
confidence: 99%
“…On the other hand, topic extraction, or topic modeling, is defned as the technique used to infer conceptual topics hidden in a set of documents, or corpus [22,24], where there is no explicit taxonomic scheme to project onto a corpus or when such projection (or labeling) is costly [25]. In another defnition, topic modeling is an automated process to defne the "latent thematic structure" of a corpus, summarizing the texts into topics or categorizing them into labels [26].…”
Section: Methodsmentioning
confidence: 99%
“…Many natural language processing, machine learning, and topic modeling techniques have been developed to uncover information from unstructured data (Murshed et al, 2023 ). Most of these techniques require programming skills, but improvements in the user-friendly coding software have enabled their widespread use (Yu and Egger, 2021 ).…”
Section: Introductionmentioning
confidence: 99%
“…Several terms, such as slang and abbreviations often seen in tweets, are absent from dictionaries (Kolajo et al, 2022). • Short and unstructured tweets: This issue makes it difficult to interpret the semantics of short informal writings on Twitter (Lilleberg et al, 2015;Murshed et al, 2023) . The shortness of tweets sometimes results in inadequate context, making determining a post's mood or genuine intention difficult.…”
Section: Introductionmentioning
confidence: 99%