2020
DOI: 10.1109/access.2020.3036043
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Sentence Embedding Based Topic Detection Method for Microblogs

Abstract: Topic detection is a difficult challenging task, especially when the exact number of topics is unknown. In this paper, we present a novel topic detection approach based on neural computing to detect topics in a microblogging dataset. We use an unsupervised neural sentence embedding model to map blogs to an embedding space. The proposed model is a weighted power mean sentence embedding model in which weights are calculated by a targeted attention mechanism. The experimental results show that our embedding model… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 45 publications
0
4
0
Order By: Relevance
“…It a technique that extracts various microblogging features they are emoticons, URLs, question marks, punctuation marks, all caps, hashtags, elongated words,user names, etc. Wan et al [ 45 ]. Hashtags: the number of hashtags.…”
Section: Proposed Methodology For Sentiment Analysis From Spam Smsmentioning
confidence: 99%
“…It a technique that extracts various microblogging features they are emoticons, URLs, question marks, punctuation marks, all caps, hashtags, elongated words,user names, etc. Wan et al [ 45 ]. Hashtags: the number of hashtags.…”
Section: Proposed Methodology For Sentiment Analysis From Spam Smsmentioning
confidence: 99%
“…There are also some researches on TDT around the improvement of classification or clustering algorithms. Wan et al [ 16 ] proposed a clustering algorithm RADBSCAN with relational awareness based on DBSCAN. This algorithm can determine the number of topics according to the dataset itself, and the topic recognition effect is better.…”
Section: Related Workmentioning
confidence: 99%
“…In the appendix we provide a detailed description of how we implemented this additional filter, along with a sample of text ranked from least to most related to economic concepts (Table 12). A number of recent works suggest using a similar approach where a final computational step is taken to classify text data, see Bao et al (2020), Wu et al (2020), Wan et al (2020) and Khan et al (2020). A small scale evaluation of the classification strategy (100 pictures and associated text) results in an estimated precision of 0.78 and estimated recall of 0.80.…”
Section: Selection Of Imagesmentioning
confidence: 99%