2019
DOI: 10.3390/ijgi8020082
|View full text |Cite
|
Sign up to set email alerts
|

A Varied Density-based Clustering Approach for Event Detection from Heterogeneous Twitter Data

Abstract: Extracting the latent knowledge from Twitter by applying spatial clustering on geotagged tweets provides the ability to discover events and their locations. DBSCAN (density-based spatial clustering of applications with noise), which has been widely used to retrieve events from geotagged tweets, cannot efficiently detect clusters when there is significant spatial heterogeneity in the dataset, as it is the case for Twitter data where the distribution of users, as well as the intensity of publishing tweets, varie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 27 publications
(12 citation statements)
references
References 50 publications
0
12
0
Order By: Relevance
“…Although K-Means is very fast and mostly used for clustering, it requires one to define the number of clusters as a parameter to the algorithm. Among the existing clustering approaches, a density-based algorithm particularly DBSCAN (density-based spatial clustering with noise) and its variations, is more efficient for detecting clusters with arbitrary shapes from the noisy dataset where there is no prior knowledge about the number of clusters (Ghaemi and Farnaghi, 2019;Liu et al, 2007). Many improved versions of this algorithm are also available (such as NG-DBSCAN (Lulli et al, 2016)) to overcome the scalability issues of densitybased clustering, but they fail to address the ineffectiveness of density-based approaches in sparse data setting.…”
Section: Related Workmentioning
confidence: 99%
“…Although K-Means is very fast and mostly used for clustering, it requires one to define the number of clusters as a parameter to the algorithm. Among the existing clustering approaches, a density-based algorithm particularly DBSCAN (density-based spatial clustering with noise) and its variations, is more efficient for detecting clusters with arbitrary shapes from the noisy dataset where there is no prior knowledge about the number of clusters (Ghaemi and Farnaghi, 2019;Liu et al, 2007). Many improved versions of this algorithm are also available (such as NG-DBSCAN (Lulli et al, 2016)) to overcome the scalability issues of densitybased clustering, but they fail to address the ineffectiveness of density-based approaches in sparse data setting.…”
Section: Related Workmentioning
confidence: 99%
“…However, this model only used the textual information and not the localization and temporal information available from the tweets. Similarly, in 2019 Ghaemi and Farnaghi [42] proposed, VCDT (Varied Density-based spatial Clustering for Twitter data), an improvement on DBSCAN by incorporating spatial heterogeneity by identifying clusters of varying spatial densities. They considered the textual as well as spatial features of the Twitter data to detect the spatially clustered events.…”
Section: Clustering Based Approachesmentioning
confidence: 99%
“…However, in actual line loss work, unqualified data may be produced in the process of collecting, transferring, and storing data. To address the issue, we applied the DBSCAN algorithm to cluster the data, which can realize anomaly detection [40,41]. DBSCAN is a clustering algorithm that, given a set of points in some space, groups together points that are closely packed together, marking points that lie alone in low-density regions as outliers.…”
Section: Distribution Network Classification Based On Dbscanmentioning
confidence: 99%