2019
DOI: 10.1109/access.2019.2922162
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Streaming Data Clustering Algorithm Based on Fitness Proportionate Sharing

Abstract: As an unsupervised learning technique, clustering can effectively capture the patterns in a data stream based on similarities among the data. Traditional data stream clustering algorithms either heavily depend on some prior knowledge or predefined parameters while the characteristics of real-time data are considered unknown. Besides, the user-specified threshold is used to overcome the effect of outliers and noises, which significantly affects the clustering performance. The overlap among clusters is another m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 22 publications
(12 citation statements)
references
References 58 publications
0
12
0
Order By: Relevance
“…Compared to CFSFDP, the proposed algorithm can obtain d c automatically and use sparse learning to determine the neighbors of each data point, removing irrelevant data points at the same time. Yan et al [35] proposed a new clustering algorithm based on fitness proportionate sharing to map the problem into a multimodal optimization problem. The individuals with the highest density values were the cluster centers, the fitness proportionate sharing strategy was implemented in the identification results to overcome the sensitivity of uneven density values of cluster centers.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Compared to CFSFDP, the proposed algorithm can obtain d c automatically and use sparse learning to determine the neighbors of each data point, removing irrelevant data points at the same time. Yan et al [35] proposed a new clustering algorithm based on fitness proportionate sharing to map the problem into a multimodal optimization problem. The individuals with the highest density values were the cluster centers, the fitness proportionate sharing strategy was implemented in the identification results to overcome the sensitivity of uneven density values of cluster centers.…”
Section: Related Workmentioning
confidence: 99%
“…The sub-trajectory segments are clustered based on different cutoff distances. The value range of d c in the city 1 trajectory dataset is [20], [30], and the value range of d c in the city 2 trajectory dataset is [25], [35]. Comparing the trend of the number of clusters corresponding to different cutoff distance values, the results are shown in Figure 12.…”
Section: B Evaluation Of High-density Sub-trajectory Clustering Algomentioning
confidence: 99%
“…The task scheduling in edge cloud often has a large number of tasks. Clustering algorithm can classify tasks and maximize the similarity between data samples within the same cluster [23] . Ref.…”
Section: Related Workmentioning
confidence: 99%
“…Most of these studies in literature focused on rigorous data analysis that involves feature selection [10], [11], and statistical feature formation techniques. These processes are time-consuming, and some features may only exist in some vehicles, which limits the practicability of the work.…”
Section: Related Workmentioning
confidence: 99%