2017
DOI: 10.1142/s0218001418500039
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Clustering-Based Sampling Approach for Minimum Sample Set in Big Data Environment

Abstract: The data are rapidly expanding nowadays, which makes it very difficult to analyze valuable information from big data. Most of the existing data mining algorithms deal with big data problems at large time and space costs. This paper focuses on the sampling problem of big data and puts forward an efficient heuristic Cluster Sampling Arithmetic, called CSA. Many of the former researchers adopted random method to extract early sample set from the original data and then made a variety of different processing of the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
15
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 10 publications
0
15
0
Order By: Relevance
“…In the big data era, popular technologies like cloud platforms, cloud computing and data warehouse emerge as per today's requirement. 10,28 In a word, it is possible to build a clear correlation among driving data, driving behaviors and driving safety with an e®ective utilization and integration of various vehicles data through data mining, which also provides technological services in driving safety for intelligent vehicles in the future. 25,26 Meanwhile, the Society of Automotive Engineers (SAE) has divided vehicle automation into six levels.…”
Section: Introductionmentioning
confidence: 99%
“…In the big data era, popular technologies like cloud platforms, cloud computing and data warehouse emerge as per today's requirement. 10,28 In a word, it is possible to build a clear correlation among driving data, driving behaviors and driving safety with an e®ective utilization and integration of various vehicles data through data mining, which also provides technological services in driving safety for intelligent vehicles in the future. 25,26 Meanwhile, the Society of Automotive Engineers (SAE) has divided vehicle automation into six levels.…”
Section: Introductionmentioning
confidence: 99%
“…Because comprehensive metrics c and distances between potential clustering centers are always related [41], they can be integrated to automatically identifying clustering centers.…”
Section: Principles and Algorithm Of Acdpcmentioning
confidence: 99%
“…We improved the algorithm proposed by Zhao [41,42] to recognize the clustering centers. Discriminant distance φ T i is defined as…”
Section: Principles and Algorithm Of Acdpcmentioning
confidence: 99%
See 1 more Smart Citation
“…Clustering has been applied in a wide variety of¯elds, ranging from engineering (machine learning, arti¯cial intelligence, pattern recognition, mechanical engineering, electrical engineering), 2 computer sciences (web mining, spatial database analysis, textual document collection, image segmentation), 14 life and medical sciences (genetics, biology, microbiology, psychiatry, clinic, pathology), to earth sciences (geography, geology, remote sensing), social sciences (sociology, psychology, education), 16 and economics (marketing, business). 7,10,23,27 Traditional methods of clustering can be broadly categorized into those of hierarchical, partitioning, density-based, model-based, grid-based, and softcomputing. 17,19,21,25 Inspired by DBSCAN, 6 many density-based clustering methods 5,6,20 have been proposed.…”
Section: Introductionmentioning
confidence: 99%