2016
DOI: 10.1016/j.eswa.2016.02.005
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting clustering algorithms in a multiple-level fashion: A comparative study in the medical care scenario

Abstract: Clustering real-world data is a challenging task, since many real-data collections are characterized by an inherent sparseness and variable distribution. An appealing domain that generates such data collections is the medical care scenario where collected data include a large cardinality of patient records and a variety of medical treatments usually adopted for a given disease pathology. This paper proposes a two-phase data mining methodology to iteratively analyze different dataset portions and locally identi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 25 publications
0
11
0
Order By: Relevance
“…In order to meet the needs of users, the system cannot only adapt to the analysis and processing needs of big data by optimizing the data processing program but also provide a good data platform for the upper-level analysis and processing by optimizing the bottom-level data organization structure. Because in a distributed environment, the data processing mode of the big data system has been transformed into "computing close to the data," and the location of the data block has a direct impact on the system load, which can directly affect the performance of analysis and processing [7].…”
Section: Related Workmentioning
confidence: 99%
“…In order to meet the needs of users, the system cannot only adapt to the analysis and processing needs of big data by optimizing the data processing program but also provide a good data platform for the upper-level analysis and processing by optimizing the bottom-level data organization structure. Because in a distributed environment, the data processing mode of the big data system has been transformed into "computing close to the data," and the location of the data block has a direct impact on the system load, which can directly affect the performance of analysis and processing [7].…”
Section: Related Workmentioning
confidence: 99%
“…Classification according to the International Statistical Classification of Diseases and Related Health Problems 10th Revision Chapter Description of Chapter according to ICD-10 classification N (%) Ref. 1 Certain infectious and parasitic diseases 2 (4 %) [ 40 , 50 ] 2 Neoplasms 8 (16 %) [ 7 , 20 , 28 , 32 , 37 , 41 , 44 , 46 ] 4 Endocrine, nutritional and metabolic diseases 10 (20 %) [ 22 , 23 , 29 , 30 , 47 , 49 , 55 , 57 , 59 , 60 ] 5 Mental, Behavioral and Neurodevelopmental disorders 2 (4 %) [ 27 , 35 ] 6 Diseases of the nervous system 2 (4 %) [ 6 , 31 ] 9 Diseases of the circulatory system 10 (20 %) [ 8 , 10 , 24 , 26 , 38 ...…”
Section: Resultsmentioning
confidence: 99%
“…Other approaches are focused on providing specific technological and algorithmic contributions, such as clustering techniques [16,17], simulator engines [18], and association rule mining [19]. Such works exploit data-mining techniques, often based on both supervised (e.g., classification and prediction models) and unsupervised learning (e.g., clustering), hence proposing hybrid models.…”
Section: General-purpose and Data-driven Solutionsmentioning
confidence: 99%