2023
DOI: 10.3389/frai.2022.1055294
|View full text |Cite
|
Sign up to set email alerts
|

Qluster: An easy-to-implement generic workflow for robust clustering of health data

Abstract: The exploration of heath data by clustering algorithms allows to better describe the populations of interest by seeking the sub-profiles that compose it. This therefore reinforces medical knowledge, whether it is about a disease or a targeted population in real life. Nevertheless, contrary to the so-called conventional biostatistical methods where numerous guidelines exist, the standardization of data science approaches in clinical research remains a little discussed subject. This results in a significant vari… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 106 publications
0
2
0
Order By: Relevance
“…To evaluate the validity and reliability of identified clusters, the Jaccard coefficient, which measures the similarity of two subsets on a cluster based on their cluster classification, will be utilized [ 61 , 62 ]. Using bootstrap sampling, subsets of the data will be randomly selected, and KAMILA will be applied to each subset.…”
Section: Methodsmentioning
confidence: 99%
“…To evaluate the validity and reliability of identified clusters, the Jaccard coefficient, which measures the similarity of two subsets on a cluster based on their cluster classification, will be utilized [ 61 , 62 ]. Using bootstrap sampling, subsets of the data will be randomly selected, and KAMILA will be applied to each subset.…”
Section: Methodsmentioning
confidence: 99%
“…Health data encompasses information about an individual's or a population's health conditions, health outcomes, and quality of life ( 1 ). They include clinical, environmental, socioeconomic, and behavioral data relevant to health and wellness ( 2 ).…”
Section: Introductionmentioning
confidence: 99%