2021
DOI: 10.2196/27113
|View full text |Cite
|
Sign up to set email alerts
|

Learning From Clinical Consensus Diagnosis in India to Facilitate Automatic Classification of Dementia: Machine Learning Study

Abstract: Background The Harmonized Diagnostic Assessment of Dementia for the Longitudinal Aging Study in India (LASI-DAD) is the first and only nationally representative study on late-life cognition and dementia in India (n=4096). LASI-DAD obtained clinical consensus diagnosis of dementia for a subsample of 2528 respondents. Objective This study develops a machine learning model that uses data from the clinical consensus diagnosis in LASI-DAD to support the clas… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 58 publications
1
10
0
Order By: Relevance
“…In this study, we set the random seed at 234 with the set.seed function to separate the training and testing sets, and the microarrays were randomly divided into the training and testing sets in a ratio of 8:2. In fact, we found that the segmentation of the training and testing data has different division standards in the different articles, such as 90:10 [ 48 ], 85:15 [ 49 ], 80:20, 70:30 [ 50 ], 60:40 [ 51 ], and so on. We choose 8:2 for three reasons: first, it is supported by the reported literature; second, considering that the number of samples is not large enough, we wanted to make the training data for developing the model as large as possible; third, the ratio of the number of DILI samples and control samples is about 80:20.…”
Section: Discussionmentioning
confidence: 99%
“…In this study, we set the random seed at 234 with the set.seed function to separate the training and testing sets, and the microarrays were randomly divided into the training and testing sets in a ratio of 8:2. In fact, we found that the segmentation of the training and testing data has different division standards in the different articles, such as 90:10 [ 48 ], 85:15 [ 49 ], 80:20, 70:30 [ 50 ], 60:40 [ 51 ], and so on. We choose 8:2 for three reasons: first, it is supported by the reported literature; second, considering that the number of samples is not large enough, we wanted to make the training data for developing the model as large as possible; third, the ratio of the number of DILI samples and control samples is about 80:20.…”
Section: Discussionmentioning
confidence: 99%
“…About 60% of LASI-DAD participants received clinical consensus diagnoses of their dementia status [16]. A machine learning model has been developed and validated to expand the clinical consensus diagnosis of dementia to all LASI-DAD participants [17].…”
Section: Methodsmentioning
confidence: 99%
“…The ultimate model was a stochastic gradient boosting model with an area under receiver operating characteristics (ROC) curve of 0.94, indicating that the model has an almost perfect discriminative ability according to thresholds suggested in prior research [19]. Details of the machine learning model have been described elsewhere [17]. To maximize the amount of labeled data for analysis in this paper, both the clinical consensus diagnoses and the predicted diagnoses in the LASI-DAD study were used as the labeled data to develop the semi-supervised machine learning model.…”
Section: Dementia Assessment and Diagnosismentioning
confidence: 99%
See 1 more Smart Citation
“…[1][2][3] Estimators for agreement are also used in other fields, such as a performance metric in machine learning. 4,5 Agreement studies are useful when the outcome is unknown or subjective (eg, Alzheimer's disease diagnosis based on neuroimaging biomarkers or the optimal machine learning model). Multiple raters may be included in agreement studies.…”
Section: Introductionmentioning
confidence: 99%