Increasing tendency of urine protein is a risk factor for rapid eGFR decline in patients with CKD: A machine learning-based prediction model by using a big database

Inaguma, Daijo; Kitagawa, Akimitsu; Yanagiya, Ryosuke; Koseki, Akira; Iwamori, Toshiya; Kudo, Mineichi

doi:10.1371/journal.pone.0239262

Cited by 18 publications

(13 citation statements)

References 34 publications


“…Traditionally, gold-standard labels are annotated by manual review of patient records [37,85,116]. Labels have also been derived from registry data [33], laboratory results [61,112,117], diagnosis codes [30,57,58,118–120], and rule-based algorithms [59,121–123] to enable more rapid development of labeled datasets. The most commonly used methods for classifying a binary phenotype are random forest [26,28,35,37,56,57,60,62,70,81,84,117,119,120,124–126], logistic regression [36,37,57,58,60,67,82,84,93,116,117,119,125,127,128], and support vector machine (SVM) [31,35,37,58,60,81,82,84,92,97,104,116,125,126] (Supplementary Material Table S12).…”

Section: Results (mentioning)

confidence: 99%

Machine Learning Approaches for Electronic Health Records Phenotyping: A Methodical Review

Yang, Varghese, Stephenson et al. 2022

Preprint


Objective: Accurate and rapid methods for phenotyping are a prerequisite to realizing the potential of electronic health records (EHRs) data for clinical and translational research. This study reviews the literature on machine learning (ML) approaches for phenotyping with respect to the phenotypes considered, the data sources and methods used, and the contributions within the wider context of EHR-based research. Materials and Methods: We searched for relevant articles in PubMed and Web of Science published between January 1, 2018 and April 14, 2022. After screening, we collected data on 52 variables across 106 selected articles. Results: ML-based methods were developed for 156 unique phenotypes, primarily using EHR data from a single institution or health system. 72 of 106 articles leveraged unstructured data in clinical notes. In terms of methodology, supervised learning is the most prevalent ML paradigm (n = 64, 60.4%), with half of the articles employing deep learning. Semi-supervised and weakly-supervised approaches were applied to reduce the burden of obtaining gold-standard labeled data (n = 21, 19.8%), while unsupervised learning was used for phenotype discovery (n = 20, 18.9%). Federated learning has been applied to develop algorithms across multiple institutions while preserving data privacy (n = 2, 1.9%). Discussion: While the use of ML for phenotyping is growing, most articles applied traditional supervised ML to characterize the presence of common, chronic conditions. Conclusion: Continued research in ML-based methods is warranted, with particular attention to the development of advanced methods for complex phenotypes and standards for reporting and evaluating phenotyping algorithms.

“…We created a prediction model for identifying patients with rapid eGFR decline among those with CKD in our previous study. 19 However, we could not adjust the models and stratify them according to eGFR. Machine learning enables the classification of trajectories of eGFR decline into nine patterns using eGFR at baseline and the rate of eGFR decline.…”

Section: Discussion (mentioning)

confidence: 99%

“…Contrary to expectations, proteinuria did not significantly influence the prediction, as opposed to our previous report on a prediction model for rapid eGFR decline. 19 We believe that this was because patients with extremely rapid eGFR decline who were at a higher risk were enrolled. A relatively high amount of proteinuria was already recognised in the patients in this study.…”

Section: Discussion (mentioning)

confidence: 99%

“…Our hospital has maintained a large database of more than 900 000 patients who were treated and followed up for various diseases since 2004. We previously demonstrated a prediction model for patients with an eGFR decline of ≥30% within 2 years using machine learning 19. However, we realised that the patterns of eGFR decline varied even among those patients.…”

Section: Introduction (mentioning)

confidence: 94%

“…We previously demonstrated a prediction model for patients with an eGFR decline of ≥30% within 2 years using machine learning. 19 However, we realised that the patterns of eGFR decline varied even among those patients. A more detailed prediction is crucial in the real-world clinical settings because of higher risks of progression to end-stage kidney disease and the incidence of CV disease.…”

Section: Introduction (mentioning)

confidence: 94%


Development of a machine learning-based prediction model for extremely rapid decline in estimated glomerular filtration rate in patients with chronic kidney disease: a retrospective cohort study using a large data set from a hospital in Japan

et al. 2022

Self Cite


Objectives: Trajectories of estimated glomerular filtration rate (eGFR) decline vary highly among patients with chronic kidney disease (CKD). It is clinically important to identify patients who have high risk for eGFR decline. We aimed to identify clusters of patients with extremely rapid eGFR decline and develop a prediction model using a machine learning approach. Design: Retrospective single-centre cohort study. Settings: Tertiary referral university hospital in Toyoake city, Japan. Participants: A total of 5657 patients with CKD with baseline eGFR of 30 mL/min/1.73 m² and eGFR decline of ≥30% within 2 years. Primary outcome: Our main outcome was extremely rapid eGFR decline. To study complicated eGFR behaviours, we first applied a variation of group-based trajectory model, which can find trajectory clusters according to the slope of eGFR decline. Our model identified high-level trajectory groups according to baseline eGFR values and simultaneous trajectory clusters. For each group, we developed prediction models that classified the steepest eGFR decline, defined as extremely rapid eGFR decline compared with others in the same group, where we used the random forest algorithm with clinical parameters. Results: Our clustering model first identified three high-level groups according to the baseline eGFR (G1, high GFR, 99.7±19.0; G2, intermediate GFR, 62.9±10.3 and G3, low GFR, 43.7±7.8); our model simultaneously found three eGFR trajectory clusters for each group, resulting in nine clusters with different slopes of eGFR decline. The areas under the curve for classifying the extremely rapid eGFR declines in the G1, G2 and G3 groups were 0.69 (95% CI, 0.63 to 0.76), 0.71 (95% CI 0.69 to 0.74) and 0.79 (95% CI 0.75 to 0.83), respectively. The random forest model identified haemoglobin, albumin and C reactive protein as important characteristics. Conclusions: The random forest model could be useful in identifying patients with extremely rapid eGFR decline. Trial registration: UMIN 000037476; This study was registered with the UMIN Clinical Trials Registry.
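The inclusion criterion quoted in this abstract, an eGFR decline of ≥30% within 2 years, can be illustrated with a minimal sketch. This is not the authors' code: the function names, the 730-day window, the choice of the first measurement as baseline, and the rule that any single qualifying measurement counts are all assumptions made for illustration. The cited studies may define the decline differently.

```python
from datetime import date


def percent_decline(baseline, followup):
    """Relative eGFR decline from baseline as a fraction (0.30 means 30%)."""
    if baseline <= 0:
        raise ValueError("baseline eGFR must be positive")
    return (baseline - followup) / baseline


def has_rapid_decline(measurements, threshold=0.30, window_days=730):
    """Return True if any follow-up eGFR within the window has dropped by
    at least `threshold` relative to the first (baseline) measurement.

    `measurements` is a list of (date, eGFR) tuples sorted by date.
    The 730-day window and the 0.30 threshold are illustrative defaults.
    """
    base_date, base_egfr = measurements[0]
    for when, egfr in measurements[1:]:
        within_window = (when - base_date).days <= window_days
        if within_window and percent_decline(base_egfr, egfr) >= threshold:
            return True
    return False


# Hypothetical patient: eGFR falls from 60 to 41 mL/min/1.73 m² in one year.
patient = [(date(2020, 1, 1), 60.0), (date(2021, 1, 1), 41.0)]
print(has_rapid_decline(patient))
```

A label computed this way could serve as the binary target for a classifier such as a random forest, which is the kind of model the abstract describes.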


Predict, diagnose, and treat chronic kidney disease with machine learning: a systematic literature review

et al. 2023


Objectives: In this systematic review we aimed at assessing how artificial intelligence (AI), including machine learning (ML) techniques, have been deployed to predict, diagnose, and treat chronic kidney disease (CKD). We systematically reviewed the available evidence on these innovative techniques to improve CKD diagnosis and patient management. Methods: We included English language studies retrieved from PubMed. The review is therefore to be classified as a “rapid review”, since it includes one database only, and has language restrictions; the novelty and importance of the issue make missing relevant papers unlikely. We extracted 16 variables, including: main aim, studied population, data source, sample size, problem type (regression, classification), predictors used, and performance metrics. We followed the Preferred Reporting Items for Systematic Reviews (PRISMA) approach; all main steps were done in duplicate. The review was registered on PROSPERO. Results: From a total of 648 studies initially retrieved, 68 articles met the inclusion criteria. Models, as reported by authors, performed well, but the reported metrics were not homogeneous across articles and therefore direct comparison was not feasible. The most common aim was prediction of prognosis, followed by diagnosis of CKD. Algorithm generalizability, and testing on diverse populations was rarely taken into account. Furthermore, the clinical evaluation and validation of the models/algorithms was perused; only a fraction of the included studies, 6 out of 68, were performed in a clinical context. Conclusions: Machine learning is a promising tool for the prediction of risk, diagnosis, and therapy management for CKD patients. Nonetheless, future work is needed to address the interpretability, generalizability, and fairness of the models to ensure the safe application of such technologies in routine clinical practice.
