BackgroundIdentifying dementia early in time, using real world data, is a public health challenge. As only two-thirds of people with dementia now ultimately receive a formal diagnosis in United Kingdom health systems and many receive it late in the disease process, there is ample room for improvement. The policy of the UK government and National Health Service (NHS) is to increase rates of timely dementia diagnosis. We used data from general practice (GP) patient records to create a machine-learning model to identify patients who have or who are developing dementia, but are currently undetected as having the condition by the GP.MethodsWe used electronic patient records from Clinical Practice Research Datalink (CPRD). Using a case-control design, we selected patients aged >65y with a diagnosis of dementia (cases) and matched them 1:1 by sex and age to patients with no evidence of dementia (controls). We developed a list of 70 clinical entities related to the onset of dementia and recorded in the 5 years before diagnosis. After creating binary features, we trialled machine learning classifiers to discriminate between cases and controls (logistic regression, naïve Bayes, support vector machines, random forest and neural networks). We examined the most important features contributing to discrimination.ResultsThe final analysis included data on 93,120 patients, with a median age of 82.6 years; 64.8% were female. The naïve Bayes model performed least well. The logistic regression, support vector machine, neural network and random forest performed very similarly with an AUROC of 0.74. The top features retained in the logistic regression model were disorientation and wandering, behaviour change, schizophrenia, self-neglect, and difficulty managing.ConclusionsOur model could aid GPs or health service planners with the early detection of dementia. Future work could improve the model by exploring the longitudinal nature of patient data and modelling decline in function over time.
ObjectiveTo determine comparative fracture risk in HIV patients compared with uninfected controls.DesignA randomised cross-sectional study assessing bone mineral density (BMD), fracture history and risk factors in the 2 groups.SettingHospital Outpatients.Subbbjects222 HIV infected patients and an equal number of age-matched controls. Assessments: Fracture risk factors were assessed and biochemical, endocrine and bone markers measured. BMD was assessed at hip and spine. 10-year fracture probability (FRAX) and remaining lifetime fracture probability (RFLP) were calculated.Main Outcome MeasuresBMD, and history of fractures.ResultsReported fractures occurred more frequently in HIV than controls, (45 vs. 16; 20.3 vs. 7%; OR=3.27; p=0.0001), and unsurprisingly in this age range, non-fragility fractures in men substantially contributed to this increase. Osteoporosis was more prevalent in patients with HIV (17.6% vs. 3.6%, p<0.0001). BMD was most reduced, and predicted fracture rates most increased, at the spine. Low BMD was associated with antiretroviral therapy (ART), low body mass index and PTH. 10-year FRAX risk was <5% for all groups. RLFP was greater in patients with HIV (OR=1.22; p=0.003) and increased with ART (2.4 vs. 1.50; OR= 1.50; p=0.03). ConclusionsThe increased fracture rate in HIV patients in our relatively youthful population is partly driven by fractures, including non-fragility fractures, in men. Nonetheless, these findings may herald a rise in osteoporotic fractures in HIV patients. An appropriate screening and management response is required to assess these risks and identify associated lifestyle factors that are also associated with other conditions such as cardiovascular disease and diabetes.
Dementia is one of the most feared illnesses that has a growing year-to-year negative global impact, having a health and social care cost higher than cancer, stroke and chronic heart disease, taken together. Without the availability of a cure, nor a standardised clinical test, the utilisation of machine learning methods to identify individuals that are at risk of developing dementia could bring a new step towards proactive intervention. This study's goal is to carry out a precursor analysis leading to building classification models with enhanced capabilities for differentiating diagnoses of CN (Cognitively Normal), MCI (Mild Cognitive Impairment) and Dementia. The predictive modelling approach we propose is based on the ReliefF method combined with statistical permutation tests for feature selection, and on model training, tuning, and testing based on algorithms such as Random Forests, Support Vector Machines, Gaussian Processes, Stochastic Gradient Boosting, and eXtreme Gradient Boosting. Stability of model performances were studied in computationally intensive Monte Carlo simulations. The results consistently show that our models accurately detect dementia, and also mild cognitive impairment patients by only using the inclusion of baseline measurements as predictors, thus illustrating the importance of baseline measurements. The best results issued from Monte Carlo were achieved by eXtreme Gradient Boosting optimised models, with an accuracy of 0.88 (SD 0.02), a sensitivity of 0.93 (SD 0.02) and a specificity of 0.94 (SD 0.01) for dementia, and a sensitivity of 0.86 (SD 0.02) and a specificity of 0.9 (SD 0.02) for mild cognitive impairment. These results support in particular future developments for a riskbased method that can identify an individual's risk of developing dementia.
Approaches to knowledge extraction (KE) in the health domain often start by annotating text to indicate the knowledge to be extracted, and then use the annotated text to train systems to perform the KE. This may work for annotating named entities or other contiguous noun phrases (drugs, some drug effects), but becomes increasingly difficult when items tend to be expressed across multiple, possibly noncontiguous, syntactic constituents (e.g. most descriptions of drug effects in user-generated text). Other issues include that it is not always clear how annotations map to actionable insights, or how they scale up to, or can form part of, more complex KE tasks. This paper reports our efforts in developing an approach to extracting knowledge about drug nonadherence from health forums which led us to conclude that development cannot proceed in separate steps but that all aspects-from conceptualisation to annotation scheme development, annotation, KE system training and knowledge graph instantiation-are interdependent and need to be co-developed. Our aim in this paper is two-fold: we describe a generally applicable framework for developing a KE approach, and present a specific KE approach, developed with the framework, for the task of gathering information about antidepressant drug nonadherence. We report the conceptualisation, the annotation scheme, the annotated corpus, and an analysis of annotated texts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.