2020
DOI: 10.1371/journal.pcbi.1007866
|View full text |Cite
|
Sign up to set email alerts
|

Learning clinical networks from medical records based on information estimates in mixed-type data

Abstract: The precise diagnostics of complex diseases require to integrate a large amount of information from heterogeneous clinical and biomedical data, whose direct and indirect interdependences are notoriously difficult to assess. To this end, we propose an efficient computational approach to simultaneously compute and assess the significance of multivariate information between any combination of mixed-type (continuous/categorical) variables. The method is then used to uncover direct, indirect and possibly causal rel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
39
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 19 publications
(39 citation statements)
references
References 32 publications
0
39
0
Order By: Relevance
“…Support vector machine ( 15 , 31 ) and topic modeling ( 19 , 24 ) were used by two studies each. Finally, lasso ( 33 ), naïve Bayes ( 28 ), multivariate information-based inductive causation (MIIC) network learning algorithm ( 26 ), fuzzy c-means cluster analysis ( 23 ), conditional restricted Boltzmann machine ( 27 ), WEKA ( 15 ), and gradient boosted trees ( 33 ) were applied by one study each.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Support vector machine ( 15 , 31 ) and topic modeling ( 19 , 24 ) were used by two studies each. Finally, lasso ( 33 ), naïve Bayes ( 28 ), multivariate information-based inductive causation (MIIC) network learning algorithm ( 26 ), fuzzy c-means cluster analysis ( 23 ), conditional restricted Boltzmann machine ( 27 ), WEKA ( 15 ), and gradient boosted trees ( 33 ) were applied by one study each.…”
Section: Resultsmentioning
confidence: 99%
“…Major neurocognitive disorder, dementia, and Alzheimer's disease were the most common conditions among the articles included in our study and were reported in 11 studies ( 7 , 15 , 16 , 24 , 25 , 27 , 28 , 30 , 31 , 33 , 34 ). Among the other outcomes, a geriatric syndrome (falls, malnutrition, dementia, severe urinary control issues, absence of fecal control, visual impairment, walking difficulty, pressure ulcers, lack of social support, and weight loss) in 3 studies ( 18 , 21 , 22 ), delirium in 2 studies ( 20 , 32 ), mild cognitive impairment ( 33 ), cognitive disorder ( 26 ), multimorbidity pattern ( 23 ), mortality ( 29 ), hospital admission ( 17 ), and themes/topics mentioned in care providers' notes ( 19 ) were considered once. Outcomes were primarily defined using ICD diagnosis codes in AHD and EHR.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…As analyses, such as WGCNA, are optimally suited to identify correlated networks, but do not identify direct gene-to-gene interactions or potential causal links, we used miic analysis (Sella et al, 2018;Verny et al, 2017), recently extended to analyze continuous or mixed-type (e.g., continuous-categorical) data (Cabeli et al, 2020). The miic algorithm identifies direct paths with high confidence, and, as such, reveals potential cause-and-effect links as well as latent regulators, which may not show up when looking only at gene expression data.…”
Section: Discussionmentioning
confidence: 99%
“…While estimating (conditional) mutual information for purely discrete or continuous data is a well-studied problem [4,5,8,10,22], many real-world settings concern a mix of discrete and continuous random variables, such as age (in years) and height, or even random variables that can individually consist of a mixture of discrete and continuous components. Although several discretization-based approaches that can estimate MI for a mix of discrete and continuous random variables have recently emerged [2,17,31], so far only methods based on k-nearest neighbour (kNN) estimation were shown to work on mixed variables, which may consist of discrete-continuous mixture variables [7,19,24].…”
Section: Introductionmentioning
confidence: 99%