2018
DOI: 10.1101/335745
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Using Topic Modeling via Non-negative Matrix Factorization to Identify Relationships between Genetic Variants and Disease Phenotypes: A Case Study of Lipoprotein(a) (LPA)

Abstract: 23Genome-wide and phenome-wide association studies are commonly used to identify 24 important relationships between genetic variants and phenotypes. Most of these studies have 25 treated diseases as independent variables and suffered from heavy multiple adjustment burdens 26 due to the large number of genetic variants and disease phenotypes. In this study, we propose 27 using topic modeling via non-negative matrix factorization (NMF) for identifying associations 28 between disease phenotypes and genetic varian… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 24 publications
0
7
0
Order By: Relevance
“…A wide range of approaches applies tensor factorization techniques to extract phenotypes. [3,4,[15][16][17][18][19] incorporate various constraints (e.g., sparsity, non-negativity, integer) into regular tensor factorization to produce more clinically-meaningful phenotypes. [10,11] identify phenotypes and their temporal trends by using irregular tensor factorization based on PARAFAC2 [12]; yet, those approaches cannot model both dynamic and static features for meaningful phenotype extraction.…”
Section: Unsupervised Computational Phenotypingmentioning
confidence: 99%
“…A wide range of approaches applies tensor factorization techniques to extract phenotypes. [3,4,[15][16][17][18][19] incorporate various constraints (e.g., sparsity, non-negativity, integer) into regular tensor factorization to produce more clinically-meaningful phenotypes. [10,11] identify phenotypes and their temporal trends by using irregular tensor factorization based on PARAFAC2 [12]; yet, those approaches cannot model both dynamic and static features for meaningful phenotype extraction.…”
Section: Unsupervised Computational Phenotypingmentioning
confidence: 99%
“…Details of these technologies are beyond the scope of this paper. There are many approaches to extracting knowledge from free text, including named-entity extraction [39][40][41], topic modeling [42][43][44], and automatic text summarization [45][46][47]. Techniques such as clustering [48,49], frequent pattern identification [50,51], and rule extraction [52][53][54] have been used to extract knowledge from data.…”
Section: Available and Needed Technologiesmentioning
confidence: 99%
“…A wide range of approaches applies tensor factorization techniques to extract phenotypes. [12,2,13,3,14,15,16] incorporate various constraints (e.g., sparsity, non-negativity, integer) into regular tensor factorization to produce more clinically-meaningful phenotypes. [8,9] identify phenotypes and their temporal trends by using irregular tensor factorization based on PARAFAC2 [10]; yet, those approaches cannot model both dynamic and static features for meaningful phenotype extraction.…”
Section: Unsupervised Computational Phenotypingmentioning
confidence: 99%