2018
DOI: 10.1101/256396
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Estimating variance components in population scale family trees

Abstract: The rapid digitization of genealogical and medical records enables the assembly of extremely large pedigree records spanning millions of individuals. Such pedigrees provide the opportunity to answer genetic and epidemiological questions in scales much larger than previously possible. Linear mixed models (LMMs) are often used for analysis of pedigree data. However, LMMs cannot naturally scale to large pedigrees spanning millions of individuals, owing to their steep computational and storage requirements. Here w… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 102 publications
(128 reference statements)
0
9
0
Order By: Relevance
“…Examples of such databases include the Utah Population database and a number of reliable Icelandic databases 51,52 . The availability of population-scale family trees 53 would further allow the determination of the parental origin of alleles 54 and generally enable large scale human population studies on epidemiological history 55 . Overall, the reliability of data and its impact on T1D and RA diagnoses must be discussed.…”
Section: Discussionmentioning
confidence: 99%
“…Examples of such databases include the Utah Population database and a number of reliable Icelandic databases 51,52 . The availability of population-scale family trees 53 would further allow the determination of the parental origin of alleles 54 and generally enable large scale human population studies on epidemiological history 55 . Overall, the reliability of data and its impact on T1D and RA diagnoses must be discussed.…”
Section: Discussionmentioning
confidence: 99%
“…We computed heritability with the Sci-LMM Python package, which constructs and works with large scale relationships matrices and fits them to the corresponding LMM within several hours. Our Identity By Descent (IBD) matrix (an identity-by-descent relationships based matrix) was the ARM computed from the entire pedigree ( 27 ) . We used Haseman-Elston regression to compute the heritability measure , and we estimate the h 2 standard error via the average information restricted maximum likelihood (AI-REML) procedure ( 44 , 45 ) .…”
Section: Heritability Estimationmentioning
confidence: 99%
“…Today, heritability estimation is typically performed using Linear Mixed Models (LMMs) ( 26 ) and requires a kinship matrix, commonly computed from genetic information. Using the Sparse Cholesky Factorization (Sci-LMM) package ( 27 ) , a statistical modeling framework for analyzing population-size pedigrees, we constructed a relationship matrix and fitted the corresponding LMM solely from EHR information, without costly genetic testing. We estimated the heritability of absolute HbA1c% reduction in response to metformin to be 15.9% (95% CI 1.2%-30.5%) in males and 20.9% (95% CI 7.5%-34.3%) in females of the total explained variability.…”
Section: Introductionmentioning
confidence: 99%
“…To reduce the computational complexity of genetic similarity matrix operations, several research programs have been conducted to store and manipulate sparse representations of the genetic similarity matrix (Shor et al, 2019 ). In these representations, researchers set a threshold and then zero out elements of the genetic similarity matrix with absolute value less than this threshold.…”
Section: Introductionmentioning
confidence: 99%