Genome-wide association studies (GWAS) have identified thousands of variants associated with complex traits, but their biological interpretation often remains unclear. Most of these variants overlap with expression QTLs, indicating their potential involvement in regulation of gene expression. Here, we propose a transcriptome-wide summary statistics-based Mendelian Randomization approach (TWMR) that uses multiple SNPs as instruments and multiple gene expression traits as exposures, simultaneously. Applied to 43 human phenotypes, it uncovers 3,913 putatively causal gene–trait associations, 36% of which have no genome-wide significant SNP nearby in previous GWAS. Using independent association summary statistics, we find that the majority of these loci were missed by GWAS due to power issues. Noteworthy among these links is educational attainment-associated BSCL2 , known to carry mutations leading to a Mendelian form of encephalopathy. We also find pleiotropic causal effects suggestive of mechanistic connections. TWMR better accounts for pleiotropy and has the potential to identify biological mechanisms underlying complex traits.
Age at first sexual intercourse (AFS) and age at first birth (AFB) have implications for health and evolutionary fitness. In this genome-wide association study (AFS, N=387,338; AFB, N=542,901), we identify 371 SNPs, 11 sex-specific, with a 5-6% polygenic score (PGS) prediction. Heritability of AFB shifted from 9% [CI=4-14] for women born in 1940 to 22% [CI=19-25] in 1965. Signals are driven by the genetics of reproductive biology and externalising behaviour, with key genes related to follicle stimulating hormone (FSHB), implantation (ESR1), infertility, and spermatid differentiation. Our findings suggest that Polycystic Ovarian Syndrome may lead to later AFB, linking with infertility. Late AFB is associated with parental longevity, and reduced incidence of Type 2 Diabetes (T2D) and Cardiovascular disease (CAD). Higher childhood socioeconomic circumstances and those in the highest PGS decile (90%+) experience markedly later reproductive onset. Results are relevant for improving teenage and late-life health, for understanding longevity, and guiding experimentation into mechanisms of infertility.
Mitochondrial diseases are a heterogeneous group of disorders that can be caused by mutations in the nuclear or mitochondrial genome. Mitochondrial DNA (mtDNA) variants may exist in a state of heteroplasmy, where a percentage of DNA molecules harbor a variant, or homoplasmy, where all DNA molecules have the same variant. The relative quantity of mtDNA in a cell, or copy number (mtDNA-CN), is associated with mitochondrial function, human disease, and mortality. To facilitate accurate identification of heteroplasmy and quantify mtDNA-CN, we built a bioinformatics pipeline that takes whole genome sequencing data and outputs mitochondrial variants, and mtDNA-CN. We incorporate variant annotations to facilitate determination of variant significance. Our pipeline yields uniform coverage by remapping to a circularized chrM and by recovering reads falsely mapped to nuclear-encoded mitochondrial sequences. Notably, we construct a consensus chrM sequence for each sample and recall heteroplasmy against the sample's unique mitochondrial genome. We observe an approximately 3-fold increased association with age for heteroplasmic variants in non-homopolymer regions and, are better able to capture genetic variation in the D-loop of chrM compared to existing software. Our bioinformatics pipeline more accurately captures features of mitochondrial genetics than existing pipelines that are important in understanding how mitochondrial dysfunction contributes to disease.
Background Seborrheic dermatitis (SD) is a chronic inflammatory skin disease with a multifactorial aetiology. Malassezia yeasts have been associated with the disease but the role of bacterial composition in SD has not been thoroughly investigated. Objectives To profile the bacterial microbiome of SD patients and compare this with the microbiome of individuals with no inflammatory skin disease (controls). Methods This was a cross sectional study embedded in a population-based study. Skin swabs were taken from naso-labial fold from patients with seborrheic dermatitis (lesional skin: n = 22; non-lesional skin SD: n = 75) and controls (n = 465). Sample collection began in 2016 at the research facility and is still ongoing. Shannon and Chao1 α- diversity metrics were calculated per group. Associations between the microbiome composition of cases and controls was calculated using multivariate statistics (permANOVA) and univariate statistics. Results We found an increased α-diversity between SD lesional cases versus controls (Shannon diversity: Kruskal-Wallis rank sum: Chi-squared: 19.06; global p-value = 7.7x10-5). Multivariate statistical analysis showed significant associations in microbiome composition when comparing lesional SD skin to controls (p-value = 0.03;R2 = 0.1%). Seven out of 13 amplicon sequence variants (ASVs) that were significantly different between controls and lesional cases were members of the genus Staphylococcus, most of which showed increased composition in lesional cases, and were closely related to S. capitis S. caprae and S. epidermidis. Conclusion Microbiome composition differs in patients with seborrheic dermatitis and individuals without diseases. Differences were mainly found in the genus Staphylococcus.
Most existing TWAS tools require individual-level eQTL reference data and thus are not applicable to summary-level reference eQTL datasets. The development of TWAS methods that can harness summary-level reference data is valuable to enable TWAS in broader settings and enhance power due to increased reference sample size. Thus, we develop a TWAS framework called OTTERS (Omnibus Transcriptome Test using Expression Reference Summary data) that adapts multiple polygenic risk score (PRS) methods to estimate eQTL weights from summary-level eQTL reference data and conducts an omnibus TWAS. We show that OTTERS is a practical and powerful TWAS tool by both simulations and application studies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.