Summary Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant global health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including data for 563,085 European ancestry participants, and discover 5,106 new genetic variants independently associated with 29 blood cell phenotypes covering a range of variation impacting hematopoiesis. We holistically characterize the genetic architecture of hematopoiesis, assess the relevance of the omnigenic model to blood cell phenotypes, delineate relevant hematopoietic cell states influenced by regulatory genetic variants and gene networks, identify novel splice-altering variants mediating the associations, and assess the polygenic prediction potential for blood traits and clinical disorders at the interface of complex and Mendelian genetics. These results show the power of large-scale blood cell trait GWAS to interrogate clinically meaningful variants across a wide allelic spectrum of human variation.
High-throughput sequencing of full-length transcripts using long reads has paved the way for the discovery of thousands of novel transcripts, even in well-annotated mammalian species. The advances in sequencing technology have created a need for studies and tools that can characterize these novel variants. Here, we present SQANTI, an automated pipeline for the classification of long-read transcripts that can assess the quality of data and the preprocessing pipeline using 47 unique descriptors. We apply SQANTI to a neuronal mouse transcriptome using Pacific Biosciences (PacBio) long reads and illustrate how the tool is effective in characterizing and describing the composition of the full-length transcriptome. We perform extensive evaluation of ToFU PacBio transcripts by PCR to reveal that an important number of the novel transcripts are technical artifacts of the sequencing approach and that SQANTI quality descriptors can be used to engineer a filtering strategy to remove them. Most novel transcripts in this curated transcriptome are novel combinations of existing splice sites, resulting more frequently in novel ORFs than novel UTRs, and are enriched in both general metabolic and neural-specific functions. We show that these new transcripts have a major impact in the correct quantification of transcript levels by state-of-the-art short-read-based quantification algorithms. By comparing our iso-transcriptome with public proteomics databases, we find that alternative isoforms are elusive to proteogenomics detection. SQANTI allows the user to maximize the analytical outcome of long-read technologies by providing the tools to deliver quality-evaluated and curated full-length transcriptomes.
(292 words) 22High-throughput sequencing of full-length transcripts using long reads has paved the way for the discovery of 23 thousands of novel transcripts, even in very well annotated organisms as mice and humans. Nonetheless, there is a 24 need for studies and tools that characterize these novel isoforms. Here we present SQANTI, an automated pipeline 25 for the classification of long-read transcripts that computes 47 descriptors that can be used to assess the quality of 26 the data and of the preprocessing pipelines. We applied SQANTI to a neuronal mouse transcriptome using PacBio 27 long reads and illustrate how the tool is effective in readily describing the composition of and characterizing the full-28 length transcriptome. We perform extensive evaluation of ToFU PacBio transcripts by PCR to reveal that an 29 important number of the novel transcripts are technical artifacts of the sequencing approach, and that SQANTI 30 quality descriptors can be used to engineer a filtering strategy to remove them. Most novel transcripts in this curated 31 transcriptome are novel combinations of existing splice sites, result more frequently in novel ORFs than novel UTRs 32 and are enriched in both general metabolic and neural specific functions. We show that these new transcripts have a 33 major impact in the correct quantification of transcript levels by state-of-the-art short-read based quantification 34 algorithms. By comparing our iso-transcriptome with public proteomics databases we find that alternative isoforms
Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including 563,946 European ancestry participants, and discover 5,106 new genetic variants independently associated with 29 blood cell phenotypes covering the full allele frequency spectrum of variation impacting hematopoiesis. We holistically characterize the genetic architecture of hematopoiesis, assess the relevance of the omnigenic model to blood cell phenotypes, delineate relevant hematopoietic cell states influenced by regulatory genetic variants and gene networks, identify novel splice-altering variants mediating the associations, and assess the polygenic prediction potential for blood cell traits and clinical disorders at the interface of complex and Mendelian genetics. These results show the power of large-scale blood cell GWAS to interrogate clinically meaningful variants across the full allelic spectrum of human variation.
Chemokines are relevant molecules in shaping the tumor microenvironment, although their contributions to tumorigenesis are not fully understood. We studied the influence of the chemokine CX3CL1/fractalkine in de novo breast cancer formation using HER2/neu transgenic mice. CX3CL1 expression was downmodulated in HER2/neu tumors, yet, paradoxically, adenovirus-mediated CX3CL1 expression in the tumor milieu enhanced mammary tumor numbers in a dose-dependent manner. Increased tumor multiplicity was not a consequence of CX3CL1-induced metastatic dissemination of the primary tumor, although CX3CL1 induced epithelial-to-mesenchymal transition in breast cancer cells in vitro. Instead, CX3CL1 triggered cell proliferation by induction of ErbB receptors through the proteolytic shedding of an ErbB ligand. This effect was important insofar as mammary tumorigenesis was delayed and tumor multiplicity was reduced by genetic deletion of CX3CL1 in HER2/neu mice, but not in polyoma middle T-antigen oncomice. Our findings support the conclusion that CX3CL1 acts as a positive modifier of breast cancer in concert with ErbB receptors.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.