Adult height is a highly heritable trait. Here we identified 31.6 million sequence variants by whole-genome sequencing of 8,453 Icelanders and tested them for association with adult height by imputing them into 88,835 Icelanders. Here we discovered 13 novel height associations by testing four different models including parent-of-origin (|b| ¼ 0.4-10.6 cm). The minor alleles of three parent-of-origin signals associate with less height only when inherited from the father and are located within imprinted regions (IGF2-H19 and DLK1-MEG3). We also examined the association of these sequence variants in a set of 12,645 Icelanders with birth length measurements. Two of the novel variants, (IGF2-H19 and TET1), show significant association with both adult height and birth length, indicating a role in early growth regulation. Among the parent-of-origin signals, we observed opposing parental effects raising questions about underlying mechanisms. These findings demonstrate that common variations affect human growth by parental imprinting.
Creatine kinase (CK) and lactate dehydrogenase (LDH) are widely used markers of tissue damage. To search for sequence variants influencing serum levels of CK and LDH, 28.3 million sequence variants identified through whole-genome sequencing of 2,636 Icelanders were imputed into 63,159 and 98,585 people with CK and LDH measurements, respectively. Here we describe 13 variants associating with serum CK and 16 with LDH levels, including four that associate with both. Among those, 15 are non-synonymous variants and 12 have a minor allele frequency below 5%. We report sequence variants in genes encoding the enzymes being measured (CKM and LDHA), as well as in genes linked to muscular (ANO5) and immune/inflammatory function (CD163/CD163L1, CSF1, CFH, HLA-DQB1, LILRB5, NINJ1 and STAB1). A number of the genes are linked to the mononuclear/phagocyte system and clearance of enzymes from the serum. This highlights the variety in the sources of normal diversity in serum levels of enzymes.
Analysis of EuroTARGET cohort data will contribute to personalization of therapy for patients with mRCC. The extensive clinical data and multiplatform EuroTARGET data will be freely available.
SummaryLarge resequencing projects require a significant amount of storage for raw sequences, as well as alignment files. Since the raw sequences are redundant once the alignment has been generated, it is possible to keep only the alignment files. We present BamHash, a checksum based method to ensure that the read pairs in FASTQ files match exactly the read pairs stored in BAM files, regardless of the ordering of reads. BamHash can be used to verify the integrity of the files stored and discover any discrepancies. Thus, BamHash can be used to determine if it is safe to delete the FASTQ files storing raw sequencing reads after alignment, without the loss of data.
Availability and ImplementationThe software is implemented in C++, GPL licensed and available at https://github.com/DecodeGenetics/BamHash
Contactpmelsted@hi.is
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.