Our objective was to perform a genome-wide association study for content in bovine milk of α(S1)-casein (α(S1)-CN), α(S2)-casein (α(S2)-CN), β-casein (β-CN), κ-casein (κ-CN), α-lactalbumin (α-LA), β-lactoglobulin (β-LG), casein index, protein percentage, and protein yield using a 50K single nucleotide polymorphism (SNP) chip. In total, 1,713 Dutch Holstein-Friesian cows were genotyped for 50,228 SNP and a 2-step association study was performed. The first step involved a general linear model and the second step used a mixed model accounting for all family relationships. Associations with milk protein content and composition were detected on 20 bovine autosomes. The main genomic regions associated with milk protein composition or protein percentage were found on chromosomes 5, 6, 11, and 14. The number of chromosomal regions showing significant (false discovery rate <0.01) effects ranged from 3 for β-CN and 3 for β-LG to 12 for α(S2)-CN. A genomic region on Bos taurus autosome (BTA) 6 was significantly associated with all 6 major milk proteins, and a genomic region on BTA 11 was significantly associated with the 4 caseins and β-LG. In addition, regions were detected that only showed a significant effect on one of the milk protein fractions: regions on BTA 13 and 22 with effects on α(S1)-CN; regions on BTA 1, 9, 10, 17, 19, and 28 with effects on α(S2)-CN; a region on BTA 6 with an effect on β-CN; regions on BTA 13 and 21 with effects on κ-CN; regions on BTA 1, 5, 9, 16, 17, and 26 with effects on α-LA; and a region on BTA 24 with an effect on β-LG. The proportion of genetic variance explained by the SNP showing the strongest association in each of these genomic regions ranged from <1% for α(S1)-CN on BTA 22 to almost 100% for casein index on BTA 11. Variation associated with regions on BTA 6, 11, and 14 could in large part but not completely be explained by known protein variants of β-CN (BTA 6), κ-CN (BTA 6), and β-LG (BTA 11) or DGAT1 variants (BTA 14). Our results indicate 3 regions with major effects on milk protein composition, in addition to several regions with smaller effects involved in the regulation of milk protein composition.
SummaryThe objective of this study was to perform a whole genome scan to detect quantitative trait loci (QTL) for milk protein composition in 849 Holstein–Friesian cows originating from seven sires. One morning milk sample was analysed for the major milk proteins using capillary zone electrophoresis. A genetic map was constructed with 1341 single nucleotide polymorphisms, covering 2829 centimorgans (cM) and 95% of the cattle genome. The chromosomal regions most significantly related to milk protein composition (Pgenome < 0.05) were found on Bos taurus autosomes (BTA) 6, 11 and 14. The QTL on BTA6 was found at about 80 cM, and affected αS1‐casein, αS2‐casein, β‐casein and κ‐casein. The QTL on BTA11 was found at 124 cM, and affected β‐lactoglobulin, and the QTL on BTA14 was found at 0 cM, and affected protein percentage. The proportion of phenotypic variance explained by the QTL was 3.6% for β‐casein and 7.9% for κ‐casein on BTA6, 28.3% for β‐lactoglobulin on BTA11, and 8.6% for protein percentage on BTA14. The QTL affecting αS2‐casein on BTA6 and 17 showed a significant interaction. We investigated the extent to which the detected QTL affecting milk protein composition could be explained by known polymorphisms in β‐casein, κ‐casein, β‐lactoglobulin and DGAT1 genes. Correction for these polymorphisms decreased the proportion of phenotypic variance explained by the QTL previously found on BTA6, 11 and 14. Thus, several significant QTL affecting milk protein composition were found, of which some QTL could partially be explained by polymorphisms in milk protein genes.
BackgroundA newly recognized type of genetic variation, Copy Number Variation (CNV), is detected in mammalian genomes, e.g. the cattle genome. This form of variation can potentially cause phenotypic variation. Our objective was to determine whether dense SNP (single nucleotide polymorphisms) panels can capture the genetic variation due to a simple bi-allelic CNV, with the prospect of including the effect of such structural variations into genomic predictions.MethodsA deletion type CNV on bovine chromosome 6 was predicted from its neighboring SNP with a multiple regression model. Our dataset consisted of CNV genotypes of 1,682 cows, along with 100 surrounding SNP genotypes. A prediction model was fitted considering 10 to 100 surrounding SNP and the accuracy obtained directly from the model was confirmed by cross-validation.Results and conclusionsThe accuracy of prediction increased with an increasing number of SNP in the model and the predicted accuracies were similar to those obtained by cross-validation. A substantial increase in accuracy was observed when the number of SNP increased from 10 to 50 but thereafter the increase was smaller, reaching the highest accuracy (0.94) with 100 surrounding SNP. Thus, we conclude that the genotype of a deletion type CNV and its putative QTL effect can be predicted with a maximum accuracy of 0.94 from surrounding SNP. This high prediction accuracy suggests that genetic variation due to simple deletion CNV is well captured by dense SNP panels. Since genomic selection relies on the availability of a dense marker panel with markers in close linkage disequilibrium to the QTL in order to predict their genetic values, we also discuss opportunities for genomic selection to predict the effects of CNV by dense SNP panels, when CNV cause variation in quantitative traits.
eBiomics (www.ebiomics.org) is an e-learning interactive platform in bioinformatics intended as a support to postgraduate students and scientists involved in-omics or systems biology projects. The heterogeneity and the multitude of bioinformatics resources (databases or tools) make their individual use and appreciation difficult though difficulties can be overcome through indexing examples of use in specific contexts. eBiomics is presented as a didactic guide for an extensive range of on-line databases and tools commonly referred to in-omics applications. In its self-training version, it was developed with a free content management system and contains light textual information complemented with numerous screenshots and visual information in a user-friendly environment. A collection of protocols and case studies is indexed for finding representative and practical usages of each of the resources described. Most references to on-line databases and tools are accessible in real time through cross-links. The content covers main-omics related topics. Such a flexible tool allows keeping up with the constant evolution imposed by frequent new releases of databases, upgrades of on-line software and regular changes of query interfaces, which usually precludes from publishing helpful tutorials in books and manuals (otherwise rapidly becoming obsolete). In addition, it offers access to a self-testing section. A complementing sister website focused on biomedical academic studies includes part of the self-training content in a different environment.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.