BackgroundAccurate genomic analyses are predicated upon access to accurate genotype input data. The objective of this study was to quantify the reproducibility of genotype data that are generated from the same genotype platform and from different genotyping platforms.MethodsGenotypes based on 51,121 single nucleotide polymorphisms (SNPs) for 84 animals that were each genotyped on Illumina and Affymetrix platforms and for another 25 animals that were each genotyped twice on the same Illumina platform were compared. Genotypes based on 11,323 SNPs for an additional 21 animals that were genotyped on two different Illumina platforms by two different service providers were also compared. Reproducibility of the results was measured as the correlation between allele counts and as genotype and allele concordance rates.ResultsA mean within-animal correlation of 0.9996 was found between allele counts in the 25 duplicate samples that were genotyped on the same Illumina platform and varied from 0.9963 to 1.0000 per animal. The mean (minimum, maximum) genotype and allele concordance rates per animal between the 25 duplicate samples were equal to 0.9996 (0.9968, 1.0000) and 0.9993 (0.9937, 1.0000), respectively. The concordance rate between the two different Illumina platforms was also near 1. A mean within-animal correlation of 0.9738 was found between genotypes that were generated on the Illumina and Affymetrix platforms and varied from 0.9505 to 0.9812 per animal. The mean (minimum, maximum) within-animal genotype and allele concordance rates between the Illumina and Affymetrix platforms were equal to 0.9711 (0.9418, 0.9798) and 0.9845 (0.9695, 0.9889), respectively. The genotype concordance rate across all genotypes increased from 0.9711 to 0.9949 when the SNPs used were restricted to those with three high-resolution genotype clusters which represented 75.2% of the called genotypes.Conclusions and implicationsOur results suggest that, regardless of the genotype platform or service provider, high genotype concordance rates are achieved especially if they are restricted to high-quality extracted DNA and SNPs that result in high-quality genotypes.
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. Highlights Economic values are widely used in the development of breeding objectives internationally. The economic value of a trait in a breeding objective can be defined as the change in profit value of a unit change in an individual trait, while keeping all other traits constant. A total of fourteen traits of economic importance representing maternal, lambing, production and health characteristics were calculated within a whole farm bioeconomic model. Results from this study will enable the implementation of new economic values within the national terminal and maternal Irish sheep breeding objectives which highlights the traits of importance for increasing overall farm profitability.
The objective of the present study was to quantify the extent of genetic variation in three health-related traits namely dagginess, lameness and mastitis, in an Irish sheep population. Each of the health traits investigated pose substantial welfare implications as well as considerable economic costs to producers. Data were also available on four body-related traits, namely body condition score (BCS), live weight, muscle depth and fat depth. Animals were categorised as lambs (<365 days old) or ewes (⩾365 days old) and were analysed both separately and combined. After edits, 39 315 records from 264 flocks between the years 2009 and 2015 inclusive were analysed. Variance components were estimated using animal linear mixed models. Fixed effects included contemporary group, represented as a three-way interaction between flock, date of inspection and animal type (i.e. lamb, yearling ewe (i.e. females ⩾365 days but <730 days old that have not yet had a recorded lambing) or ewe), animal breed proportion, coefficients of heterosis and recombination, animal gender (lambs only), animal parity (ewes only; lambs were assigned a separate 'parity') and the difference in age of the animal from the median of the respective parity/age group. An additive genetic effect and residual effect were both fitted as random terms with maternal genetic and non-genetic components also considered for traits of the lambs. The direct heritability of dagginess was similar across age groups (0.14 to 0.15), whereas the direct heritability of lameness ranged from 0.06 (ewes) to 0.12 (lambs). The direct heritability of mastitis was 0.04. For dagginess, 13% of the phenotypic variation was explained by dam litter, whereas the maternal heritability of dagginess was 0.05. The genetic correlation between ewe and lamb dagginess was 0.38; the correlation between ewe and lamb lameness was close to zero but was associated with a large standard error. Direct genetic correlations were evident between dagginess and BCS in ewes and between lameness and BCS in lambs. The present study has demonstrated that ample genetic variation exists for all three health traits investigated indicating that genetic improvement is indeed possible.
The objective of the present study was to quantify the accuracy of imputing medium-density single nucleotide polymorphism (SNP) genotypes from lower-density panels (384 to 12,000 SNPs) derived using alternative selection methods to select the most informative SNPs. Four different selection methods were used to select SNPs based on genomic characteristics (i.e., minor allele frequency (MAF) and linkage disequilibrium (LD)) within five sheep breeds (642 Belclare, 645 Charollais, 715 Suffolk, 440 Texel, and 620 Vendeen) separately. Selection methods evaluated included (i) random, (ii) splitting the genome into blocks of equal length and selecting SNPs within block based on MAF and LD patterns, (iii) equidistant location while optimizing MAF, (iv) a combination of MAF, distance from already selected SNPs, and weak LD with the SNP(s) already selected. All animals were genotyped on the Illumina OvineSNP50 Beadchip containing 51,135 SNPs of which 44,040 remained after edits. Within each breed separately, the youngest 100 animals were assumed to represent the validation population; the remaining animals represented the reference population. Imputation was undertaken under three different conditions: (i) SNPs were selected within a given breed and imputed for all breeds individually, (ii) all breeds were collectively used to select SNPs and were included as the reference population, and (iii) the SNPs were selected for each breed separately and imputation was undertaken for all breeds but excluding from the reference population, the breed from which the SNPs were selected. Regardless of SNP selection method, mean animal allele concordance rate improved at a diminishing rate while the variability in mean animal allele concordance rate reduced as the panel density increased. The SNP selection method impacted the accuracy of imputation although the effect reduced as the density of the panel increased. Overall, the most accurate SNP selection method for panels with <9,000 SNPs was that based on MAF and LD pattern within genomic blocks. The mean animal allele concordance rate varied from 0.89 in Texel to 0.97 in Vendeen. Greater imputation accuracy was achieved when SNPs were selected and imputed within each breed individually compared with when SNPs were selected across all breeds and imputed using a multi-breed reference population. In all, results indicate that accurate genotype imputation to medium density is achievable with low-density genotype panels with at least 6,000 SNPs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.