BackgroundGenotype imputation can help reduce genotyping costs particularly for implementation of genomic selection. In applications entailing large populations, recovering the genotypes of untyped loci using information from reference individuals that were genotyped with a higher density panel is computationally challenging. Popular imputation methods are based upon the Hidden Markov model and have computational constraints due to an intensive sampling process. A fast, deterministic approach, which makes use of both family and population information, is presented here. All individuals are related and, therefore, share haplotypes which may differ in length and frequency based on their relationships. The method starts with family imputation if pedigree information is available, and then exploits close relationships by searching for long haplotype matches in the reference group using overlapping sliding windows. The search continues as the window size is shrunk in each chromosome sweep in order to capture more distant relationships.ResultsThe proposed method gave higher or similar imputation accuracy than Beagle and Impute2 in cattle data sets when all available information was used. When close relatives of target individuals were present in the reference group, the method resulted in higher accuracy compared to the other two methods even when the pedigree was not used. Rare variants were also imputed with higher accuracy. Finally, computing requirements were considerably lower than those of Beagle and Impute2. The presented method took 28 minutes to impute from 6 k to 50 k genotypes for 2,000 individuals with a reference size of 64,429 individuals.ConclusionsThe proposed method efficiently makes use of information from close and distant relatives for accurate genotype imputation. In addition to its high imputation accuracy, the method is fast, owing to its deterministic nature and, therefore, it can easily be used in large data sets where the use of other methods is impractical.
The objectives of this paper were to briefly review progress in the genetic evaluation of novel traits in Canada and the United States, assess methods to predict selection accuracy based on cow reference populations, and illustrate how the use of indicator traits could increase genomic selection accuracy. Traits reviewed are grouped into the following categories: udder health, hoof health, other health traits, feed efficiency and methane emissions, and other novel traits. The status of activities expected to lead to national genetic evaluations is indicated for each group of traits. For traits that are more difficult to measure or expensive to collect, such as individual feed intake or immune response, the development of a cow reference population is the most effective approach. Several deterministic methods can be used to predict the reliability of genomic evaluations based on cow reference population size, trait heritability, and other population parameters. To provide an empirical validation of those methods, predicted accuracies were compared with observed accuracies for several cow reference populations and traits. Reference populations of 2,000 to 20,000 cows were created through random sampling of genotyped Holstein cows in Canada and the United States. The effects of single nucleotide polymorphisms (SNP) were estimated from those cow records, after excluding the dams of validation bulls. Bulls that were first progeny tested in 2013 and 2014 were then used to carry out a validation and estimate the observed accuracy of genomic selection based on those SNP effects. Over the various cow population sizes and traits considered in the study, even the best prediction methods were found, on average, to either under-evaluate observed accuracy by 0.20 or over-evaluate it by 0.22, depending on the approach used to estimate the number of independently segregating chromosome segments. In some instances, differences between observed and predicted accuracies were as large as 0.47. Indicator traits can be very useful for the selection of novel traits. To illustrate this, protein yield, body weight, and mid-infrared data were used as indicator traits for feed efficiency. Using those traits in conjunction with 5,000 cow records for dry matter intake increased the reliability of genomic predictions for young animals from 0.20 to 0.50.
Johne's disease (or paratuberculosis), caused by Mycobacterium avium ssp. paratuberculosis (MAP) infection, is a globally prevalent disease with severe economic and welfare implications. With no effective treatment available, understanding the role of genetics influencing host infection status is essential to develop selection strategies to breed for increased resistance to MAP infection. The main objectives of this study were to estimate genetic parameters for the MAP-specific antibody response using milk ELISA scores in Canadian Holstein cattle as an indicator of resistance to Johne's disease, and to unravel genomic regions and candidate genes significantly associated with MAP infection. After data editing, 168,987 milk ELISA records from 2,306 herds, obtained from CanWest Dairy Herd Improvement, were used for further analyses. Variance and heritability estimates for MAP infection status were determined using univariate linear animal models under 3 scenarios: (a) SCEN1: the complete data set (all herds); (b) SCEN2: herds with at least one suspect or test-positive animal (ELISA optical density ≥0.07); and (c) SCEN3: herds with at least one test-positive animal (ELISA optical density ≥0.11). Heritability estimates were calculated as 0.066, 0.064, and 0.063 for SCEN1, SCEN2, and SCEN3, respectively. The correlations between estimated breeding values for resistance to MAP infection and other economically important traits, when significant, were favorable and of low magnitude. Genome-wide association analyses identified important genomic regions on Bos taurus autosome (BTA)1, BTA7, BTA9, BTA14, BTA15, BTA17, BTA19, and BTA25 showing significant association with MAP infection status. These regions included 2 single nucleotide polymorphisms located 2 kb upstream of positional candidate genes CD86 and WNT9B, which play key roles in host immune response and tissue homeostasis. This study revealed the genetic architecture of MAP infection in Canadian Holstein cattle as measured by milk ELISA scores by estimating genetic parameters along with the identification of genomic regions potentially influencing MAP infection status. These findings will be of significant value toward implementing genetic and genomic evaluations for resistance to MAP infection in Holstein cattle.
BackgroundGenome-wide profiling of single-nucleotide polymorphisms is receiving increasing attention as a method of pre-implantation genetic diagnosis in humans and of commercial genotyping of pre-transfer embryos in cattle. However, the very small quantity of genomic DNA in biopsy material from early embryos poses daunting technical challenges. A reliable whole-genome amplification (WGA) procedure would greatly facilitate the procedure.ResultsSeveral PCR-based and non-PCR based WGA technologies, namely multiple displacement amplification, quasi-random primed library synthesis followed by PCR, ligation-mediated PCR, and single-primer isothermal amplification were tested in combination with different DNA extractions protocols for various quantities of genomic DNA inputs. The efficiency of each method was evaluated by comparing the genotypes obtained from 15 cultured cells (representative of an embryonic biopsy) to unamplified reference gDNA. The gDNA input, gDNA extraction method and amplification technology were all found to be critical for successful genome-wide genotyping. The selected WGA platform was then tested on embryo biopsies (n = 226), comparing their results to that of biopsies collected after birth. Although WGA inevitably leads to a random loss of information and to the introduction of erroneous genotypes, following genomic imputation the resulting genetic index of both sources of DNA were highly correlated (r = 0.99, P<0.001).ConclusionIt is possible to generate high-quality DNA in sufficient quantities for successful genome-wide genotyping starting from an early embryo biopsy. However, imputation from parental and population genotypes is a requirement for completing and correcting genotypic data. Judicious selection of the WGA platform, careful handling of the samples and genomic imputation together, make it possible to perform extremely reliable genomic evaluations for pre-transfer embryos.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-889) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.