BackgroundCurrently, genomic prediction in cattle is largely based on panels of about 54k single nucleotide polymorphisms (SNPs). However with the decreasing costs of and current advances in next-generation sequencing technologies, whole-genome sequence (WGS) data on large numbers of individuals is within reach. Availability of such data provides new opportunities for genomic selection, which need to be explored.MethodsThis simulation study investigated how much predictive ability is gained by using WGS data under scenarios with QTL (quantitative trait loci) densities ranging from 45 to 132 QTL/Morgan and heritabilities ranging from 0.07 to 0.30, compared to different SNP densities, with emphasis on divergent dairy cattle breeds with small populations. The relative performances of best linear unbiased prediction (SNP-BLUP) and of a variable selection method with a mixture of two normal distributions (MixP) were also evaluated. Genomic predictions were based on within-population, across-population, and multi-breed reference populations.ResultsThe use of WGS data for within-population predictions resulted in small to large increases in accuracy for low to moderately heritable traits. Depending on heritability of the trait, and on SNP and QTL densities, accuracy increased by up to 31 %. The advantage of WGS data was more pronounced (7 to 92 % increase in accuracy depending on trait heritability, SNP and QTL densities, and time of divergence between populations) with a combined reference population and when using MixP. While MixP outperformed SNP-BLUP at 45 QTL/Morgan, SNP-BLUP was as good as MixP when QTL density increased to 132 QTL/Morgan.ConclusionsOur results show that, genomic predictions in numerically small cattle populations would benefit from a combination of WGS data, a multi-breed reference population, and a variable selection method.
Genome-wide breeding value (GWEBV) estimation methods can be classified based on the prior distribution assumptions of marker effects. Genome-wide BLUP methods assume a normal prior distribution for all markers with a constant variance, and are computationally fast. In Bayesian methods, more flexible prior distributions of SNP effects are applied that allow for very large SNP effects although most are small or even zero, but these prior distributions are often also computationally demanding as they rely on Monte Carlo Markov chain sampling. In this study, we adopted the Pareto principle to weight available marker loci, i.e., we consider that x% of the loci explain (100 - x)% of the total genetic variance. Assuming this principle, it is also possible to define the variances of the prior distribution of the 'big' and 'small' SNP. The relatively few large SNP explain a large proportion of the genetic variance and the majority of the SNP show small effects and explain a minor proportion of the genetic variance. We name this method MixP, where the prior distribution is a mixture of two normal distributions, i.e. one with a big variance and one with a small variance. Simulation results, using a real Norwegian Red cattle pedigree, show that MixP is at least as accurate as the other methods in all studied cases. This method also reduces the hyper-parameters of the prior distribution from 2 (proportion and variance of SNP with big effects) to 1 (proportion of SNP with big effects), assuming the overall genetic variance is known. The mixture of normal distribution prior made it possible to solve the equations iteratively, which greatly reduced computation loads by two orders of magnitude. In the era of marker density reaching million(s) and whole-genome sequence data, MixP provides a computationally feasible Bayesian method of analysis.
BackgroundGenomic prediction is based on the accurate estimation of the genomic relationships among and between training animals and selection candidates in order to obtain accurate estimates of the genomic estimated breeding values (GEBV). Various methods have been used to predict GEBV based on population-wide linkage disequilibrium relationships (GIBS) or sometimes on linkage analysis relationships (GLA). Here, we propose a novel method to predict GEBV based on a genomic relationship matrix using runs of homozygosity (GROH). Runs of homozygosity were used to derive probabilities of multi-locus identity by descent chromosome segments. The accuracy and bias of the prediction of GEBV using GROH were compared to those using GIBS and GLA. Comparisons were performed using simulated datasets derived from a random pedigree and a real pedigree of Italian Brown Swiss bulls. The comparison of accuracies of GEBV was also performed on data from 1086 Italian Brown Swiss dairy cattle.ResultsSimulations with various thresholds of minor allele frequency for markers and quantitative trait loci showed that GROH achieved consistently more accurate GEBV (0 to 4% points higher) than GIBS and GLA. The bias of GEBV prediction for simulated data was higher based on the real pedigree than based on a random pedigree. In the analyses with real data, GROH and GLA had similar accuracies. However, GLA achieved a higher accuracy when the prediction was done on the youngest animals. The GIBS matrices calculated with and without standardized marker genotypes resulted in similar accuracies.ConclusionsThe present study proposes GROH as a novel method to estimate genomic relationship matrices and predict GEBV based on runs of homozygosity and shows that it can result in higher or similar accuracies of GEBV prediction than GLA, except for the real data analysis with validation of young animals. Compared to GIBS, GROH resulted in more accurate GEBV predictions.
Background: Edwardsiella tarda causes acute symptoms with ascites in Japanese flounder (Paralichthys olivaceus) and is a major problem for China's aquaculture sector. Genomic selection (GS) has been widely adopted in breeding industries because it shortens generation intervals and results in the selection of individuals that have great breeding potential with high accuracy. Based on an artificial challenge test and re-sequenced data of 1099 flounders, the aims of this study were to estimate the genetic parameters of resistance to E. tarda in Japanese flounder and to evaluate the accuracy of single-step GBLUP (ssGBLUP), weighted ssGBLUP (WssGBLUP), and BayesB for improving resistance to E. tarda by using three subsets of pre-selected single nucleotide polymorphisms (SNPs). In addition, SNPs that are associated with this trait were identified using a single-SNP genome-wide association study (GWAS) and WssGBLUP. Results: We estimated a heritability of 0.13 ± 0.02 for resistance to E. tarda in Japanese flounder. One million SNPs at fixed intervals were selected from 4,978,724 SNPs that passed quality controls. GWAS identified significant SNPs on chromosomes 14 and 24. WssGBLUP revealed that the putative quantitative trait loci on chromosomes 1 and 14 contained SNPs that explained more than 1% of the genetic variance. Three 50 k-SNP subsets were pre-selected based on different criteria. Compared with pedigree-based prediction (ABLUP), the three genomic methods evaluated resulted in at least 7.7% greater accuracy of predictions. The accuracy of these genomic prediction methods was almost unchanged when pre-selected trait-related SNPs were used for prediction. Conclusions: Resistance to E. tarda in Japanese flounder has a low heritability. GWAS and WssGBLUP revealed that the genetic architecture of this trait is polygenic. Genomic prediction of breeding values performed better than ABLUP. It is feasible to implement genomic selection to increase resistance to E. tarda in Japanese flounder with 50 k SNPs. Based on the criteria used here, pre-selection of SNPs was not beneficial and other criteria for pre-selection should be considered.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.