SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding.
BackgroundIn sexually reproducing organisms, meiotic crossovers ensure the proper segregation of chromosomes and contribute to genetic diversity by shuffling allelic combinations. Such genetic reassortment is exploited in breeding to combine favorable alleles, and in genetic research to identify genetic factors underlying traits of interest via linkage or association-based approaches. Crossover numbers and distributions along chromosomes vary between species, but little is known about their intraspecies variation.ResultsHere, we report on the variation of recombination rates between 22 European maize inbred lines that belong to the Dent and Flint gene pools. We genotype 23 doubled-haploid populations derived from crosses between these lines with a 50 k-SNP array and construct high-density genetic maps, showing good correspondence with the maize B73 genome sequence assembly. By aligning each genetic map to the B73 sequence, we obtain the recombination rates along chromosomes specific to each population. We identify significant differences in recombination rates at the genome-wide, chromosome, and intrachromosomal levels between populations, as well as significant variation for genome-wide recombination rates among maize lines. Crossover interference analysis using a two-pathway modeling framework reveals a negative association between recombination rate and interference strength.ConclusionsTo our knowledge, the present work provides the most comprehensive study on intraspecific variation of recombination rates and crossover interference strength in eukaryotes. Differences found in recombination rates will allow for selection of high or low recombining lines in crossing programs. Our methodology should pave the way for precise identification of genes controlling recombination rates in maize and other organisms.
The efficiency of marker-assisted prediction of phenotypes has been studied intensively for different types of plant breeding populations. However, one remaining question is how to incorporate and counterbalance information from biparental and multiparental populations into model training for genome-wide prediction. To address this question, we evaluated testcross performance of 1652 doubled-haploid maize (Zea mays L.) lines that were genotyped with 56,110 single nucleotide polymorphism markers and phenotyped for five agronomic traits in four to six European environments. The lines are arranged in two diverse half-sib panels representing two major European heterotic germplasm pools. The data set contains 10 related biparental dent families and 11 related biparental flint families generated from crosses of maize lines important for European maize breeding. With this new data set we analyzed genome-based best linear unbiased prediction in different validation schemes and compositions of estimation and test sets. Further, we theoretically and empirically investigated marker linkage phases across multiparental populations. In general, predictive abilities similar to or higher than those within biparental families could be achieved by combining several half-sib families in the estimation set. For the majority of families, 375 half-sib lines in the estimation set were sufficient to reach the same predictive performance of biomass yield as an estimation set of 50 full-sib lines. In contrast, prediction across heterotic pools was not possible for most cases. Our findings are important for experimental design in genome-based prediction as they provide guidelines for the genetic structure and required sample size of data sets used for model training. IN the context of quantitative trait locus (QTL) mapping, multiparental populations have been suggested to be advantageous over biparental families due to their greater allelic diversity and the possibility of evaluating allelic effects in multiple genetic backgrounds (Muranty 1996;Xu 1998;Verhoeven et al. 2006). Especially if the multiparental population consists of several families connected by common parents, they can provide greater power of QTL detection and better resolution of QTL localization compared to individual families (Rebai and Goffinet 1993;Jannink and Jansen 2001;Blanc et al. 2006;Yu et al. 2008;Bardol et al. 2013;Mackay et al. 2014). In the context of genome-based prediction (Meuwissen et al. 2001), accuracies achieved within large biparental families are assumed to be the maximum that can be obtained with a given sample size (Crossa et al. 2014), because of medium allele frequencies, absence of genetic substructure, and equal linkage phases between markers and functional polymorphisms. However, prediction accuracies of newly generated progenies from different crosses will be poor. This is especially true if the respective germplasm exhibits broad allelic diversity and is unrelated to the biparental family from which single nucleotide polymorphism (...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.