Maize genetic diversity has been used to understand the molecular basis of phenotypic variation and to improve agricultural efficiency and sustainability. We crossed 25 diverse inbred maize lines to the B73 reference line, capturing a total of 136,000 recombination events. Variation for recombination frequencies was observed among families, influenced by local (cis) genetic variation. We identified evidence for numerous minor single-locus effects but little two-locus linkage disequilibrium or segregation distortion, which indicated a limited role for genes with large effects and epistatic interactions on fitness. We observed excess residual heterozygosity in pericentromeric regions, which suggested that selection in inbred lines has been less efficient in these regions because of reduced recombination frequency. This implies that pericentromeric regions may contribute disproportionally to heterosis.
In ecological analyses of species and community distributions there is interest in the nature of their responses to environmental gradients and in identifying the most important environmental variables, which may be used for predicting patterns of biodiversity. Methods such as random forests already exist to assess predictor importance for individual species and to indicate where along gradients abundance changes. However, there is a need to extend these methods to whole assemblages, to establish where along the range of these gradients the important compositional changes occur, and to identify any important thresholds or change points. We develop such a method, called "gradient forest," which is an extension of the random forest approach. By synthesizing the cross-validated R2 and accuracy importance measures from univariate random forest analyses across multiple species, sampling devices, and surveys, gradient forest obtains a monotonic function of each predictor that represents the compositional turnover along the gradient of the predictor. When applied to a synthetic data set, the method correctly identified the important predictors and delineated where the compositional change points occurred along these gradients. Application of gradient forest to a real data set from part of the Great Barrier Reef identified mud fraction of the sediment as the most important predictor, with highest compositional turnover occurring at mud fraction values around 25%, and provided similar information for other predictors. Such refined information allows for more accurate capturing of biodiversity patterns for the purposes of bioregionalization, delineation of protected areas, or designing of biodiversity surveys.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.