2022
DOI: 10.1101/2022.08.14.503901
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Envirome-Wide Associations Enhance Multi-Year Genome-Based Prediction of Historical Wheat Breeding Data

Abstract: Linking high-throughput environmental data (enviromics) into genomic prediction (GP) is a cost-effective strategy for increasing selection intensity under genotype-by-environment interactions (GxE). This study developed a data-driven approach based on Environment-Phenotype Associations (EPA) aimed at recycling important GxE information from historical breeding data. EPA was developed in two applications: (1) scanning a secondary source of genetic variation, weighted from the shared reaction norms of past-evalu… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 69 publications
0
2
0
Order By: Relevance
“…Preliminary analysis investigated performing feature selection using gradient boosting machine learning models (Friedman, 2002 ) to select the most relevant ECs for each trait but this was not found to increase within‐trial prediction accuracy. This was supported by Costa‐Neto et al ( 2022 ) who also advocated using all ECs in ERM for prediction of new genotypes in new years. Therefore, a non‐linear kernel was calculated using all ECs according to Costa‐Neto, Fritsche‐Neto, et al ( 2021 ) and Costa‐Neto, Galli, et al ( 2021 ): where is the scaled EC matrix, is the Euclidean Distance between each element of the EC matrix ( n environments × m ECs), is a scaling factor assumed as the mean value of the Euclidean distance matrix and is a bandwidth factor which was assumed to be 1 as default.…”
Section: Methodsmentioning
confidence: 79%
See 1 more Smart Citation
“…Preliminary analysis investigated performing feature selection using gradient boosting machine learning models (Friedman, 2002 ) to select the most relevant ECs for each trait but this was not found to increase within‐trial prediction accuracy. This was supported by Costa‐Neto et al ( 2022 ) who also advocated using all ECs in ERM for prediction of new genotypes in new years. Therefore, a non‐linear kernel was calculated using all ECs according to Costa‐Neto, Fritsche‐Neto, et al ( 2021 ) and Costa‐Neto, Galli, et al ( 2021 ): where is the scaled EC matrix, is the Euclidean Distance between each element of the EC matrix ( n environments × m ECs), is a scaling factor assumed as the mean value of the Euclidean distance matrix and is a bandwidth factor which was assumed to be 1 as default.…”
Section: Methodsmentioning
confidence: 79%
“…This was supported by Costa-Neto et al (2022) who also advocated using all ECs in ERM for prediction of new genotypes in new where w is the scaled EC matrix, ‖w − w � ‖ 2 is the Euclidean Distance between each element of the EC matrix (n environments × m ECs), is a scaling factor assumed as the mean value of the Euclidean distance matrix and h is a bandwidth factor which was assumed to be 1 as default. The ERM therefore had dimensions of n environments × n environments.…”
Section: Envirotypingandenviromic Relationship Matrixmentioning
confidence: 98%