2023
DOI: 10.1186/s13007-023-01035-9
|View full text |Cite
|
Sign up to set email alerts
|

Application of machine learning algorithms and feature selection in rapeseed (Brassica napus L.) breeding for seed yield

Abstract: Background Studying the relationships between rapeseed seed yield (SY) and its yield-related traits can assist rapeseed breeders in the efficient indirect selection of high-yielding varieties. However, since the conventional and linear methods cannot interpret the complicated relations between SY and other traits, employing advanced machine learning algorithms is inevitable. Our main goal was to find the best combination of machine learning algorithms and feature selection methods to maximize t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 50 publications
0
2
0
Order By: Relevance
“…Data normalization and scaling is an important preprocessing procedure to remove data nuisance and improve the efficiency of learning from data 18 . Moreover, when different variables recorded have varied nature of recording i.e., continuous and binary, more care needs to be adopted to remove over/less fitting of the machine learning algorithms 19 .…”
Section: Methodsmentioning
confidence: 99%
“…Data normalization and scaling is an important preprocessing procedure to remove data nuisance and improve the efficiency of learning from data 18 . Moreover, when different variables recorded have varied nature of recording i.e., continuous and binary, more care needs to be adopted to remove over/less fitting of the machine learning algorithms 19 .…”
Section: Methodsmentioning
confidence: 99%
“…Machine learning practitioners employ optimization to find optimal parameters for machine learning algorithms like k-means clustering, support vector machines (SVM), or neural networks, which can handle high-dimensional data for tasks such as identifying disease subtypes or predicting outcomes [ 49 , 50 ]. When building predictive models from high-dimensional data, regularization and model selection techniques like L1 regularization (Lasso) and L2 regularization (Ridge) use optimization to balance model complexity and accuracy [ 51 53 ]. Model selection methods also rely on optimization to choose the best model hyperparameters.…”
Section: Introductionmentioning
confidence: 99%