2022
DOI: 10.1111/tpj.15905
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised and semi‐supervised learning: the next frontier in machine learning for plant systems biology

Abstract: Advances in high-throughput omics technologies are leading plant biology research into the era of big data. Machine learning (ML) performs an important role in plant systems biology because of its excellent performance and wide application in the analysis of big data. However, to achieve ideal performance, supervised ML algorithms require large numbers of labeled samples as training data. In some cases, it is impossible or prohibitively expensive to obtain enough labeled training data; here, the paradigms of u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 48 publications
(17 citation statements)
references
References 115 publications
0
17
0
Order By: Relevance
“…Ensemble classifiers enhance decision-making by combining outputs from different models, introduced by Dietterich (2000) . Clustering-based methods, like the k-means algorithm, provide an unsupervised approach for predicting protein functions by exploiting direct and indirect interactions ( Hou, 2017 ; Yan and Wang, 2022 ).…”
Section: Basics Of Ai Techniquesmentioning
confidence: 99%
“…Ensemble classifiers enhance decision-making by combining outputs from different models, introduced by Dietterich (2000) . Clustering-based methods, like the k-means algorithm, provide an unsupervised approach for predicting protein functions by exploiting direct and indirect interactions ( Hou, 2017 ; Yan and Wang, 2022 ).…”
Section: Basics Of Ai Techniquesmentioning
confidence: 99%
“…Finally, there is the need to consider how meta‐learning techniques can be used to assist in micro‐level research (Mason et al., 2021). Genomic and proteomic data at multiple levels within plant systems are characterized by high latitude, redundancy, noise, uncertainty, and multiple sources (Yan & Wang, 2022), making them quite difficult to study. Some studies have attempted to use unsupervised and semi‐supervised learning to assist in the exploration and integration of knowledge with decent results.…”
Section: Prospects and Challengesmentioning
confidence: 99%
“…Various solutions have been proposed to address the issue of limited training data based on concerns that models trained on small datasets are vulnerable to overfitting. First, although the large majority of ecological applications use supervised learning, the development of unsupervised and self-supervised algorithms that circumvent the need for extensive labelled training data is an active area of research (for example, Yan and Wang 2022).…”
Section: Challenges For DL In Ecosystem Ecologymentioning
confidence: 99%