2021
DOI: 10.1016/j.csbj.2021.07.021
|View full text |Cite
|
Sign up to set email alerts
|

A primer on machine learning techniques for genomic applications

Abstract: Graphical abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 21 publications
(8 citation statements)
references
References 77 publications
0
8
0
Order By: Relevance
“…To make the dataset more balanced, a data augmentation algorithm, SMOTE (Synthetic Minority Over-sampling Technique) 44 , was adopted. As a classification algorithm, random forest was chosen using the default parameters of the Scikit-learn python package 40 due to its efficiency in handling datasets with a high number of features 45 . The final output was obtained as the average of the three probability values and the associated class was obtained from the probability value by imposing the classic threshold of 0.5.…”
Section: Resultsmentioning
confidence: 99%
“…To make the dataset more balanced, a data augmentation algorithm, SMOTE (Synthetic Minority Over-sampling Technique) 44 , was adopted. As a classification algorithm, random forest was chosen using the default parameters of the Scikit-learn python package 40 due to its efficiency in handling datasets with a high number of features 45 . The final output was obtained as the average of the three probability values and the associated class was obtained from the probability value by imposing the classic threshold of 0.5.…”
Section: Resultsmentioning
confidence: 99%
“…As a classification algorithm, random forest was chosen using the default parameters of the Scikit-learn python package 46 due to its efficiency in handling datasets with a high number of features. 75 The final output was obtained as the average of the three probability values and the associated class was obtained from the probability value by imposing the classic threshold of 0.5. The prediction model achieved an AUROC of 0.64 and an AUPRC of 0.48 on the Dream Challenge validation dataset.…”
Section: Methodsmentioning
confidence: 99%
“…Throughout the chapter, there have been examples showcasing the use of ML in combination with IoT devices in the biotechnology Industry 5.0. These models are used for an array of solutions in the biotechnology industry, including predictive analysis for crop yield [53], classification of genome data [54], enhanced manufacturing of biotechnology products [55], management of biotechnology labs [56], and more. Like other components of Industry 5.0, the ML models are also vulnerable.…”
Section: The Role Of Mlsecops In the Biotechnology Industry 50mentioning
confidence: 99%