2023
DOI: 10.21203/rs.3.rs-2432013/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MODILM: Towards Better Complex Diseases Classification Using a Novel Multi-omics Data Integration Learning Model

Abstract: Background Complex disease classification is an important part of the complex disease diagnosis and personalized treatment process. It has been shown that the integration of multi-omics data can analyze and classify complex diseases more accurately, because multi-omics data are highly correlated with the onset and progression of various diseases and can provide comprehensive and complementary information about a disease. However, multi-omics data of complex diseases are usually characterized by high imbalance… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 47 publications
0
3
0
Order By: Relevance
“…Fourteen computational approaches are investigated, which include six classic classifiers employing early integration strategies: K-nearest neighbor classifier (KNN) [30], support vector machine (SVM) [31], Lasso [32], random forest (RF) [33], XGboost [34], and fully connected neural networks (NN) [35]. Furthermore, five multi-omics classifiers are analyzed: group-regularized ridge regression [36], BPLSDA for projecting data into latent structures with discriminant analysis [37], block PLSDA with additional sparse constraints (BSPLSDA) [37], concatenation of final representations (CF) [38] for late-stage multi-omics data, gated multimodal fusion (GMU) [39] that integrates intermediate representations, along with three advanced methods, Mogonet [19], MODILM [40], and Dynamic [18]. select and utilize the informative modalities, thus ensuring a more precise characterization of each subject.…”
Section: Resultsmentioning
confidence: 99%
“…Fourteen computational approaches are investigated, which include six classic classifiers employing early integration strategies: K-nearest neighbor classifier (KNN) [30], support vector machine (SVM) [31], Lasso [32], random forest (RF) [33], XGboost [34], and fully connected neural networks (NN) [35]. Furthermore, five multi-omics classifiers are analyzed: group-regularized ridge regression [36], BPLSDA for projecting data into latent structures with discriminant analysis [37], block PLSDA with additional sparse constraints (BSPLSDA) [37], concatenation of final representations (CF) [38] for late-stage multi-omics data, gated multimodal fusion (GMU) [39] that integrates intermediate representations, along with three advanced methods, Mogonet [19], MODILM [40], and Dynamic [18]. select and utilize the informative modalities, thus ensuring a more precise characterization of each subject.…”
Section: Resultsmentioning
confidence: 99%
“…Note that some literature studies using TCGA datasets for testing classification models [26,27,28,29,30] already exist. However, these studies typically restrict their analysis to a maximum of four TCGA datasets, without providing clear justification for their choices.…”
Section: Tcga Datasetsmentioning
confidence: 99%
“…In this context, a reductionist standpoint makes it challenging to perform differential diagnoses on pathologies with similar clinical characteristics. Recent advances in biological and medical data sources, such as ‐omics, have called into question an oversimplified view of complex biological processes that fails to account for the interconnected nature of various body systems 2–7 . Emerging technologies and the growing field of precision medicine are paving the way for a shift away from this traditional perspective.…”
Section: Introductionmentioning
confidence: 99%