Integrated omics: tools, advances and future approaches

Misra, Biswapriya B.; Langefeld, Carl D.; Olivier, Michael; Cox, Laura A.

doi:10.1530/jme-18-0055

Cited by 392 publications

(267 citation statements)

References 162 publications

Supporting

Mentioning: 262

Contrasting

Unclassified

“…In the early-integration approach, also known as juxtaposition-based, the multi-omics datasets are first concatenated into one matrix. To deal with the high-dimensionality of the joint dataset, these methods generally adopt matrix factorization (68,53,55,52), statistical (46,69,70,59,57,44,71,72,73,55), and machine learning tools (74,73,55). Although the dimensionality reduction procedure is necessary and may improve the predictive performance, it can also cause the loss of key information (66).…”

Section: Background and Related Work (mentioning)

confidence: 99%
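The early-integration idea in the statement above can be sketched in a few lines. This is a minimal illustration, not the method of any cited paper: the helper names `juxtapose` and `top_variance_columns` and the toy values are assumptions, and a plain variance filter stands in for the matrix-factorization, statistical, or machine-learning reduction steps the statement mentions.

```python
from statistics import pvariance


def juxtapose(layers):
    """Concatenate per-sample feature vectors from several omics layers.

    Each layer is a list of rows (one row per sample, same sample order).
    """
    n_samples = len(layers[0])
    if any(len(layer) != n_samples for layer in layers):
        raise ValueError("all layers must describe the same samples")
    return [
        [value for layer in layers for value in layer[i]]
        for i in range(n_samples)
    ]


def top_variance_columns(matrix, k):
    """Return the indices of the k highest-variance columns, in column order."""
    variances = [pvariance(column) for column in zip(*matrix)]
    ranked = sorted(range(len(variances)), key=lambda j: variances[j], reverse=True)
    return sorted(ranked[:k])


# Toy data (hypothetical): 3 samples, 2 expression features, 1 methylation feature.
expression = [[1.0, 5.0], [2.0, 5.1], [3.0, 4.9]]
methylation = [[0.2], [0.8], [0.5]]

joint = juxtapose([expression, methylation])   # one 3 x 3 joint matrix
keep = top_variance_columns(joint, k=2)        # crude dimensionality reduction
reduced = [[row[j] for j in keep] for row in joint]
```

In this toy case the filter discards the second expression feature because it barely varies. If that feature carried signal, it would be lost, which is the trade-off the statement warns about.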

Integrative Network Fusion: a multi-omics approach in molecular profiling

Chierici, Bussola, Marcolini, et al. 2020. Preprint.

Recent technological advances and international efforts, such as The Cancer Genome Atlas (TCGA), have made available several pan-cancer datasets encompassing multiple omics layers with detailed clinical information in large collection of samples. The need has thus arisen for the development of computational methods aimed at improving cancer subtyping and biomarker identification from multi-modal data. Here we apply the Integrative Network Fusion (INF) pipeline, which combines multiple omics layers exploiting Similarity Network Fusion (SNF) within a machine learning predictive framework. INF includes a feature ranking scheme (rSNF) on SNF-integrated features, used by a classifier over juxtaposed multi-omics features (juXT). In particular, we show instances of INF implementing Random Forest (RF) and linear Support Vector Machine (LSVM) as the classifier, and two baseline RF and LSVM models are also trained on juXT. A compact RF model, called rSNFi, trained on the intersection of top-ranked biomarkers from the two approaches juXT and rSNF is finally derived. All the classifiers are run in a 10x5-fold crossvalidation schema to warrant reproducibility, following the guidelines for an unbiased Data Analysis Plan by the US FDA-led initiatives MAQC/SEQC. INF is demonstrated on four classification tasks on three multi-modal TCGA oncogenomics datasets. Gene expression, protein abundances and copy number variants are used to predict estrogen receptor status (BRCA-ER, N=381) and breast invasive carcinoma subtypes (BRCA-subtypes, N=305), while gene expression, miRNA expression and methylation data is used as predictor layers for acute myeloid leukemia and renal clear cell carcinoma survival (AML-OS, N=157; KIRC-OS, N=181). In test, INF achieved similar Matthews Correlation Coefficient (MCC) values and 97% to 83% smaller feature sizes (FS), compared with juXT for BRCA-ER (MCC: 0.83 vs 0.80; FS: 56 vs 1801) and BRCA-subtypes (0.84 vs 0.80; 302 vs 1801), improving KIRC-OS performance (0.38 vs 0.31; 111 vs 2319). INF predictions are generally more accurate in test than one-dimensional omics models, with smaller signatures too, where transcriptomics consistently play the leading role. Overall, the INF framework effectively integrates multiple data levels in oncogenomics classification tasks, improving over the performance of single layers alone and naive juxtaposition, and provides compact signature sizes.
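The evaluation terms in the abstract above (a 10x5-fold cross-validation schema scored with MCC) can be illustrated with a small standard-library sketch. It assumes that "10x5-fold" means 10 repetitions of 5-fold cross-validation with a fresh shuffle each time. The function names, the seed, and the toy labels are hypothetical, the splits are not stratified, and nothing here reproduces the INF pipeline itself.

```python
import math
import random


def mcc(y_true, y_pred):
    """Matthews Correlation Coefficient for binary labels coded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention: return 0.0 when a row or column of the confusion matrix is empty.
    return (tp * tn - fp * fn) / denom if denom else 0.0


def repeated_kfold(n_samples, n_repeats=10, n_folds=5, seed=0):
    """Yield (train, test) index lists for n_repeats shuffled k-fold rounds."""
    rng = random.Random(seed)
    for _ in range(n_repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        for fold in range(n_folds):
            test = indices[fold::n_folds]          # every n_folds-th shuffled index
            held_out = set(test)
            train = [i for i in indices if i not in held_out]
            yield train, test


splits = list(repeated_kfold(n_samples=20))        # 10 x 5 = 50 train/test splits
score = mcc([1, 1, 0, 0], [1, 0, 0, 0])            # one missed positive
```

A real analysis would fit a classifier on each `train` split, predict on the matching `test` split, and summarize the 50 resulting MCC values, for example by their mean and a confidence interval.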

“…This is also unlike the case of genomics where tools and methods for both de novo and reference‐guided assembly and annotations are more or less standardized. More extensive discussions on the advantages and disadvantages of ‐omics methods and workflows toward an integrated ‐omics approach are discussed elsewhere. To this end, newer large‐scale approaches, such as fishing for protein‐binding metabolites either using in vitro techniques or in silico approaches, have shown tremendous potential for the capture of the metabolite counterparts of macromolecules (i.e., proteins, and nucleic acids) and have helped pinpoint metabolite binding sites on a proteome‐wide scale.…”

Section: Coverage and Comparability With Other‐omics For A Systems View (mentioning)

confidence: 99%

Challenges and Opportunities in Cancer Metabolomics

Kumar, Misra. 2019. Proteomics.

Challenges in metabolomics for a given spectrum of disease are more or less comparable, ranging from the accurate measurement of metabolite abundance, compound annotation, identification of unknown constituents, and interpretation of untargeted and analysis of high throughput targeted metabolomics data leading to the identification of biomarkers. However, metabolomics approaches in cancer studies specifically suffer from several additional challenges and require robust ways to sample the cells and tissues in order to tackle the constantly evolving cancer landscape. These constraints include, but are not limited to, discriminating the signals from given cell types and those that are cancer specific, discerning signals that are systemic and confounded, cell culture‐based challenges associated with cell line identities and media standardizations, the need to look beyond Warburg effects, citrate cycle, lactate metabolism, and identifying and developing technologies to precisely and effectively sample and profile the heterogeneous tumor environment. This review article discusses some of the current and pertinent hurdles in cancer metabolomics studies. In addition, it addresses some of the most recent and exciting developments in metabolomics that may address some of these issues. The aim of this article is to update the oncometabolomics research community about the challenges and potential solutions to these issues.

“…In particular omics-research (genomics, proteomics, metabolomics etc.) is leading the charge to the growth of Big data [6,7]. The challenges in omics-research are data cleaning, normalization, biomolecule identification, data dimensionality reduction, biological contextualization, statistical validation, data storage and handling, sharing and data archiving.…”

Section: Big Medic Data (mentioning)

confidence: 99%

“…Data analytics requirements include several tasks like those of data cleaning, normalization, biomolecule identification, data dimensionality reduction, biological contextualization, statistical validation, data storage and handling, sharing and data archiving. These tasks are required for the Big data in some of the omics datasets like genomics, transcriptomics, proteomics, metabolomics, metagenomics, phenomics [6].…”

Section: Big Medic Data (mentioning)

confidence: 99%

Information technologies of 21st century and their impact on the society

Yamin. 2019. Int. j. inf. tecnol.

Twenty first century has witnessed emergence of some ground breaking information technologies that have revolutionised our way of life. The revolution began late in 20th century with the arrival of internet in 1995, which has given rise to methods, tools and gadgets having astonishing applications in all academic disciplines and business sectors. In this article we shall provide a design of a 'spider robot' which may be used for efficient cleaning of deadly viruses. In addition, we shall examine some of the emerging technologies which are causing remarkable breakthroughs and improvements which were inconceivable earlier. In particular we shall look at the technologies and tools associated with the Internet of Things (IoT), Blockchain, Artificial Intelligence, Sensor Networks and Social Media. We shall analyse capabilities and business value of these technologies and tools. As we recognise, most technologies, after completing their commercial journey, are utilised by the business world in physical as well as in the virtual marketing environments. We shall also look at the social impact of some of these technologies and tools.