2020
DOI: 10.1101/2020.02.10.942623
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

corto: a lightweight R package for Gene Network Inference and Master Regulator Analysis

Abstract: AbstractMotivationGene Network Inference and Master Regulator Analysis (MRA) have been widely adopted to define specific transcriptional perturbations from gene expression signatures. Several tools exist to perform such analyses, but most require a computer cluster or large amounts of RAM to be executed.ResultsWe developed corto, a fast and lightweight R package to infer gene netwo… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 8 publications
(12 citation statements)
references
References 15 publications
0
12
0
Order By: Relevance
“…Coexpression-based networks were generated using the corto algorithm with default parameters, freely available on the CRAN repository of R packages [27], using the SARS-CoV-2/human interactome proteins derived from [26] and the largest human Lung RNA-Seq dataset available from the GTEX consortium, generated from 427 patients gene expression profiles [28]. In brief, corto calculates a coexpression network for each protein and then removes indirect interactions using Data Processing Inequality [29], which has been shown to provide a more robust readout of single protein abundance alone [30].…”
Section: Coexpression-based Lung Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…Coexpression-based networks were generated using the corto algorithm with default parameters, freely available on the CRAN repository of R packages [27], using the SARS-CoV-2/human interactome proteins derived from [26] and the largest human Lung RNA-Seq dataset available from the GTEX consortium, generated from 427 patients gene expression profiles [28]. In brief, corto calculates a coexpression network for each protein and then removes indirect interactions using Data Processing Inequality [29], which has been shown to provide a more robust readout of single protein abundance alone [30].…”
Section: Coexpression-based Lung Networkmentioning
confidence: 99%
“…Master Regulator Analyses were performed by comparing infected and mock samples in both MERS and SARS datasets separately with the corto algorithm [27] using default parameters and the coexpression network derived from human Lung samples. In brief, a gene-by-gene signature of viral-induced differential expression is generated, and a combined value for each coexpression network is generated by weighting every gene's likelihood in the network, providing a final Normalized Enrichment Score for each human/SARS-nCoV-2 interactome member, which is positive if the network is upregulated by the infection, and negative if it's downregulated, as in [36].…”
Section: Master Regulator Analysismentioning
confidence: 99%
“…The PR AUC values are low, as is generally the case in eukaryotic GRN inference (Chen and Mar, 2018). Our comparison with several established gene expression-based GRN approaches, such as PANDA (Glass et al, 2013), ARACNE (Margolin et al, 2006), and corto (Mercatelli et al, 2020), shows that these methods also result in low PR AUC values. In addition to the improved performance, ANANSE has another clear benefit.…”
Section: Discussionmentioning
confidence: 93%
“…As the previously described reference datasets contain regulatory interactions for all genes, regardless of cell type, we created high confidence cell type-specific reference data by filtering for TFs and genes that are expressed, using a stringent cutoff of TPM < 10. As comparison, we included three GRN inference methods that were used to create tissue-specific networks using GTEx expression data: PANDA (Glass et al, 2013), ARACNE (Margolin et al, 2006), and corto (Mercatelli et al, 2020).…”
Section: Resultsmentioning
confidence: 99%
“…We considered only missense variants affecting specific AAs in the protein sequence, for a total of 155 entries ( Supplementary File 2 ). Graph generation was performed with the R statistical software and the corto package v1.1.2 (Mercatelli, Lopez-Garcia et al., 2020 ).…”
Section: Methodsmentioning
confidence: 99%