2021
DOI: 10.1128/msphere.01245-20
|View full text |Cite
|
Sign up to set email alerts
|

DeORFanizing Candida albicans Genes using Coexpression

Abstract: Functional characterization of open reading frames in nonmodel organisms, such as the common opportunistic fungal pathogen Candida albicans, can be labor-intensive. To meet this challenge, we built a comprehensive and unbiased coexpression network for C. albicans, which we call CalCEN, from data collected from 853 RNA sequencing runs from 18 large-scale studies deposited in the NCBI Sequence Read Archive. Retrospectively, CalCEN is highly predictive of known gene function annotations and can be synergistically… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
27
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 17 publications
(27 citation statements)
references
References 59 publications
0
27
0
Order By: Relevance
“…In order to generate a comprehensive assessment of genes with evidence for essentiality in C. albicans , we first collected a set of functional genomic features for use in training a machine-learning model (Supplementary Data 1 ). These included features derived from a collection of gene expression datasets such as gene expression level (Transcripts Per Kilobase Million (TPM) median), gene expression variance, and degree of co-expression (number of partners in a co-expression network) 22 . It also encompassed a codon adaptation index (CAI), which measures the bias in codon usage across each gene 21 ; the number of SNPs per nucleotide for each gene across a set of sequenced C. albicans strains 23 ; the presence of an essential ortholog in S. cerevisiae 12 ; and the presence of a duplicated set of paralogs in S. cerevisiae that exhibited a synthetic sick/lethal genetic interaction 24 .…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…In order to generate a comprehensive assessment of genes with evidence for essentiality in C. albicans , we first collected a set of functional genomic features for use in training a machine-learning model (Supplementary Data 1 ). These included features derived from a collection of gene expression datasets such as gene expression level (Transcripts Per Kilobase Million (TPM) median), gene expression variance, and degree of co-expression (number of partners in a co-expression network) 22 . It also encompassed a codon adaptation index (CAI), which measures the bias in codon usage across each gene 21 ; the number of SNPs per nucleotide for each gene across a set of sequenced C. albicans strains 23 ; the presence of an essential ortholog in S. cerevisiae 12 ; and the presence of a duplicated set of paralogs in S. cerevisiae that exhibited a synthetic sick/lethal genetic interaction 24 .…”
Section: Resultsmentioning
confidence: 99%
“…We utilized co-expression clustering analysis to determine whether any biological processes are enriched among essential genes in the C. albicans genome. Using a non-linear dimensionality reduction, we clustered C. albicans genes based on co-expression 22 (Fig. 2c ) and used GO term enrichment of the genes in each cluster to assign a putative function.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…6 C), such as CSH1 , IFD6 , and ORF19.7214 ( Table S4 ). CSH1 and IFD6 have been predicted to participate in protein hydrolysis during cell-wall deconstruction ( O’Meara and O’Meara, 2021 ). ORF19.7214 is an uncharacterized gene in C. albicans , but its homologue in S. pombe , Exg3, encodes a glucan 1,6-β-glucosidase involved in cell-wall decomposition ( Dueñas-Santero et al, 2010 ).…”
Section: Discussionmentioning
confidence: 99%