2021
DOI: 10.1038/s41467-021-21671-w
|View full text |Cite
|
Sign up to set email alerts
|

Improving gene function predictions using independent transcriptional components

Abstract: The interpretation of high throughput sequencing data is limited by our incomplete functional understanding of coding and non-coding transcripts. Reliably predicting the function of such transcripts can overcome this limitation. Here we report the use of a consensus independent component analysis and guilt-by-association approach to predict over 23,000 functional groups comprised of over 55,000 coding and non-coding transcripts using publicly available transcriptomic profiles. We show that, compared to using P… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
23
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 26 publications
(23 citation statements)
references
References 35 publications
0
23
0
Order By: Relevance
“…4 ). Co-functionality analysis of commonly upregulated genes using the GenetICA algorithm 30 (see Supplementary Methods for details) pointed at roles for these genes in ncRNA processing, DNA repair, and ribosome biogenesis ( Fig. 3D ).…”
Section: Resultsmentioning
confidence: 99%
“…4 ). Co-functionality analysis of commonly upregulated genes using the GenetICA algorithm 30 (see Supplementary Methods for details) pointed at roles for these genes in ncRNA processing, DNA repair, and ribosome biogenesis ( Fig. 3D ).…”
Section: Resultsmentioning
confidence: 99%
“…To investigate LPAR6 effects on the inhibition of breast cancer progression, we investigated its significantly correlated or co-expressed genes via in silico analysis. An alternative approach is the concept of “guilt-by-association” (GBA) which assumes that if two proteins interact or share expression patterns, their functions are more likely to be related [ 36 , 37 ]. To reduce the influence of confounding factors, we sorted samples according to the expression level of LPAR6 and selected the first 200 and the last 200 samples to constitute two groups: “high”-level group and “low”-level group both in TCGA and METABRIC datasets, respectively.…”
Section: Resultsmentioning
confidence: 99%
“…We compare TripletGO with two most recently developed gene function prediction approaches, i.e., GENETICA [7] and GeneNetwork [34], which are both based on expression profiles. Different from our work, these two approaches are designed at the term-centric level.…”
Section: Resultsmentioning
confidence: 99%
“…For example, according to official statistics in the neXtProt platform [5], nearly 2,000 protein-coding human genes have yet no known function; for many other organisms of biomedical or industrial importance, annotation rates are substantially lower. To fill the gap between sequence and function, it is urgent to develop efficient computational algorithms for function prediction [6, 7].…”
Section: Introductionmentioning
confidence: 99%