More and more evidences demonstrate that the long non-coding RNAs (lncRNAs) play many key
roles in diverse biological processes. There is a critical need to annotate the functions
of increasing available lncRNAs. In this article, we try to apply a global network-based
strategy to tackle this issue for the first time. We develop a bi-colored network based
global function predictor, long non-coding RNA global function predictor
(‘lnc-GFP’), to predict probable functions for lncRNAs at large scale by
integrating gene expression data and protein interaction data. The performance of lnc-GFP
is evaluated on protein-coding and lncRNA genes. Cross-validation tests on protein-coding
genes with known function annotations indicate that our method can achieve a precision up
to 95%, with a suitable parameter setting. Among the 1713 lncRNAs in the bi-colored
network, the 1625 (94.9%) lncRNAs in the maximum connected component are all
functionally characterized. For the lncRNAs expressed in mouse embryo stem cells and
neuronal cells, the inferred putative functions by our method highly match those in the
known literature.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.