Functional enrichment analysis is a fundamental and challenging task in bioinformatics. Most of the current enrichment analysis approaches individually evaluate functional terms and often output a list of enriched terms with high similarity and redundancy, which makes it difficult for downstream studies to extract the underlying biological interpretation. In this paper, we proposed a novel framework to assess the performance of combination-based enrichment analysis. Using this framework, we formulated the enrichment analysis as a multi-objective combinatorial optimization problem and developed the CEA (Combination-based Enrichment Analysis) method. CEA provides the whole landscape of term combinations; therefore, it is a good benchmark for evaluating the current state-of-the-art combination-based functional enrichment methods in a comprehensive manner. We tested the effectiveness of CEA on four published microarray datasets. Enriched functional terms identified by CEA not only involve crucial biological processes of related diseases, but also have much less redundancy and can serve as a preferable representation for the enriched terms found by traditional single-term-based methods. CEA has been implemented in the R package CopTea and is available at http://github.com/wulingyun/CopTea/.
BackgroundHigh-throughput experimental techniques have been dramatically improved and widely applied in the past decades. However, biological interpretation of the high-throughput experimental results, such as differential expression gene sets derived from microarray or RNA-seq experiments, is still a challenging task. Gene Ontology (GO) is commonly used in the functional enrichment studies. The GO terms identified via current functional enrichment analysis tools often contain direct parent or descendant terms in the GO hierarchical structure. Highly redundant terms make users difficult to analyze the underlying biological processes.ResultsIn this paper, a novel network-based probabilistic generative model, NetGen, was proposed to perform the functional enrichment analysis. An additional protein-protein interaction (PPI) network was explicitly used to assist the identification of significantly enriched GO terms. NetGen achieved a superior performance than the existing methods in the simulation studies. The effectiveness of NetGen was explored further on four real datasets. Notably, several GO terms which were not directly linked with the active gene list for each disease were identified. These terms were closely related to the corresponding diseases when accessed to the curated literatures. NetGen has been implemented in the R package CopTea publicly available at GitHub (http://github.com/wulingyun/CopTea/).ConclusionOur procedure leads to a more reasonable and interpretable result of the functional enrichment analysis. As a novel term combination-based functional enrichment analysis method, NetGen is complementary to current individual term-based methods, and can help to explore the underlying pathogenesis of complex diseases.Electronic supplementary materialThe online version of this article (doi:10.1186/s12918-017-0456-7) contains supplementary material, which is available to authorized users.
In order to meet the needs of intelligent operation and maintenance of distribution networks, a large number of wireless sensors are deployed inside and outside the power grid for digital sensing and immediate control of the grid. However, once these wireless sensors fail, they will generate erroneous data, which affects the assessment of grid security and brings great hindrance to the automation of power system operation and maintenance. Therefore, we propose an AOA and fingerprint recognition-based sensing node location method to achieve accurate positioning of wireless sensing nodes to facilitate rapid troubleshooting. The method first enhances and denoises the signal characteristics of wireless sensing nodes to achieve pre-processing of wireless sensing fingerprint information and solve the problem of fingerprint feature recognition of data samples. After that we combine the node fingerprint information through deep learning models to achieve device type recognition of wireless sensing nodes. Finally, we design the AOA node localisation method, which uses the AOA coordinate system measurement model to perform distance measurement and coordinate conversion on the fingerprint features of wireless sensing nodes to achieve accurate and fast localisation of wireless sensing devices. The experiments prove that the method has a high accuracy rate in the identification and location of wireless sensing node devices, which can effectively improve the efficiency of grid automation operation and accurate troubleshooting.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.