Yixuan Chen scite author profile

Identifying disease genes is crucial to the understanding of disease pathogenesis, and to the improvement of disease diagnosis and treatment. In recent years, many researchers have proposed approaches to prioritize candidate genes by considering the relationship of candidate genes and existing known disease genes, reflected in other data sources. In this paper, we propose an expandable framework for gene prioritization that can integrate multiple heterogeneous data sources by taking advantage of a unified graphic representation. Gene-gene relationships and gene-disease relationships are then defined based on the overall topology of each network using a diffusion kernel measure. These relationship measures are in turn normalized to derive an overall measure across all networks, which is utilized to rank all candidate genes. Based on the informativeness of available data sources with respect to each specific disease, we also propose an adaptive threshold score to select a small subset of candidate genes for further validation studies. We performed large scale cross-validation analysis on 110 disease families using three data sources. Results have shown that our approach consistently outperforms other two state of the art programs. A case study using Parkinson disease (PD) has identified four candidate genes (UBB, SEPT5, GPR37 and TH) that ranked higher than our adaptive threshold, all of which are involved in the PD pathway. In particular, a very recent study has observed a deletion of TH in a patient with PD, which supports the importance of the TH gene in PD pathogenesis. A web tool has been implemented to assist scientists in their genetic studies.

show abstract

Recent advances in seafood bioactive peptides and their potential for managing osteoporosis

Chen

Cheng

et al. 2020

Critical Reviews in Food Science and Nutrition

View full text Add to dashboard Cite

Dunaliella salina-derived peptide protects from bone loss: Isolation, purification and identification

Chen

Zheng

et al. 2021

LWT

View full text Add to dashboard Cite

Detecting Genome-Wide Haplotype Polymorphism by Combined Use of Mendelian Constraints and Local Population Structure

Li¹,

Chen²,

Li³

2009

View full text Add to dashboard Cite

Data from current gene-disease association studies motivate changes to existing haplotype inference methodologies. Many datasets are now comprised of both pedigree and population data so it is desirable to incorporate both sources of information when inferring haplotypes. The availability of high-density SNP data also makes it possible to determine and use the precise locations of recombination events. Our proposed method reconstructs haplotype structure on a genome-wide level by jointly using the information from the Mendelian law of inheritance and local population structure. The method combines in one framework new techniques of recombination event detection, maximum likelihood optimization of population haplotype diversity and our previous algorithm of zero-recombinant haplotype reconstruction. Experiments on both real and simulated datasets prove the efficiency and accuracy of our approach in reconstructing the haplotype structure. Our method makes it possible to reveal the haplotypic variation on a genome-wide level.

show abstract

Generation of Synthetic Data and Experimental Designs in Evaluating Interactions for Association Studies

Chen

2012

J. Bioinform. Comput. Biol.

View full text Add to dashboard Cite

Complex diseases, by definition, involve multiple factors, including gene-gene interactions and gene-environment interactions. Researchers commonly rely on simulated data to evaluate their approaches for detecting high-order interactions in disease gene mapping. A publicly available simulation program to generate samples involving complex genetic and environmental interactions is of great interest to the community. We have developed a software package named gs1.0, which has been widely used since its publication. In this article, we present an upgraded version gs2.0, which not only inherits its capacity to generate realistic genotype data but also provides great functionality and flexibility to simulate various interaction models. In addition to a standalone version, a user-friendly web server (http://cbc.case.edu/gs) has been set up to help users to build complex interaction models. Furthermore, by utilizing three three-locus models as an example, we have shown how realistic model parameters can be chosen in generating simulated data.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yixuan Chen

In Silico Gene Prioritization by Integrating Multiple Data Sources

Recent advances in seafood bioactive peptides and their potential for managing osteoporosis

Dunaliella salina-derived peptide protects from bone loss: Isolation, purification and identification

Detecting Genome-Wide Haplotype Polymorphism by Combined Use of Mendelian Constraints and Local Population Structure

Generation of Synthetic Data and Experimental Designs in Evaluating Interactions for Association Studies

Contact Info

Product

Resources

About