Motivation: Although numerous methods have been developed to better capture biological information from microarray data, commonly used single gene-based methods neglect interactions among genes and leave room for other novel approaches. For example, most classification and regression methods for microarray data are based on the whole set of genes and have not made use of pathway information. Pathway-based analysis in microarray studies may lead to more informative and relevant knowledge for biological researchers.
Results: In this paper, we describe a pathway-based classification and regression method using Random Forests to analyze gene expression data. The proposed methods allow researchers to rank important pathways from externally available databases, discover important genes, find pathway-based outlying cases and make full use of a continuous outcome variable in the regression setting. We also compared Random Forests with other machine learning methods using several datasets and found that Random Forests classification error rates were either the lowest or the second-lowest. By combining pathway information and novel statistical methods, this procedure represents a promising computational strategy in dissecting pathways and can provide biological insight into the study of microarray data.
Availability: Source code written in R is available from
Contact: hongyu.zhao@yale.edu
Supplementary Information: Supplementary Data are available at
a b s t r a c tIn this paper, the fractional derivatives in the sense of modified Riemann-Liouville derivative and the first integral method are employed for constructing the exact solutions of nonlinear time-fractional partial differential equations. The power of this manageable method is presented by applying it to several examples. This approach can also be applied to other nonlinear fractional differential equations.
A retroviral vector system with high transfection efficiency into T lymphocytes and carrying a rapid screening label has been constructed, which establishes the foundation for basic and clinical studies on T lymphocytes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.