We recently described a methodology that reliably predicted chemotherapeutic response in multiple independent clinical trials. The method worked by building statistical models from gene expression and drug sensitivity data in a very large panel of cancer cell lines, then applying these models to gene expression data from primary tumor biopsies. Here, to facilitate the development and adoption of this methodology we have created an R package called pRRophetic. This also extends the previously described pipeline, allowing prediction of clinical drug response for many cancer drugs in a user-friendly R environment. We have developed several other important use cases; as an example, we have shown that prediction of bortezomib sensitivity in multiple myeloma may be improved by training models on a large set of neoplastic hematological cell lines. We have also shown that the package facilitates model development and prediction using several different classes of data.
We demonstrate a method for the prediction of chemotherapeutic response in patients using only before-treatment baseline tumor gene expression data. First, we fitted models for whole-genome gene expression against drug sensitivity in a large panel of cell lines, using a method that allows every gene to influence the prediction. Following data homogenization and filtering, these models were applied to baseline expression levels from primary tumor biopsies, yielding an in vivo drug sensitivity prediction. We validated this approach in three independent clinical trial datasets, and obtained predictions equally good, or better than, gene signatures derived directly from clinical data.
Cell line drug screening datasets can be utilized for a range of different drug discovery applications from drug biomarker discovery to building translational models of drug response. Previously, we described three separate methodologies to (1) correct for general levels of drug sensitivity to enable drug-specific biomarker discovery, (2) predict clinical drug response in patients and (3) associate these predictions with clinical features to perform in vivo drug biomarker discovery. Here, we unite and update these methodologies into one R package (oncoPredict) to facilitate the development and adoption of these tools. This new OncoPredict R package can be applied to various in vitro and in vivo contexts for drug and biomarker discovery.
Large interindividual variance has been observed in sensitivity to drugs. To comprehensively decipher the genetic contribution to these variations in drug susceptibility, we present a genome-wide model using human lymphoblastoid cell lines from the International HapMap consortium, of which extensive genotypic information is available, to identify genetic variants that contribute to chemotherapeutic agent-induced cytotoxicity. Our model inte- HapMap ͉ pharmacogenomics ͉ toxicity ͉ whole-genome association
Gene expression is a complex quantitative trait partially regulated by genetic variation in DNA sequence. Population differences in gene expression could contribute to some of the observed differences in susceptibility to common diseases and response to drug treatments. We characterized gene expression in the full set of HapMap lymphoblastoid cell lines derived from individuals of European and African ancestry for 9156 transcript clusters (gene-level) evaluated with the Affymetrix GeneChip Human Exon 1.0 ST Array. Gene expression was found to differ significantly between these samples for 383 transcript clusters. Biological processes including ribosome biogenesis and antimicrobial humoral response were found to be enriched in these differential genes, suggesting their possible roles in contributing to the population differences at a higher level than that of mRNA expression and in response to environmental information. Genome-wide association studies for local or distant genetic variants that correlate with the differentially expressed genes enabled identification of significant associations with one or more single-nucleotide polymorphisms (SNPs), consistent with the hypothesis that genetic factors and not simply population identity or other characteristics (age of cell lines, length of culture, etc.) contribute to differences in gene expression in these samples. Our results provide a comprehensive view of the genes differentially expressed between populations and the enriched biological processes involved in these genes. We also provide an evaluation of the contributions of genetic variation and nongenetic factors to the population differences in gene expression.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.