Background Cancer researchers use cell lines, patient-derived xenografts, engineered mice, and tumoroids as models to investigate tumor biology and to identify therapies. The generalizability and power of a model derive from the fidelity with which it represents the tumor type under investigation; however, the extent to which this is true is often unclear. The preponderance of models and the ability to readily generate new ones has created a demand for tools that can measure the extent and ways in which cancer models resemble or diverge from native tumors. Methods We developed a machine learning-based computational tool, CancerCellNet, that measures the similarity of cancer models to 22 naturally occurring tumor types and 36 subtypes, in a platform and species agnostic manner. We applied this tool to 657 cancer cell lines, 415 patient-derived xenografts, 26 distinct genetically engineered mouse models, and 131 tumoroids. We validated CancerCellNet by application to independent data, and we tested several predictions with immunofluorescence. Results We have documented the cancer models with the greatest transcriptional fidelity to natural tumors, we have identified cancers underserved by adequate models, and we have found models with annotations that do not match their classification. By comparing models across modalities, we report that, on average, genetically engineered mice and tumoroids have higher transcriptional fidelity than patient-derived xenografts and cell lines in four out of five tumor types. However, several patient-derived xenografts and tumoroids have classification scores that are on par with native tumors, highlighting both their potential as faithful model classes and their heterogeneity. Conclusions CancerCellNet enables the rapid assessment of transcriptional fidelity of tumor models. We have made CancerCellNet available as a freely downloadable R package (https://github.com/pcahan1/cancerCellNet) and as a web application (http://www.cahanlab.org/resources/cancerCellNet_web) that can be applied to new cancer models that allows for direct comparison to the cancer models evaluated here.
33 34 35 * These authors made equal contributions. 36 37 38 Correspondence to: patrick.cahan@jhmi.edu 39 40 Article type: Analysis 41 42 Website: http://www.cahanlab.org/resources/cancerCellNet_web 43 44 Code: https://github.com/pcahan1/cancerCellNet 45 46 47 48 49 2 ABSTRACT 50 51Cancer researchers use cell lines, patient derived xenografts, and genetically engineered mice 52 as models to investigate tumor biology and to identify therapies. The generalizability and power 53 of a model derives from the fidelity with which it represents the tumor type of investigation, 54 however, the extent to which this is true is often unclear. The preponderance of models and the 55 ability to readily generate new ones has created a demand for tools that can measure the extent 56 and ways in which cancer models resemble or diverge from native tumors. Here, we present a 57 computational tool, CancerCellNet, that measures the similarity of cancer models to 22 naturally 58 occurring tumor types and 36 subtypes, in a platform and species agnostic manner. We applied 59 this tool to 657 cancer cell lines, 415 patient derived xenografts, and 26 distinct genetically 60 engineered mouse models, documenting the most faithful models, identifying cancers 61 underserved by adequate models, and finding models with annotations that do not match their 62 classification. By comparing models across modalities, we find that genetically engineered mice 63 have higher transcriptional fidelity than patient derived xenografts and cell lines in four out of 64 five tumor types. We have made CancerCellNet available as freely downloadable software and 65 as a web application that can be applied to new cancer models. 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81Models are widely used to investigate cancer biology and to identify potential therapeutics. 82Popular modeling modalities are cancer cell lines (CCLs) 1 , genetically engineered mouse 83 models (GEMMs) 2 , and patient derived xenografts (PDXs) 3 .These classes of models differ in the 84 types of questions that they are designed to address. CCLs are often used to address cell 85 intrinsic mechanistic questions 4 , GEMMs to chart progression of molecularly defined-disease 5 , 86 and PDXs to explore patient-specific response to therapy in a physiologically relevant context 6 . 87Models also differ in the extent to which the they represent specific aspects of a cancer type 7 . 88Even with this intra-and inter-class model variation, all models should represent the tumor type 89 or subtype under investigation, and not another type of tumor, and not a non-cancerous tissue. 90Therefore, cancer-models should be selected not only based on the specific biological question 91 but also based on the similarity of the model to the cancer type under investigation 8,9 . 92 Various methods have been proposed to determine the similarity of cancer models to 93 their intended subjects. Domcke et al devised a 'suitability score' as a metric of the molecular 94 similarity of CCLs to high grade serous ovarian carcinom...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.