3051 Background: Orphan non-coding RNAs (oncRNAs) are a novel category of small RNAs (smRNAs) that are present in tumors and largely absent in healthy tissue. We investigated the utility of oncRNAs extracted from serum for early cancer detection across seven cancer types. Methods: We collected 2,882 serum samples from individuals with known cancers of the bladder ( n=152), breast (220), colon and rectum (141), kidney (283), lung (281), pancreas (287), and stomach (280) as well as donors with no history of cancer (1,238). We used 0.5 mL serum aliquots to generate and sequence smRNA libraries at an average depth of 20 million 50-bp single-end reads. Samples were split into age-, sex-, and smoking status-matched training (1,232 cancer; 922 control) and validation (412 cancer; 316 control) cohorts. A large catalog of oncRNAs specific to each cancer was created using tumor and adjacent normal samples from The Cancer Genome Atlas (TCGA) smRNA-seq database. Using TCGA-derived oncRNAs, we trained a machine learning model to predict cancer presence and tissue of origin (TOO) in a 5-fold cross validation setup using our training cohort. For the validation cohort, we averaged the predictions from the five training cohort models. Results: The model ROC-AUC for detecting cancer was 0.95 (95% CI: 0.94–0.95 for training and 0.94–0.97 for validation cohorts). Sensitivities for detecting cancer at 95% specificity were 0.74 (0.70–0.76) for early stage (I/II) and 0.80 (0.76–0.84) for late stage (III/IV) cancers in the training cohort, and 0.77 (0.71–0.81) and 0.81 (0.73–0.87) in the validation cohort. Sensitivities of detection for each cancer type are shown. For samples with cancer and TOO predictions, our top 1 and top 2 TOO accuracy was 0.76 (0.68–0.84) and 0.83 (0.76–0.90) for the validation set. Conclusions: These results demonstrate that oncRNAs detected in serum can be used for accurate, early detection, and localization of multiple cancers. [Table: see text]
e15651 Background: Colorectal cancer (CRC) is a highly heterogeneous disease with variable response to different treatment strategies. The consensus molecular subtype (CMS) classification system has been proposed as a gene expression-based framework aimed at capturing the genetic and molecular heterogeneity in CRCs and potentially stratifying patients into clinically relevant subgroups for treatment selection. We previously demonstrated that orphan non-coding RNAs (oncRNAs), a novel class of cancer enriched small non-coding RNAs (smRNA), exhibit cancer-specific expression. We also showed that oncRNAs can be used in a liquid biopsy assay for accurate early-stage detection of CRCs. We hypothesize that oncRNAs may be informative of CRC subtype-specific transcriptional signatures as defined by the CMS groups. Methods: To assess the effectiveness of oncRNAs for categorization of CMS group, we examined smRNA-seq data from The Cancer Genome Atlas (TCGA) for colon and rectal adenocarcinoma tumors ( n= 504), consisting of 77 CMS1, 216 CMS2, 71 CMS3, and 140 CMS4 tumors. CRC-specific oncRNA expression profiles were generated for all tumors in the TCGA cohort. We then trained a machine learning model on oncRNA expression profiles to predict CMS groups using a 5-fold cross validation setup. For our validation cohort, we applied our model to an independent CRC cohort ( n= 348) consisting of 56 CMS1, 101 CMS2, 59 CMS3, and 132 CMS4 CRC samples. Final CMS predictions for the validation cohort were made by averaging across the five training models. Results: Within TCGA testing folds, the model micro-averaged ROC-AUC for CMS classifications were 0.879 (95% CI: 0.86–0.90). In the validation cohort, micro-averaged AUC remained high at 0.915 (0.90–0.93). Sensitivities and AUCs for each subtype are provided in Table 1. The highest model-predicted likelihood for each CMS subtype had an accuracy of 73.6% (68.6%–78.1%), and the two most probable predictions had an accuracy of 91.1% (87.6%–93.9%). Conclusions: This study demonstrates the utility of oncRNA expression profiling to predict CMS for colorectal cancer. The generalizability of our machine learning model suggests that oncRNAs are informative of the biological differences across distinct CMS groups and may be used as a proxy for gene expression signatures to stratify colorectal cancers. Because smRNAs are more likely to be secreted into blood, we posit that an oncRNA-based liquid biopsy may open new opportunities to monitor patients and track changes in their tumor biology in real-time throughout their course of disease and treatment. [Table: see text]
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.