Abstract. Long non-coding RNAs (lncRNAs) emerged as key regulators of diverse roles during colorectal cancer (CRC) carcinogenesis, but their specific function still remains to be explored. The present study aimed to re-annotate the Affymetrix Human Exon 1.0 ST Array for defining differential lncRNAs in CRC. Their prognostic relevance was also developed for screening key regulators in CRC. The CRC datasets E-GEOD-31737, E-MATB-829, Affymetrix colon cancer dataset and E-GEOD-24550 were re-purposed for searching differential lncRNAs and exploring their association with overall survival (OS). The identified lncRNAs were validated in CRC tissues or cell lines. As a result, 462, 286 and 166 differential lncRNAs were identified, respectively, in three predictive datasets. Among them, 48 lncRNAs were commonly observed to exhibit differential expression in the three datasets. Notably, the overexpression of family with sequence similarity 83 member H (FAM83H)-antisense (AS) 1 (P=0.038) and VPS9 domain containing 1 (VPS9D1)-AS1 (P=0.020) indicated shorter OS time than lower expression. The overexpression of FAM83H-AS1 (P=0.033) and VPS9D1-AS1 (P=0.011) was validated in cancerous tissues. Thus, FAM83H-AS1 and VPS9D1-AS1 may potentially enhance carcinogenesis or may be developed as prognostic biomarkers for CRC. In conclusion, a total of 48 CRC-related lncRNAs were identified, the majority of which were confirmed to exhibit dysregulation. FAM83H-AS1 and VPS9D1-AS1 could have a potential use as prognostic biomarkers for CRC patients.
IntroductionColorectal cancer (CRC) is a malignant disease that originates from colorectal epithelial cells (1), and is one of the most commonly diagnosed malignancies in the world (2). Genetic events such as rare or high-penetrance variants in the CRC susceptibility genes and DNA mismatch repair genes have been demonstrated to be important in the etiology of both sporadic and familial CRC (3). However, the carcinogenic mechanisms can only be explained in <6% of all CRC cases (3). Therefore, there are still numerous genetic events associated with dysregulation or mutations in CRC patients that remain to be determined.Long non-coding RNAs (lncRNAs) are a class of transcripts longer than 200 nucleotides without coding potential to be translated into proteins (4). Numerous studies have revealed that lncRNAs were frequently dysregulated in various diseases and had multiple functions in a wide range of pathological processes, including apoptosis, proliferation and invasion-metastasis of malignant tumors (5,6). For instance, colorectal cancer associated transcript 2 was identified by Ling et al as a lncRNA mapping to 8q24 that promoted metastatic progression in CRC (7). Another lncRNA, homeobox transcript antisense intergenic RNA (HOTAIR) has been determined to exhibit higher levels in the plasma of CRC patients than in healthy controls, and its overexpression predicted unfavorable prognosis (8). The association between prognosis of CRC patients and expression of prostate cancer associated transcript 1...