Abstract. Breast cancer is one of the leading causes of mortality in females. A number of prognostic markers have been identified, including single genes, multi-gene signatures and network modules; however, the robustness of these prognostic markers is insufficient. Thus, the present study proposed a more robust method to identify breast cancer prognostic modules based on weighted protein-protein interaction networks, by integrating four sets of disease-associated expression profiles. Three identified prognostic modules were closely associated with prognosis-associated functions and survival time, as determined by Cox regression and Kaplan-Meier survival analyses. The robustness of these modules was verified with an independent profile from another platform. Genes from these modules may be useful as breast cancer prognostic markers. The prognostic modules could be used to determine the prognoses of patients with breast cancer and characterize patient recovery.
IntroductionBreast cancer is one of the leading causes of female mortality (1-3). Prognosis reflects the outlook and chance of recovery from breast cancer. It is therefore critical to acquire prognostic information that represents the physical condition of the patients. Understanding prognostic information assists in increasing the survival rate and prolongs the life expectancy of patients (4).With the advent of microarray and next-generation sequencing technology, an increased volume of genomic data is available, including data relevant to breast cancer prognosis. In recent decades, a number of prognostic marker identification methods based on a single gene have been proposed (5). However, the detection of the expression of a single gene does not effectively characterize the expression of a gene cluster composed of tens or hundreds of genes. It has been demonstrated that prognostic information from multi-gene signatures, including network modules, can reduce the heterogeneity of diseases with greater accuracy than data of a single gene (6). For example, a classifying system for breast cancer profiles was constructed based on 70 genes in a study by van de Vijver et al (7). A treatment response prediction method for patients with breast cancer was developed based on 64 genes by Pawitan et al (8). The 21-gene signature has now been intensively studied and is widely used in clinical decisions regarding breast cancer (9).Prognostic marker identification methods have the problem of low robustness (10), and prognostic information obtained from certain patients may not be applicable to other patients (11). Thus, the present study aimed to develop a method to identify robust prognostic modules based on four breast cancer expression profiles and protein-protein interaction networks (PPINs). Experimental detection based on these prognostic modules could aid clinicians in the diagnosis of patients who are at risk of developing malignant disease, and allow the provision of prevention and treatment for these patients as early as possible.
Materials and methodsExpressio...