Ovarian cancer is the third most common gynecologic cancers worldwide. Advanced ovarian cancer patients bear a significant mortality rate. Survival estimation is essential for clinicians and patients to understand better and tolerate future outcomes. The present study intends to investigate different survival predictors available for cancer prognosis using data mining techniques. Dataset of 140 advanced ovarian cancer patients containing data from different data profiles (clinical, treatment, and overall life quality) has been collected and used to foresee cancer patients’ survival. Attributes from each data profile have been processed accordingly. Clinical data has been prepared corresponding to missing values and outliers. Treatment data including varying time periods were created using sequence mining techniques to identify the treatments given to the patients. And lastly, different comorbidities were combined into a single factor by computing Charlson Comorbidity Index for each patient. After appropriate preprocessing, the integrated dataset is classified using appropriate machine learning algorithms. The proposed integrated model approach gave the highest accuracy of 76.4% using ensemble technique with sequential pattern mining including time intervals of 2 months between treatments. Thus, the treatment sequences and, most importantly, life quality attributes significantly contribute to the survival prediction of cancer patients.
Background:Cancer treatment is expensive and results in a lot of side effects, and thus
survival prediction is necessary for the patients as well as the clinician. Data mining technology
has been used in the medical domain to extract interesting information. Cancer prognosis is such
an application in medicine.Objective:This study focuses on identifying the technologies used in the recent past for predicting
the survival of cancer patients. Supervised, semi-supervised and unsupervised techniques have
been used over the years successfully for the survival prediction of different types of cancer.Methods:A systematic literature review process has been followed in this study to discover the
future directions of the research. This study focuses on uncovering the gaps in recent studies.Results and Conclusion:It has been found that the present system lacks structured information of
the patients. Also, there are a lot of different cancer types that are still unexplored in terms of
survival prediction, mainly due to the unavailability of sufficient data. Hence a lot can be
improved if researchers may get their hands on required data for the research.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.