2023
DOI: 10.54097/ajst.v8i1.14333
|View full text |Cite
|
Sign up to set email alerts
|

Advancing Cancer Document Classification with R andom Forest

Chang Che,
Hao Hu,
Xinyu Zhao
et al.

Abstract: In this study, we address the challenging task of biomedical text document classification of Cancer Doc Classification, specifically focusing on lengthy research papers related to cancer. Unlike previous research that often deals with shorter abstracts and concise summaries, we curated a unique dataset comprising documents with more extensive content, each exceeding 6 pages in length. To tackle this classification challenge, we employed the Random Forest Tree method. Random Forest is a powerful ensemble… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 12 publications
(9 citation statements)
references
References 13 publications
0
9
0
Order By: Relevance
“…Because it is extremely difficult to obtain labels of fraudulent transactions, some scholars regard fraudulent samples as outliers and separate them from normal samples by anomaly detection technology. Van et al [9][10]. H used unsupervised anomaly detection technology to identify fraud samples of medical insurance claims, and the experimental results showed that potential new fraud patterns could be detected through anomaly detection technology.…”
Section: Related Workmentioning
confidence: 99%
“…Because it is extremely difficult to obtain labels of fraudulent transactions, some scholars regard fraudulent samples as outliers and separate them from normal samples by anomaly detection technology. Van et al [9][10]. H used unsupervised anomaly detection technology to identify fraud samples of medical insurance claims, and the experimental results showed that potential new fraud patterns could be detected through anomaly detection technology.…”
Section: Related Workmentioning
confidence: 99%
“…Different from innovation from model itself, lots of works proves a better feature selection will boost performance compare with the classic approach [4]: [7] proposed a method which use multiresolution wavelets and Zernike moments to get a better representation; [8] utilize Gabor wavelet and locality sensitive discriminant analysis in feature selection process. [9] introduce a Cross-correlation analysis in both spatial and wavelet domains before using PCA as feature selection method.…”
Section: Related Workmentioning
confidence: 99%
“…[8] introduce organize the data as a graph structure to apply graph convolution. Considering that healthcare datasets are generally less public and smaller in size, [9] proposes new data enhancement schemes to improve data quality.…”
Section: Related Workmentioning
confidence: 99%
“…At present, the radiative transfer model RF is used to characterize the climate effect of greenhouse gases, indicating the general trend of climate change. When RF>0, it will warm the ground and troposphere, and when RF0, it will cool the ground and troposphere [4]. 1 In terms of radiative forcing methods for calculating CO, the internationally recognized and widely used atmospheric radiation models are AER's atmospheric radiation model Fu-Liou model MODTRAN5, SBDART, SHDOM, etc.…”
Section: Introductionmentioning
confidence: 99%