2021
DOI: 10.1016/j.neucom.2020.08.091
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing random forest classification with NLP in DAMEH: A system for DAta Management in eHealth Domain

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(2 citation statements)
references
References 48 publications
0
2
0
Order By: Relevance
“…(2) The bootstrap methods, which are based on the RF algorithm, resample to generate multiple training sets, construct the supervised classification RF algorithm model, and use the mean squared error (MSE) of the out-of-bag (OOB) estimation method to evaluate the importance of the explanatory variables in the regression model. The mathematical definition of MSE is as follows [27]:…”
Section: Discussionmentioning
confidence: 99%
“…(2) The bootstrap methods, which are based on the RF algorithm, resample to generate multiple training sets, construct the supervised classification RF algorithm model, and use the mean squared error (MSE) of the out-of-bag (OOB) estimation method to evaluate the importance of the explanatory variables in the regression model. The mathematical definition of MSE is as follows [27]:…”
Section: Discussionmentioning
confidence: 99%
“…An alternative to word embeddings, also used in this study, is the feature-based approaches [84] that break up text into individual words called bag of words (BOW) and treat each word (unigram) or a contiguous sequence of words (2, 3, or more (n) words together referred to as bigrams, trigrams, or n-grams, accordingly) as a potential feature. The splitting of text into words (tokens) is referred to as tokenization.…”
Section: Text Vectorization Techniquesmentioning
confidence: 99%