2020
DOI: 10.3390/healthcare8040392
|View full text |Cite
|
Sign up to set email alerts
|

Classification of Biomedical Texts for Cardiovascular Diseases with Deep Neural Network Using a Weighted Feature Representation Method

Abstract: This study aims to improve the performance of multiclass classification of biomedical texts for cardiovascular diseases by combining two different feature representation methods, i.e., bag-of-words (BoW) and word embeddings (WE). To hybridize the two feature representations, we investigated a set of possible statistical weighting schemes to combine with each element of WE vectors, which were term frequency (TF), inverse document frequency (IDF) and class probability (CP) methods. Thus, we built a multiclass cl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…With this method, we are able to retrieve the useful data from an unstructured log file. Many studies have used the BoW method [56][57][58][59], and in a recent study, combining BoW with word embeddings, Ahmed et al showed how to improve classification of biomedical texts [60].…”
Section: Bag Of Words (Bow)mentioning
confidence: 99%
“…With this method, we are able to retrieve the useful data from an unstructured log file. Many studies have used the BoW method [56][57][58][59], and in a recent study, combining BoW with word embeddings, Ahmed et al showed how to improve classification of biomedical texts [60].…”
Section: Bag Of Words (Bow)mentioning
confidence: 99%
“…Text classification tasks within the medical domain primarily benefited from domain-specific features, often generated via the utilization of knowledge sources such as the unified medical language system (UMLS) [ 9 ]. With the emergence of methods for generating effective numeric representations of texts or word embeddings (dense vectors), coupled with advances in computational capabilities, deep neural network based approaches became dominant in this space, obtaining SOTA performances in many text classification tasks [ 10 , 11 ]. Such approaches use dense vector representations, and generally require large volumes of annotated data.…”
Section: Introductionmentioning
confidence: 99%
“…9 With the emergence of methods for generating effective numeric representations of texts or word embeddings (dense vectors), coupled with advances in computational capabilities, deep neural network based approaches became dominant in this space, obtaining SOTA performances in many text classification tasks. 10,11 Such approaches use dense vector representations, and generally require large volumes of annotated data. Word embedding generation approaches such as Word2Vec 12 and GLoVe 13 are capable of effectively capturing semantic representations of words/phrases ( ie ., text fragments with similar meanings appear close together in vector space), which n-gram based approaches were not capable of.…”
Section: Introductionmentioning
confidence: 99%