2020
DOI: 10.1007/s10844-020-00605-w
|View full text |Cite
|
Sign up to set email alerts
|

ETM: Enrichment by topic modeling for automated clinical sentence classification to detect patients’ disease history

Abstract: Given the rapid rate at which text data are being digitally gathered in the medical domain, there is growing need for automated tools that can analyze clinical notes and classify their sentences in electronic health records (EHRs). This study uses EHR texts to detect patients' disease history from clinical sentences. However, in EHRs, sentences are less topic-focused and shorter than that in general domain, which leads to the sparsity of co-occurrence patterns and the lack of semantic features. To tackle this … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 14 publications
(11 citation statements)
references
References 33 publications
0
11
0
Order By: Relevance
“…(i) Bag-of-Words . The BOW representation is the most commonly used representation for text mining applications [ 11 ]. Words in the reports were converted into a sparse multidimensional representation, which was leveraged for further classification and clustering purposes.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…(i) Bag-of-Words . The BOW representation is the most commonly used representation for text mining applications [ 11 ]. Words in the reports were converted into a sparse multidimensional representation, which was leveraged for further classification and clustering purposes.…”
Section: Methodsmentioning
confidence: 99%
“…This method has the advantage of using an interpretable lower-dimensional representation of text and the disadvantage of lacking the capacity of methods that use all features of unstructured medical notes. We ran the experiments fitting the LDA topic model with Gibbs sampling [ 11 , 40 ] using 10 topics. Figure 3 shows two topics of the output of LDA applied to the X-ray radiology reports in the SMART study.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The BOW representation is the most commonly used representation for text mining applications (Bagheri et al, 2020b). Words in the reports were converted into a sparse multidimensional representation, which was leveraged for further classification and clustering purposes.…”
Section: Bag-of-wordsmentioning
confidence: 99%
“…This method has the advantage of using an interpretable lower dimensional representation of text and the disadvantage of lacking the capacity of methods that use all features of unstructured medical notes. We ran the experiments fitting the LDA topic model with Gibbs sampling (Bagheri et al, 2020b;Blei et al, 2003) using 10 topics. Figure 4.3 shows two topics of the output of LDA applied to the x-ray radiology reports in the SMART study.…”
Section: Clustering-based Representationmentioning
confidence: 99%