2019
DOI: 10.1016/j.cmpb.2019.01.007
|View full text |Cite
|
Sign up to set email alerts
|

Extraction of risk factors for cardiovascular diseases from Chinese electronic medical records

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(7 citation statements)
references
References 28 publications
0
7
0
Order By: Relevance
“…Most of the selected publications involved corpora written in English: 94 of the 105 reviewed articles. Publications in languages other than English include [39][40][41][42] in Chinese, [43,44] in Korean, [45] in Dutch, [46] in Italian, [47] in Swedish, and [48] in Spanish. Additionally, [31] dealt with English and French, extracting temporal relations from both the THYME corpus and the MERLOT corpus [49], which are from medical texts in French.…”
Section: Global Quantitative Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Most of the selected publications involved corpora written in English: 94 of the 105 reviewed articles. Publications in languages other than English include [39][40][41][42] in Chinese, [43,44] in Korean, [45] in Dutch, [46] in Italian, [47] in Swedish, and [48] in Spanish. Additionally, [31] dealt with English and French, extracting temporal relations from both the THYME corpus and the MERLOT corpus [49], which are from medical texts in French.…”
Section: Global Quantitative Resultsmentioning
confidence: 99%
“…However, the results were still not comparable with those of traditional machine learning algorithms. For regular datasets, the authors of [40] extracted risk factors for cardiovascular diseases, similar to the i2b2/UTHealth 2014 shared-task objective, using a CNN-based model.…”
Section: Authorsmentioning
confidence: 99%
“…In addition, in the process of annotation of the military text corpus, the principle of "no overlapping, no nesting, and no pause punctuation" that is similar to that for literature should be followed. Since nonstandard Chinese word segmentation may cause error transmission problems for subsequent named entity recognition tasks, a BIO threesegment annotation method is used in the sentence-bysentence annotation process of military corpus [28], in which the starting word for each entity is marked as "Bentity type," the subsequent mark is "I-entity type," and O means nonentity part. erefore, there are a total of 31 types of tags in the labeled corpus, as shown in Table 1.…”
Section: Entity Annotation System and Methodmentioning
confidence: 99%
“…The first papers to report on NER models, based on deep learning and applied to EHRs, were published in 2015 [27]. By 2019, bidirectional long short-term memory (BiLSTM) had become the dominant architecture [7,[28][29][30][31][32][33][34][35][36][37][38][39][40][41][42][43][44][45]. That same year, three studies were published that utilized the bidirectional encoder representations from transformers (BERT) architecture [28,43,46].…”
Section: Classification Modelsmentioning
confidence: 99%