2022
DOI: 10.1155/2022/1679589

Information Extraction from the Text Data on Traditional Chinese Medicine: A Review on Tasks, Challenges, and Methods from 2010 to 2021

Abstract: Background. The practice of traditional Chinese medicine (TCM) began several thousand years ago, and the knowledge of practitioners is recorded in paper and electronic versions of case notes, manuscripts, and books in multiple languages. Developing a method of information extraction (IE) from these sources to generate a cohesive data set would be a great contribution to the medical field. The goal of this study was to perform a systematic review of the status of IE from TCM sources over the last 10 years. Meth…

citations
Cited by 12 publications
(11 citation statements)
references
References 73 publications
“…These clinical texts recorded by CM practitioners would provide useful data to understand and manage patient heterogeneity. Currently, there is no “gold” standard of corpora for CM that could be adopted for use in analysing CM medical text data [ 54 ]. The unstructured nature and variable forms of Chinese text expressions makes data processing more difficult and hence requires additional manpower [ 54 ].…”
Section: Discussion (mentioning)
confidence: 99%
“…Currently, there is no “gold” standard of corpora for CM that could be adopted for use in analysing CM medical text data [ 54 ]. The unstructured nature and variable forms of Chinese text expressions makes data processing more difficult and hence requires additional manpower [ 54 ]. In the current study, the writing styles and habitual vocabularies were relatively less complex compared to other full text-mining analysis of complete CM medical records since this study only focused on CM Syndromes and clinical characteristics as recorded in the case report files of the previous observational study [ 14 ].…”
Section: Discussion (mentioning)
confidence: 99%
“…Computer-aided drug design methods have been applied in the field of drug development over the past two decades [56]. Currently, this is seen as one of the best appropriate alternatives to high-throughput screening, which is routinely used in drug design and development.…”
Section: Computer-aided Drug Design (mentioning)
confidence: 99%
“…Currently, research on pre-trained language models (PLMs) in the domain of TCM mainly focuses on entity recognition, clinical record classification, and feature extraction 9 11 . Pan et al 12 conducted an in-depth study on electronic medical records (EMRs) in TCM and proposed a named entity recognition (NER) pipeline called the ALBERT-BiLSTM-CRF, which focuses on TCM orthopedic EMRs.…”
Section: Introduction (mentioning)
confidence: 99%