2022
DOI: 10.1371/journal.pdig.0000152
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale application of named entity recognition to biomedicine and epidemiology

Abstract: Background Despite significant advancements in biomedical named entity recognition methods, the clinical application of these systems continues to face many challenges: (1) most of the methods are trained on a limited set of clinical entities; (2) these methods are heavily reliant on a large amount of data for both pre-training and prediction, making their use in production impractical; (3) they do not consider non-clinical entities, which are also related to patient’s health, such as social, economic or demog… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
4
1

Relationship

2
7

Authors

Journals

citations
Cited by 16 publications
(11 citation statements)
references
References 28 publications
0
11
0
Order By: Relevance
“…Because of that, we rely on concepts that can be extracted from the abstracts of the author using a named entity recognition tool. For the concepts heuristic, the concepts in the abstracts and in the SOIs are therefore retrieved using the named entity recognition tool from the work of Raza et al [ 31 ].…”
Section: Methodsmentioning
confidence: 99%
“…Because of that, we rely on concepts that can be extracted from the abstracts of the author using a named entity recognition tool. For the concepts heuristic, the concepts in the abstracts and in the SOIs are therefore retrieved using the named entity recognition tool from the work of Raza et al [ 31 ].…”
Section: Methodsmentioning
confidence: 99%
“…Building textual codebook. Specifically, to establish a textual codebook, we use a pre-trained BioEN [35] to extract the related entities from the medical reports in the training set, where the related entities are divided into four entity groups, i.e., biological structure, detailed description, disease disorder, and sign symptom. Next, we compute the frequency of each entity and pick top κ 0 entities for each entity group as keywords in the textual codebook.…”
Section: Cxr-to-report Generationmentioning
confidence: 99%
“…Contributions Our proposed framework consists of a comprehensive pipeline that includes the creation of a high-quality database from published case reports, the design and implementation of NLP models to detect and examine clinical and non-clinical concepts in the data, and a thorough evaluation process. A named entity recognition (NER) algorithm 8 is included in the NLP models, and it is capable of accurately identifying essential clinical concepts such as diseases, conditions, symptoms, and drugs, as well as non-clinical concepts such as social determinants of health (SDOH) 9 . Furthermore, we developed a relation extraction (RE) model to identify relationships between these concepts, including disease-complication, treatment-improvement, and drug-adverse-effect associations.…”
Section: Introductionmentioning
confidence: 99%