Proceedings of the 15th Conference of the European Chapter of The Association for Computational Linguistics: Volume 1 2017
DOI: 10.18653/v1/e17-1109
|View full text |Cite
|
Sign up to set email alerts
|

Entity Extraction in Biomedical Corpora: An Approach to Evaluate Word Embedding Features with PSO based Feature Selection

Abstract: Text mining has drawn significant attention in recent past due to the rapid growth in biomedical and clinical records. Entity extraction is one of the fundamental components for biomedical text mining. In this paper, we propose a novel approach of feature selection for entity extraction that exploits the concept of deep learning and Particle Swarm Optimization (PSO). The system utilizes word embedding features along with several other features extracted by studying the properties of the datasets. We obtain an … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
12
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 17 publications
(13 citation statements)
references
References 37 publications
1
12
0
Order By: Relevance
“…Interestingly, unlike for the other two datasets, the introduction of pre-trained word embeddings to the system results in reduced performance on Ohsumed. This suggests that general domain embeddings might not be beneficial in specialized domains, which corroborates previous findings by Yadav et al (2017) on a different task, i.e. entity extraction.…”
Section: Resultssupporting
confidence: 89%
“…Interestingly, unlike for the other two datasets, the introduction of pre-trained word embeddings to the system results in reduced performance on Ohsumed. This suggests that general domain embeddings might not be beneficial in specialized domains, which corroborates previous findings by Yadav et al (2017) on a different task, i.e. entity extraction.…”
Section: Resultssupporting
confidence: 89%
“…For example, Ahmed et al [ 20 ] and Lei et al [ 21 ] use SVM, KNN, DT, SVM to extract the named entities from biomedical corpus and Chinese clinical text. Shweta et al [ 22 ] apply PSO (Particle Swarm Optimization) model for feature selection in NER research. Nowadays, the most popular methods have also been turned into using deep learning frameworks, including active learning [ 23 ], DNN (deep neural networks)-CRF [ 24 ] or LSTM-CRF [ 25 – 27 ].…”
Section: Related Workmentioning
confidence: 99%
“…and Priego, 2017) and sentence similarity detection . In terms of the biomedical domain, word embedding-based features have been used for entity extraction in biomedical corpora (Yadav et al, 2017) or clinical information extraction (Kholghi et al, 2016). Several approaches for personal health mention classification have been reported (Aramaki et al, 2011;Lamb et al, 2013a;Yin et al, 2015).…”
Section: Context-basedmentioning
confidence: 99%