2015
DOI: 10.1007/978-3-319-21212-8_12
|View full text |Cite
|
Sign up to set email alerts
|

MEDLINE Text Mining: An Enhancement Genetic Algorithm Based Approach for Document Clustering

Abstract: MEDLINE is the largest biomedical literature database. It is updated daily with 200-4,000 citations. This permanent growth induces the need of a good MEDLINE abstract clustering to accelerate the procedure of research and information retrieval. Several works have been developed in this context, but clustering MEDLINE abstracts are still an area where researchers are trying to propose new approaches to better clustering. Over the last few years, evolutionary algorithms have been widely applied to clustering pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 56 publications
(13 citation statements)
references
References 50 publications
0
13
0
Order By: Relevance
“…This study uses the standard information retrieval metrics precision, recall, and f-measure [ 56 59 ] to evaluate the proposed model. Precision intends to evaluate the cluster based on the number of correctly retrieved instances out of the total number of retrieved queries.…”
Section: Resultsmentioning
confidence: 99%
“…This study uses the standard information retrieval metrics precision, recall, and f-measure [ 56 59 ] to evaluate the proposed model. Precision intends to evaluate the cluster based on the number of correctly retrieved instances out of the total number of retrieved queries.…”
Section: Resultsmentioning
confidence: 99%
“…This issue was addressed with a fuzzy similaritybased data cleansing approach; The authors applied vector space and topic modelling to extract the rich patient-specific information available in unstructured clinical data. This can be crucial in countries where structured EHR adoption is not widespread [31,32]. The authors worked with 223, 556 nursing notes of 357.8 words on average, predicting 19 ICD-9 code group labels, and achieved a maximum F1-score (weighted-) of 69.81 across all the tested models.…”
Section: Related Workmentioning
confidence: 99%
“…Chowdhuri et al [5] presented a review for Rough set based Ad Hoc network. Karaa et al [6] proposed a novel approach for document clustering MEDLINE based on genetic algorithm. Kausar et al [7] discussed the data mining method applied in the extraction of clinical attribute and classification for diagnosis of cardiovascular patients.…”
Section: Literature Reviewmentioning
confidence: 99%