2021
DOI: 10.1007/s13755-021-00156-6
|View full text |Cite
|
Sign up to set email alerts
|

ITEXT-BIO: Intelligent Term EXTraction for BIOmedical analysis

Abstract: Here, we introduce ITEXT-BIO, an intelligent process for biomedical domain terminology extraction from textual documents and subsequent analysis. The proposed methodology consists of two complementary approaches, including free and driven term extraction. The first is based on term extraction with statistical measures, while the second considers morphosyntactic variation rules to extract term variants from the corpus. The combination of two term extraction and analysis strategies is the keystone of ITEXT-BIO. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…According to the review, one of the most commonly used algorithms is C-Value, which handles a hybrid approach and uses the frequency of occurrence and nesting of candidate terms to determine the relevant ones depending on the domain. The authors agree that the algorithm's potential lies in its hybrid characteristic because one part is in charge of applying statistical methods, and subsequently, linguistic filters are applied (Lahbib et al 2015;Bakar et al 2015;Ali and Saad 2016;Du et al 2016;Benabdallah et al 2017;Arora et al 2017;Mykowiecka et al 2018;Kafando et al 2021).…”
Section: What Algorithms Are Used In Automatic Term Extractors?mentioning
confidence: 97%
See 1 more Smart Citation
“…According to the review, one of the most commonly used algorithms is C-Value, which handles a hybrid approach and uses the frequency of occurrence and nesting of candidate terms to determine the relevant ones depending on the domain. The authors agree that the algorithm's potential lies in its hybrid characteristic because one part is in charge of applying statistical methods, and subsequently, linguistic filters are applied (Lahbib et al 2015;Bakar et al 2015;Ali and Saad 2016;Du et al 2016;Benabdallah et al 2017;Arora et al 2017;Mykowiecka et al 2018;Kafando et al 2021).…”
Section: What Algorithms Are Used In Automatic Term Extractors?mentioning
confidence: 97%
“…On the other hand, the Term Frequency-Inverse Document Frequency (TF-IDF) algorithm applies weighting methods to express the level of relevance of a candidate term within a text. It is an algorithm used for term extraction, and although there are many systems with different variations, the basis is still TF-IDF (Guo et al 2015;Ali and Saad 2016;Abduljabbar et al 2018;Afrizal et al 2019;Kafando et al 2021).…”
Section: What Algorithms Are Used In Automatic Term Extractors?mentioning
confidence: 99%
“…While many strategies for identifying MWEs have been presented in the past (Ramisch et al, 2010;Kafando et al, 2021;, we found that applying them to the medical domain (and especially its clinical counterpart) was challenging due to the extreme corpus size that would be required to produce statistically significant results for the long tail of medical entities.…”
Section: Introductionmentioning
confidence: 99%
“…While many strategies for identifying MWEs have been presented in the past (Ramisch et al, 2010;Kafando et al, 2021;Zeng and Bhat, 2021), we found that applying them to the medical domain (and especially its clinical counterpart) was challenging due to the extreme corpus size that would be required to produce statistically significant results for the long tail of medical entities.…”
Section: Introductionmentioning
confidence: 99%