2019
DOI: 10.1093/database/bay138
|View full text |Cite
|
Sign up to set email alerts
|

Extracting chemical–protein interactions from literature using sentence structure analysis and feature engineering

Abstract: Information about the interactions between chemical compounds and proteins is indispensable for understanding the regulation of biological processes and the development of therapeutic drugs. Manually extracting such information from biomedical literature is very time and resource consuming. In this study, we propose a computational method to automatically extract chemical–protein interactions (CPIs) from a given text. Our method extracts CPI pairs and CPI triplets from sentences, where a CPI pair consists of a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 37 publications
(18 citation statements)
references
References 31 publications
0
18
0
Order By: Relevance
“…Differently from the works cited above, Lung et al [34] used traditional machine learning algorithms with handcrafted features, achieving an F1-score of 0.5671. As part of their approach, the authors manually built a dictionary with 1155 interaction words, which where mapped to the corresponding CPR type, to create CPI triplets.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Differently from the works cited above, Lung et al [34] used traditional machine learning algorithms with handcrafted features, achieving an F1-score of 0.5671. As part of their approach, the authors manually built a dictionary with 1155 interaction words, which where mapped to the corresponding CPR type, to create CPI triplets.…”
Section: Resultsmentioning
confidence: 99%
“…Though, we did not follow this possibility leaving it as possible future work. Similar binary approaches were followed by Lung et al [33, 34] and Warikoo et al [44] who start by predicting if a CPR pair is positive.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…e rules can be derived from manually annotated corpora using machine learning algorithms or de ned manually by a domain expert. Machine learning-based approaches are also employed for the relation extraction from biomedical text [17,20,22,38]. e machine learning category includes methods based on feature engineering, graph kernels, and deep learning.…”
Section: Related Workmentioning
confidence: 99%
“…We obtained these features based on (1) domain expertise and (2) informative features used with similar relation extraction problem [22]. e full list of engineered features and their value type (within parentheses) is as follows:…”
Section: Engineered Featuresmentioning
confidence: 99%