2021
DOI: 10.1371/journal.pcbi.1009171
|View full text |Cite
|
Sign up to set email alerts
|

Protein domain-based prediction of drug/compound–target interactions and experimental validation on LIM kinases

Abstract: Predictive approaches such as virtual screening have been used in drug discovery with the objective of reducing developmental time and costs. Current machine learning and network-based approaches have issues related to generalization, usability, or model interpretability, especially due to the complexity of target proteins’ structure/function, and bias in system training datasets. Here, we propose a new method “DRUIDom” (DRUg Interacting Domain prediction) to identify bio-interactions between drug candidate co… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1

Relationship

3
5

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 70 publications
1
6
0
Order By: Relevance
“…Considering algorithmic approaches, sequence-based protein representations can be grouped as conventional/ classical descriptors (or descriptor sets) and learned embeddings. Conventional descriptors are mostly modeldriven, meaning that they are generated by applying predefined rules and/or statistical calculations on sequences considering various molecular properties that include physicochemical [10][11][12], geometrical [13,14] and topological [12] characteristics of amino acids, as well as sequence composition [11,15], semantic similarities [16], functional characteristics/properties [17][18][19][20], and evolutionary relationships [13,21] of proteins. Learned protein embeddings (a.k.a.…”
Section: Introductionmentioning
confidence: 99%
“…Considering algorithmic approaches, sequence-based protein representations can be grouped as conventional/ classical descriptors (or descriptor sets) and learned embeddings. Conventional descriptors are mostly modeldriven, meaning that they are generated by applying predefined rules and/or statistical calculations on sequences considering various molecular properties that include physicochemical [10][11][12], geometrical [13,14] and topological [12] characteristics of amino acids, as well as sequence composition [11,15], semantic similarities [16], functional characteristics/properties [17][18][19][20], and evolutionary relationships [13,21] of proteins. Learned protein embeddings (a.k.a.…”
Section: Introductionmentioning
confidence: 99%
“…Given that membrane proteins are greatly involved in DTIs, as indicated above, we assume that these tools will also have an acceptable ability to predict DTIs when the drug target is a membrane protein, even though the precise ratio of membrane proteins vs. globular proteins was not made clear by these aforementioned studies in their training datasets. This has been suggested in the DRUIDom work [263] . However, compared with sequence-based prediction of DTIs, the field is greatly devoid of prediction tools developed based on 3D protein structures [264] .…”
Section: Membrane Proteins and Drug Target Interactionmentioning
confidence: 77%
“…Compounds with IC 50 lower than 10 μM were labeled as positives and compounds with IC 50 higher than 20 μM were labeled as negatives. Compounds with IC 50 between 10 and 20 μM were considered ambiguous and were therefore discarded in a similar manner to a previous work by Rifaioglu et al 41 and Doğan et al 42 From the two databases, a total of 2281 instances (2242 positives and 39 negatives) were obtained. Since there was a higher proportion of negatives—compounds with weak or no GR binding affinities—in the real-world data, we designed the training dataset to have a ten-fold greater proportion of negatives than positives.…”
Section: Methodsmentioning
confidence: 99%