2019
DOI: 10.1101/662098
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Evaluation of network architecture and data augmentation methods for deep learning in chemogenomics

Abstract: Among virtual screening methods that have been developed to facilitate the drug discovery process, chemogenomics presents the particularity to tackle the question of predicting ligands for proteins, at at scales both in the protein and chemical spaces. Therefore, in addition to to predict drug candidates for a given therapeutic protein target, like more classical ligand-based or receptor-based methods do, chemogenomics can also predict off-targets at the proteome level, and therefore, identify potential side-e… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 74 publications
0
8
0
Order By: Relevance
“…Transfer learning, as part of lifelong learning, is inspired by how quickly humans acquire new knowledge from other similar experiences in the past. Transfer learning can improve many problems of insufficient data by fine-tuning a pre-trained model with a large dataset in another or a general field to an actual small-scale dataset [ 181 ]. Bonggun et al [ 23 ] imported a molecule representation model learned from the PubChem database and applied it to their DTI model to improve performance.…”
Section: Limitation and Future Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Transfer learning, as part of lifelong learning, is inspired by how quickly humans acquire new knowledge from other similar experiences in the past. Transfer learning can improve many problems of insufficient data by fine-tuning a pre-trained model with a large dataset in another or a general field to an actual small-scale dataset [ 181 ]. Bonggun et al [ 23 ] imported a molecule representation model learned from the PubChem database and applied it to their DTI model to improve performance.…”
Section: Limitation and Future Workmentioning
confidence: 99%
“…Other metrics such as concordance index (CI or C-index) and Spearman's correlation coefficient (ρ) quantify the quality of rankings by comparing the order of the predictions and the order of the ground truths. A frequently used ranking metric in the DTA prediction is the CI [25,130,181]. When predicting the binding affinity values of two random DTPs, the CI measures whether those values were predicted in the same order as their actual values.…”
Section: Regression Evaluation Metricsmentioning
confidence: 99%
“…Traditionally, an experimental assay is the surest way to obtain the desired binding affinity, but it is expensive and time-consuming to use this approach to analyze many possible DT pairs. A plethora of drug-like compounds and latent protein targets pose greater challenges because multiple drugs can be associated with multiple targets [ 1 , 2 ]. As a result, drug–target affinity (DTA) prediction has attracted considerable attention in recent years [ 3 ].…”
Section: Introductionmentioning
confidence: 99%
“…Actually, there are a variety of scenarios in which the classifications of objects (bags) can only be determined by some key components (instances), such as medical diagnoses; that is, some instances trigger the bag label. Following this concept, DTIs can be characterized by an MIL framework: the private representation contains abundant information that has been proven to be effective for DTA prediction [ 6 , 9 , 11 , 19 ], as does each public feature obtained via early fusion [ 2 , 12 , 13 , 14 ] and each public feature obtained via concatenation [ 4 , 5 , 7 , 20 ]. However, the exact contribution of each instance to the final DTA value of the bag is unknown.…”
Section: Introductionmentioning
confidence: 99%
“…Actually, there are a variety of scenarios in which the classifications of objects (bags) can only be determined by some key components (instances), such as medical diagnoses; that is, some instances trigger the bag label. Following this concept, DTIs can be characterized by an MIL framework: the private representation contains abundant information that has been proven to be effective for DTA prediction [6,9,11,19], as does each public feature obtained via early fusion [2,[12][13][14] and each public feature obtained via concatenation [4,5,7,20]. However, the exact contribution of each instance to the final DTA value of the bag is unknown.…”
Section: Introductionmentioning
confidence: 99%