2019
DOI: 10.1186/s13321-019-0376-1
|View full text |Cite
|
Sign up to set email alerts
|

Combining structural and bioactivity-based fingerprints improves prediction performance and scaffold hopping capability

Abstract: This study aims at improving upon existing activity predictions methods by augmenting chemical structure fingerprints with bio-activity based fingerprints derived from high-throughput screening (HTS) data (HTSFPs) and thereby showcasing the benefits of combining different descriptor types. This type of descriptor would be applied in an iterative screening scenario for more targeted compound set selection. The HTSFPs were generated from HTS data obtained from PubChem and combined with an ECFP4 structural finger… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
32
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
2

Relationship

1
9

Authors

Journals

citations
Cited by 37 publications
(32 citation statements)
references
References 21 publications
0
32
0
Order By: Relevance
“…These identifiers are then usually hashed to a bit vector with a fixed length in order to allow the comparison of different representations. In this work, we set a vector length of 1024 bits using the Morgan FPs implementation of RDKit [15] since it was found that the use of a larger vector size with ML algorithms such as random forest could only allow negligible performance improvements [17].…”
Section: Molecular Fingerprintsmentioning
confidence: 99%
“…These identifiers are then usually hashed to a bit vector with a fixed length in order to allow the comparison of different representations. In this work, we set a vector length of 1024 bits using the Morgan FPs implementation of RDKit [15] since it was found that the use of a larger vector size with ML algorithms such as random forest could only allow negligible performance improvements [17].…”
Section: Molecular Fingerprintsmentioning
confidence: 99%
“… 16 In other words, the PhGs shared by diverse active compounds can be useful for scaffold hopping (SH). 8 , 9 , 17 19 SH requires a method to capture the functional similarity with a focus on the interaction and freedom from scaffold-based or structural similarity. 8 , 9 …”
Section: Introductionmentioning
confidence: 99%
“…The key goal of QSAR studies is to determine a mathematical model between the property under investigation, and one or more molecular descriptors [12][13][14]. Using the model, similar bioactivities of compounds not involved in the training set can be predicted from their structural descriptors [15]. Here, we used genetic function approximation (GFA) and ELM models to predict the bioactivity of the molecules under investigation.…”
Section: Introductionmentioning
confidence: 99%