2019
DOI: 10.1021/acsomega.9b02470
|View full text |Cite
|
Sign up to set email alerts
|

Large-Scale Comparison of Alternative Similarity Search Strategies with Varying Chemical Information Contents

Abstract: Similarity searching (SS) is a core approach in computational compound screening and has a long tradition in pharmaceutical research. Over the years, different approaches have been introduced to increase the information content of search calculations and optimize the ability to detect compounds having similar activity. We present a large-scale comparison of distinct search strategies on more than 600 qualifying compound activity classes. Challenging test cases for SS were identified and used to evaluate differ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0
1

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(11 citation statements)
references
References 23 publications
0
10
0
1
Order By: Relevance
“…In contrast, long FPs cause compute performance and storage issues and may produce overfitted ML models, a problem known as the curse of dimensionality . Several authors explored different MFP and IFP lengths; ,,,, however, there is no single consensus on length, as results vary with the problem and data sets.…”
Section: Discussionmentioning
confidence: 99%
“…In contrast, long FPs cause compute performance and storage issues and may produce overfitted ML models, a problem known as the curse of dimensionality . Several authors explored different MFP and IFP lengths; ,,,, however, there is no single consensus on length, as results vary with the problem and data sets.…”
Section: Discussionmentioning
confidence: 99%
“…A 2D similarity-based virtual screening [ 53 , 54 ] was performed on the SwissSimilarity web tool [ 55 ], enabling ligand-based virtual screening of several libraries of small molecules using different approaches. The search was carried out in 31 databases [ 55 ].…”
Section: Aterial and Methodsmentioning
confidence: 99%
“…Usually, researchers may want to search for molecules or materials with similar properties in applications like discovering new drugs or cheaper materials [4][5][6] . Many similarity search methods have been developed for this purpose 7,8 . In general, a similarity search approach consists of three essential components: a molecular representation method, a quantitative metric to measure the similarity of two molecules, and a search algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…[4][5][6] Many similarity search methods have been developed for this purpose. 7,8 In general, a similarity search approach consists of three essential components: a molecular representation method, a quantitative metric to measure the similarity of two molecules, and a search algorithm. The search process usually starts with one or more query molecules (e.g., congurations that have desired properties).…”
Section: Introductionmentioning
confidence: 99%