2021
DOI: 10.1007/978-3-030-89657-7_20

Similarity Search for an Extreme Application: Experience and Implementation

Cited by 2 publications (6 citation statements)
References 20 publications
“…We have chosen to test our approach on 3D protein structures for several reasons. First, while protein structure data is very widely used, and the study of this data is vital for almost every area of biochemical research, the issue of efficient search and comparison of protein structures is still unresolved to some extent, with many databases still relying on time-consuming brute-force linear search [14]. This data is also publicly available in a single database, called the Protein Data Bank (PDB), which is used by the majority of protein researchers and widely agreed upon as the standard.…”
Section: Data Domain (mentioning)
confidence: 99%
“…We evaluated our approach using range queries, with 512 randomly chosen protein chains from the dataset used as query objects. In order to compare our results against the ground truth, we needed to know the Q distances (based on Q score) between the 512 protein chains and all the other chains in the database; these distances were kindly provided by the researchers behind [14], where the same 512 objects were used as the pivots for their search engine. The objects were chosen uniformly randomly with respect to protein chain length, which ensures that even very long proteins are represented among our queries (despite constituting a relatively small portion of the dataset).…”
Section: Experimental Evaluation (mentioning)
confidence: 99%
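
The evaluation described in the statement above, comparing range-query answers against precomputed ground-truth distances, can be illustrated with a minimal Python sketch. The function names, the dictionary layout, and the distance values are hypothetical and chosen only for illustration. The excerpt does not say which quality measure the citing authors report, so recall is used here as an assumption.

```python
def exact_range_answer(distances, radius):
    """Ground-truth answer of a range query.

    `distances` maps an object id to its precomputed distance from the
    query object (hypothetical layout). Every object within `radius`
    belongs to the exact answer.
    """
    return {obj_id for obj_id, dist in distances.items() if dist <= radius}


def recall(approximate_answer, exact_answer):
    """Fraction of the exact answer that the approximate search returned."""
    if not exact_answer:
        # Nothing to find, so nothing was missed.
        return 1.0
    return len(set(approximate_answer) & exact_answer) / len(exact_answer)


# Made-up distances from one query chain to four database chains.
ground_truth = {"chain_a": 0.10, "chain_b": 0.45, "chain_c": 0.80, "chain_d": 0.30}

exact = exact_range_answer(ground_truth, radius=0.5)
found = ["chain_a", "chain_d"]  # what a hypothetical approximate index returned
print(recall(found, exact))
```

Averaging this value over all query objects would give a single quality figure per query radius, under the same assumptions.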