2020
DOI: 10.1186/s12859-020-3523-9
|View full text |Cite
|
Sign up to set email alerts
|

Decoy selection for protein structure prediction via extreme gradient boosting and ranking

Abstract: Background Identifying one or more biologically-active/native decoys from millions of non-native decoys is one of the major challenges in computational structural biology. The extreme lack of balance in positive and negative samples (native and non-native decoys) in a decoy set makes the problem even more complicated. Consensus methods show varied success in handling the challenge of decoy selection despite some issues associated with clustering large decoy sets and decoy sets that do not show … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 58 publications
0
3
0
Order By: Relevance
“…Further details can be found in [ 54 ]. We note that basin finding over conformation-energy samples has been employed by several works for molecular structure–function studies [ 12 , 55 , 56 , 57 ].…”
Section: Methodsmentioning
confidence: 99%
“…Further details can be found in [ 54 ]. We note that basin finding over conformation-energy samples has been employed by several works for molecular structure–function studies [ 12 , 55 , 56 , 57 ].…”
Section: Methodsmentioning
confidence: 99%
“…Not only is it challenge for our work, but also for other studies, to develop a quality negative set. There are a variety of techniques for generating decoy sequences in computational genomics research (i.e., pseudo-negative samples), for example, randomized matching generation [46] , machine learning-aided selection [47] , background sequence selection [48] , and sequence recombination [48] , [49] . Each technique has its own characteristics, and the optimal strategy will depend on the specifics of the prediction task and the research objectives.…”
Section: Resultsmentioning
confidence: 99%
“…However, identifying the best structure remains a challenge. Therefore, it is necessary to employ a quality assessment stage to identify high-quality, near-native decoys among the generated decoys ( Akhter et al, 2020) . This remains true even with AlphaFold’s recent breakthrough performance ( Chen et al, 2023) .…”
Section: Introductionmentioning
confidence: 99%