2020
DOI: 10.1107/s2059798320009080
|View full text |Cite
|
Sign up to set email alerts
|

Predicting protein model correctness in Coot using machine learning

Abstract: Manually identifying and correcting errors in protein models can be a slow process, but improvements in validation tools and automated model-building software can contribute to reducing this burden. This article presents a new correctness score that is produced by combining multiple sources of information using a neural network. The residues in 639 automatically built models were marked as correct or incorrect by comparing them with the coordinates deposited in the PDB. A number of features were also calculate… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 20 publications
(17 citation statements)
references
References 22 publications
0
17
0
Order By: Relevance
“…2010; Bond et al . 2020) for analysis, and UCSF Chimera 1.14 for visualization (Pettersen et al . 2004).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…2010; Bond et al . 2020) for analysis, and UCSF Chimera 1.14 for visualization (Pettersen et al . 2004).…”
Section: Methodsmentioning
confidence: 99%
“…The resulting ensembles of top rumenic acid binding poses were selected according to the low-binding energy cut-off, equal to one hydrogen bond (≤−3.0 kcal mol −1 ). These ensembles were exported to PyMOL (The PyMOL Molecular Graphics System, Version 2.0 Schrödinger, LLC), COOT (Emsley et al 2010;Bond et al 2020) for analysis, and UCSF Chimera 1.14 for visualization (Pettersen et al 2004).…”
Section: Homology Modelling and Docking Analysismentioning
confidence: 99%
“…Earlier model building methods using machine learning relied on pattern matching [14] or feature-based approaches [15] [16]. In [17], neural networks are used to validate models built using Coot [18]. Protein electron density maps are usually represented [7] as three-dimensional volumetric data which is common in bio-imaging (e.g.…”
Section: Related Workmentioning
confidence: 99%
“…We used data sets from three sources to train and evaluate our ML predictive model: 1203 experimental phasing data sets from the Joint Center for Structural Genomics (JCSG; van den Bedem et al, 2011;Alharbi et al, 2019), 32 newer experimental phasing data sets deposited between 2015 and 2021 and taken from the PDB, and 1332 molecular-replacement (MR) data sets from Bond et al (2020). These data sets correspond to two techniques that can be used to build a protein structure.…”
Section: Data Setsmentioning
confidence: 99%
“…(v) Sequence identity: the sequence identity calculated by superposition of the homologue chain onto the target chain using GESAMT (Krissinel, 2012;Bond et al, 2020).…”
Section: Electron-density Map Featuresmentioning
confidence: 99%