2021
DOI: 10.1093/database/baab050
|View full text |Cite
|
Sign up to set email alerts
|

Nabe: an energetic database of amino acid mutations in protein–nucleic acid binding interfaces

Abstract: Protein–nucleic acid complexes play essential roles in regulating transcription, translation, DNA replication, repair and recombination, RNA processing and translocation. Site-directed mutagenesis has been extremely useful in understanding the principles of protein–DNA and protein–RNA interactions, and experimentally determined mutagenesis data are prerequisites for designing effective algorithms for predicting the binding affinity change upon mutation. However, a vital challenge in this area is the lack of su… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 34 publications
0
7
0
Order By: Relevance
“…A few databases with available experimental information are listed in Table 2, some of which gather information from other mutagenesis experiences besides alanine. For protein–protein complexes there are four main databases: the alanine scanning energetics database (ASEdb), 11 protein–protein complex mutation thermodynamics (PROXiMATE, 174 previously known as PINT 172 ), the binding interface database (BID 116 ), and structural database of kinetics and energetics of mutant protein interactions (SKEMPI), whereas for protein–nucleic acid, we can access protein–nucleic acid interactions (PRONIT 176 ) and protein–nucleic acid binding energetic database (NABE 177 ). Table 2 also includes some other curated, nonredundant datasets of mutations that satisfy a few requirements: An existing set of relative binding free energy difference (ΔΔ G binding ) values for interfacial residues coming from experimental alanine mutagenesis; Availability of a three‐dimensional (3D)‐structure in the protein databank (PDB); and A maximum of 35% sequence identity in each interface, hence preventing repeated complexes. …”
Section: In Silico Methodologies For Hs Identification/predictionmentioning
confidence: 99%
See 1 more Smart Citation
“…A few databases with available experimental information are listed in Table 2, some of which gather information from other mutagenesis experiences besides alanine. For protein–protein complexes there are four main databases: the alanine scanning energetics database (ASEdb), 11 protein–protein complex mutation thermodynamics (PROXiMATE, 174 previously known as PINT 172 ), the binding interface database (BID 116 ), and structural database of kinetics and energetics of mutant protein interactions (SKEMPI), whereas for protein–nucleic acid, we can access protein–nucleic acid interactions (PRONIT 176 ) and protein–nucleic acid binding energetic database (NABE 177 ). Table 2 also includes some other curated, nonredundant datasets of mutations that satisfy a few requirements: An existing set of relative binding free energy difference (ΔΔ G binding ) values for interfacial residues coming from experimental alanine mutagenesis; Availability of a three‐dimensional (3D)‐structure in the protein databank (PDB); and A maximum of 35% sequence identity in each interface, hence preventing repeated complexes. …”
Section: In Silico Methodologies For Hs Identification/predictionmentioning
confidence: 99%
“…A few databases with available experimental information are listed in Table 2, some of which gather information from other mutagenesis experiences besides alanine. For protein-protein complexes there are four main databases: the alanine scanning energetics database (ASEdb), 11 protein-protein complex mutation thermodynamics (PROXiMATE, 174 previously known as PINT 172 ), the binding interface database (BID 116 ), and structural database of kinetics and energetics of mutant protein interactions (SKEMPI), whereas for protein-nucleic acid, we can access protein-nucleic acid interactions (PRONIT 176 ) and proteinnucleic acid binding energetic database (NABE 177 ). 1.…”
Section: Databasesmentioning
confidence: 99%
“…Experiments conducted in vitro may not truly reflect the activity of proteins in the body. Moreover, the datasets in biological databases have problems such as noise, deviation, and missing data ( Su et al, 2019 ; Liu et al, 2020d ; Liu et al, 2021b ; Su et al, 2021 ).…”
Section: Existing Problems and Research Prospectsmentioning
confidence: 99%
“…In this work, we proposed a novel method based on discrete wavelet transform (DWT) and wavelet packet transform (WPT) to describe conventional features, termed WTL-PDH, to predict hot spots in protein–DNA binding interfaces. We screened 339 mutations in 117 protein–DNA complexes from dbAMEPNI [ 21 ], SAMPDI, Nabe [ 22 ], ProNAB [ 23 ], and then used Synthetic Minority Over-sampling Technique (SMOTE) [ 24 ] to solve the imbalance between positive and negative samples. Firstly, we extracted 43 dimensional traditional features in terms of solvent accessibility surface area, secondary structure, protrusion index and depth index, and hydrogen bond.…”
Section: Introductionmentioning
confidence: 99%