Protein–protein interaction site prediction through combining local and global features with deep neural networks

Zeng, Min; Zhang, Fuhao; Wu, Fang‐Xiang; Li, Yaohang; Wang, Jianxin; Li, Min

doi:10.1093/bioinformatics/btz699

Cited by 205 publications

(255 citation statements)

References 42 publications

Citation statements by type: Supporting (count not shown), Mentioning (253), Contrasting (count not shown)


“…It can be found that the values of six measures achieved in these three independent benchmark datasets are lower than that in the dataset used in this work, but it is reasonable because the IHT-XGB model is built on this original dataset. Table 4 also shows the prediction performance of five models, i.e., SSWRF, LORIS, PSIVER, SCRIBER, and DELPHI, in identification of protein-protein interaction sites [31][32][33][34][35][36][37]. The highest results in each performance measures for different models are highlighted as bold type.…”

Section: Prediction Performance in Independent Benchmark Datasets (mentioning)

confidence: 99%

Developing Computational Model to Predict Protein-Protein Interaction Sites Based on the XGBoost Algorithm

Deng, Zhang, Wang et al. 2020, IJMS


The study of protein-protein interaction is of great biological significance, and the prediction of protein-protein interaction sites can promote the understanding of cell biological activity and will be helpful for drug development. However, uneven distribution between interaction and non-interaction sites is common because only a small number of protein interactions have been confirmed by experimental techniques, which greatly affects the predictive capability of computational methods. In this work, two imbalanced data processing strategies based on XGBoost algorithm were proposed to re-balance the original dataset from inherent relationship between positive and negative samples for the prediction of protein-protein interaction sites. Herein, a feature extraction method was applied to represent the protein interaction sites based on evolutionary conservatism of proteins, and the influence of overlapping regions of positive and negative samples was considered in prediction performance. Our method showed good prediction performance, such as prediction accuracy of 0.807 and MCC of 0.614, on an original dataset with 10,455 surface residues but only 2297 interface residues. Experimental results demonstrated the effectiveness of our XGBoost-based method.
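The abstract above reports both accuracy and MCC on a dataset where interface residues are the minority class. As a minimal sketch of why the two measures can diverge under class imbalance, the snippet below computes both from confusion-matrix counts. The counts are hypothetical and chosen only for illustration; they are not taken from the cited paper, and the function names are assumptions of this sketch.

```python
import math


def accuracy(tp, fp, tn, fn):
    """Fraction of residues classified correctly."""
    return (tp + tn) / (tp + fp + tn + fn)


def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Common convention: report 0.0 when a marginal is empty (zero denominator).
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical example: 100 interface and 900 non-interface residues,
# and a classifier that labels every residue as non-interface.
all_negative = dict(tp=0, fp=0, tn=900, fn=100)

print(accuracy(**all_negative))  # 0.9
print(mcc(**all_negative))       # 0.0
```

In this illustrative case the trivial classifier scores 90% accuracy while its MCC is 0, which is why MCC is commonly reported alongside accuracy for imbalanced interaction-site data.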


“…See Appendix K for additional discussion. scheme used by [15] (available at: https://github.com/CSUBioGroup/DeepPPISP). Table A1 shows the numbers of interaction and non-interaction sites in these datasets and the splits used in this study and [15].…”

Section: Discussion (mentioning)

confidence: 99%

EGRET: Edge Aggregated Graph Attention Networks and Transfer Learning Improve Protein-Protein Interaction Site Prediction

Mahbub 2020, Preprint


Motivation: Protein-protein interactions are central to most biological processes. However, reliable identification of protein-protein interaction (PPI) sites using conventional experimental methods is slow and expensive. Therefore, great efforts are being put into computational methods to identify PPI sites.

Results: We present EGAT, a highly accurate deep learning based method for PPI site prediction, where we have introduced a novel edge aggregated graph attention network to effectively leverage the structural information. We, for the first time, have used transfer learning in PPI site prediction. Our proposed edge aggregated network, together with transfer learning, has achieved remarkable improvement over the best alternate methods. Furthermore, EGAT offers a more interpretable framework than the typical black-box deep neural networks.

Availability: EGAT is freely available as an open source project at https://github.com/Sazan-Mahbub/EGAT.


“…As examples, we make available two graph-based protein structure datasets. The first, based on the collections outlined in (Zeng et al, 2019), consists of 420 proteins, with node labels indicating whether a residue is involved in a protein-protein interaction. The interaction status data and structure originate from structures of the complexes in the RCSB PDB.…”

Section: Datasets (mentioning)

confidence: 99%

Graphein - a Python Library for Geometric Deep Learning and Network Analysis on Protein Structures and Interaction Networks

Jamasb, Lió, Blundell 2020, Preprint


Graphein is a Python library for constructing graph and surface-mesh representations of protein structures for computational analysis. The library interfaces with popular geometric deep learning libraries: DGL, PyTorch Geometric and PyTorch3D. Geometric deep learning is emerging as a popular methodology in computational structural biology. As feature engineering is a vital step in a machine learning project, the library is designed to be highly flexible, allowing the user to parameterise the graph construction, scalable to facilitate working with large protein complexes, and containing useful pre-processing tools for preparing experimental structure files. Graphein is also designed to facilitate network-based and graph-theoretic analyses of protein structures in a high-throughput manner. As example workflows, we make available two new protein structure-related datasets, previously unused by the geometric deep learning community.

Availability and implementation: Graphein is written in Python. Source code, example usage and datasets, and documentation are made freely available under an MIT License at the following URL: https://github.com/a-r-j/graphein

