2020
DOI: 10.48550/arxiv.2007.06252
Preprint

Intrinsic-Extrinsic Convolution and Pooling for Learning on 3D Protein Structures

Pedro Hermosilla,
Marco Schäfer,
Matěj Lang
et al.

Abstract: Proteins perform a large variety of functions in living organisms, thus playing a key role in biology. As of now, available learning algorithms to process protein data do not consider several particularities of such data and/or do not scale well for large protein conformations. To fill this gap, we propose two new learning operations enabling deep 3D analysis of large-scale protein data. First, we introduce a novel convolution operator which considers both the intrinsic (invariant under protein folding) as we…
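The truncated abstract already names the paper's core operation, so a brief illustration may help. Below is a minimal sketch of the idea, not the authors' implementation: it assumes a protein given as ordered per-residue 3D coordinates, approximates the intrinsic, fold-invariant distance by separation along the amino-acid chain, takes the extrinsic distance as the 3D Euclidean one, and combines the two with fixed Gaussian kernels where the paper would use learned weights. The function name and parameters are hypothetical.

import numpy as np

def intrinsic_extrinsic_conv(coords, feats, radius=8.0, sigma_ext=4.0, sigma_int=5.0):
    """One layer of a point convolution whose kernel depends on both an
    extrinsic distance (3D Euclidean, changes when the protein folds) and an
    intrinsic distance (here: chain separation, invariant under folding).
    coords: (N, 3) residue positions; feats: (N, C) per-residue features."""
    n = coords.shape[0]
    out = np.zeros_like(feats, dtype=float)
    for i in range(n):
        d_ext = np.linalg.norm(coords - coords[i], axis=1)   # extrinsic: Euclidean distance
        d_int = np.abs(np.arange(n) - i).astype(float)       # intrinsic: sequence separation
        mask = d_ext < radius                                 # restrict to a spatial neighborhood
        # Fixed Gaussian kernels stand in for the paper's learned kernel function.
        w = np.exp(-(d_ext[mask] / sigma_ext) ** 2) * np.exp(-(d_int[mask] / sigma_int) ** 2)
        out[i] = (w[:, None] * feats[mask]).sum(axis=0) / (w.sum() + 1e-8)
    return out

In the actual model, stacks of such convolutions are interleaved with pooling to obtain multi-scale representations; the sketch above only shows a single, non-learned layer.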

Cited by 7 publications (13 citation statements). References 35 publications.
“…Molformer [95] is a variant of Transformer [90] and operates on 3D heterogeneous molecular graphs with motifs. IEConv [35] designs a convolution operator that considers the primary, secondary, and tertiary structure of proteins and a set of hierarchical pooling operators for multi-scale modeling. 3DCNN and 3DGCN [87] are also competitive 3D methods.…”
Section: Methods
confidence: 99%
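The hierarchical pooling mentioned in this statement can be pictured with a short sketch. This is an assumption about the general idea rather than the operator defined in [35]: the protein is coarsened by merging residues that are adjacent along the backbone, so each coarser level still respects the chain topology. Names below are hypothetical.

import numpy as np

def pool_along_chain(coords, feats):
    """One pooling level: merge consecutive residues in pairs.
    coords: (N, 3) positions; feats: (N, C) features; returns ~N/2 nodes."""
    n = coords.shape[0]
    pairs = [(i, min(i + 1, n - 1)) for i in range(0, n, 2)]                  # neighbors in sequence
    new_coords = np.stack([(coords[a] + coords[b]) / 2.0 for a, b in pairs])  # average positions
    new_feats = np.stack([np.maximum(feats[a], feats[b]) for a, b in pairs])  # max-pool features
    return new_coords, new_feats

Applying the function repeatedly yields the kind of multi-scale hierarchy the citation refers to, with each level roughly halving the number of nodes.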
“…Apart from that, Zhang et al. [98] combine multi-view contrastive learning and self-prediction learning to encode geometric features of proteins. These semantic representations learned from SSL are then utilized for downstream tasks including structure classification [35], model quality assessment [4], and function prediction [30]. Nevertheless, no preceding research has explored the potential of pre-training on this sort of spatial-temporal data, partly because of the high cost of running MD simulations.…”
Section: Related Work
confidence: 99%
“…Amidi et al. (2018) employ a similar idea to classify enzyme classes using a 3D CNN. 3D CNNs also shed light on other tasks such as interface prediction (Townshend et al., 2019) and protein fold recognition (Hermosilla et al., 2020). Gainza et al. (2020) and Sverrisson et al. (2021) extend 3D CNNs to spherical convolutions operating on radius regions, which can also be naturally applied to the Fourier space (Zhemchuzhnikov et al., 2021) and the 3D Voronoi tessellation space (Igashov et al., 2021).…”
Section: Related Work
confidence: 99%
“…Therefore, processing such 3D structures is key for protein function analysis. While we have witnessed remarkable progress in protein structure prediction (Rohl et al., 2004; Källberg et al., 2012; Baek et al., 2021; Jumper et al., 2021), another thread of tasks taking protein 3D structures as input has started to draw great interest, such as function prediction (Hermosilla et al., 2020; Gligorijević et al., 2021), decoy ranking (Lundström et al., 2001; Kwon et al., 2021; Wang et al., 2021), protein docking (Duhovny et al., 2002; Shulman-Peleg et al., 2004; Gainza et al., 2020; Sverrisson et al., 2021), and driver mutation identification (Lefèvre et al., 1997; Antikainen & Martin, 2005; Li et al., 2020; Jankauskaitė et al., 2019).…”
Section: Introduction
confidence: 99%
“…For example, representing proteins by their amino acid sequences has been shown to provide powerful structural information by comparing sequences to each other [14] and extracting rich unsupervised learning representations using self-attention Transformers [20]. From a geometric viewpoint, several works have aimed to encode protein structural priors directly within neural network architectures to model proteins hierarchically [21,22], as computationally-efficient point clouds [23,24], or as k-nearest neighbors (k-NN) geometric graphs [25,26] for tasks such as protein function prediction [27], protein model quality assessment [28], and protein interaction region prediction [29].…”
Section: Related Work
confidence: 99%
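Several of the statements above mention k-nearest-neighbor geometric graphs built from protein structures. As a concrete but generic illustration (not the pipeline of any specific cited work), such a graph can be constructed directly from C-alpha coordinates; the function name and edge layout below are assumptions.

import numpy as np

def knn_graph(coords, k=10):
    """Build a directed k-NN graph from (N, 3) C-alpha coordinates.
    Returns an (N*k, 2) array of edges (source residue, neighbor residue)."""
    dists = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    np.fill_diagonal(dists, np.inf)           # exclude self-loops
    nbrs = np.argsort(dists, axis=1)[:, :k]   # k nearest residues per node
    src = np.repeat(np.arange(coords.shape[0]), k)
    return np.stack([src, nbrs.reshape(-1)], axis=1)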