2021
DOI: 10.3389/fgene.2021.656140
|View full text |Cite
|
Sign up to set email alerts
|

TMP- SSurface2: A Novel Deep Learning-Based Surface Accessibility Predictor for Transmembrane Protein Sequence

Abstract: Transmembrane protein (TMP) is an important type of membrane protein that is involved in various biological membranes related biological processes. As major drug targets, TMPs’ surfaces are highly concerned to form the structural biases of their material-bindings for drugs or other biological molecules. However, the quantity of determinate TMP structures is still far less than the requirements, while artificial intelligence technologies provide a promising approach to accurately identify the TMP surfaces, mere… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 43 publications
0
7
0
Order By: Relevance
“…This smaller dataset was termed 'MCP-Small'. To ensure that the training set is relatively sufcient [24,60], the 898 sequences were divided into three subsets: 718 randomly chosen membrane protein sequences formed the training set, 90 membrane protein sequences were used as the test set, and the other 90 membrane protein sequences were used as the validation set. Such a ∼8:1:1 dataset construction ensures a relatively larger training set with reasonable validation and test sets for better convergence, which was adopted and recommended by previous work [61,62].…”
Section: Generation Of the Dataset For The Mcp Model Trainingmentioning
confidence: 99%
“…This smaller dataset was termed 'MCP-Small'. To ensure that the training set is relatively sufcient [24,60], the 898 sequences were divided into three subsets: 718 randomly chosen membrane protein sequences formed the training set, 90 membrane protein sequences were used as the test set, and the other 90 membrane protein sequences were used as the validation set. Such a ∼8:1:1 dataset construction ensures a relatively larger training set with reasonable validation and test sets for better convergence, which was adopted and recommended by previous work [61,62].…”
Section: Generation Of the Dataset For The Mcp Model Trainingmentioning
confidence: 99%
“…Last, TMP-SSurface2 predicts the RSA for residues placed in a membrane environment rather than a direct exposure classification. Interestingly, the Pearson’s R in the same range as for our independent OPM test set (Figure D) was reported …”
Section: Discussionmentioning
confidence: 65%
“…69 The same predictors but with different relative performances were identified for the OPM test set. On the OPM test set, we also compared TopProperty RSA predictions to TMP-SSurface-2, 70 a recently published method specifically trained on TMPs. Although TopProperty was not trained specifically on TMPs, it performed equivalently or better compared to TMP-SSurface-2.…”
Section: ■ Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…This smaller dataset was termed ‘MCP-Small’. To ensure that the training set is relatively sufficient [ 24 , 62 ], the 898 sequences were divided into three subsets: 718 randomly chosen membrane protein sequences formed the training set, 90 membrane protein sequences were used as the test set, and the other 90 membrane protein sequences were used as the validation set. Such a ∼8:1:1 dataset construction ensures a relatively larger training set with reasonable validation and test sets for better convergence, which was adopted and recommended by previous work [ 63 , 64 ].…”
Section: Methodsmentioning
confidence: 99%