2016
DOI: 10.2174/1574893611666151109175216
|View full text |Cite
|
Sign up to set email alerts
|

Characterization of Graphs for Protein Structure Modeling and Recognition of Solubility

Abstract: This paper deals with the relations among structural, topological, and chemical properties of the E.Coli proteome from the vantage point of the solubility/aggregation propensity of proteins. Each E.Coli protein is initially represented according to its known folded 3D shape. This step consists in representing the available E.Coli proteins in terms of graphs. We first analyze those graphs by considering pure topological characterizations, i.e., by analyzing the mass fractal dimension and the distribution underl… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
11
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 17 publications
(12 citation statements)
references
References 67 publications
(95 reference statements)
1
11
0
Order By: Relevance
“…The first ensemble of graphs contains PCNs, directly obtained from the 3D native structures resolved for the E. coli proteome [36,37]. Each vertex is defined as the alpha carbon of a residue; edges are added among two residues if their Euclidean distance is within the [4,8]Å range.…”
Section: Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…The first ensemble of graphs contains PCNs, directly obtained from the 3D native structures resolved for the E. coli proteome [36,37]. Each vertex is defined as the alpha carbon of a residue; edges are added among two residues if their Euclidean distance is within the [4,8]Å range.…”
Section: Datasetmentioning
confidence: 99%
“…The contribution of this paper consists in a two-step generative model for PCNs; the first stage of our method takes inspiration from the work of Bartoli et al [5]. The dataset considered in our study consists of four ensembles (classes) of networks: i) actual PCNs elaborated from the E. coli proteome [36,37], ii) synthetic networks generated according to the recipe of Bartoli et al [5], iii) synthetic modular networks generated with the method proposed by Sah et al [57], and finally iv) those generated with our method. We evaluate the soundness of the proposed approach by focusing on mesoscopic analyses.…”
Section: Introductionmentioning
confidence: 99%
“…The biological networks analysed in this work are partially linked to those analysed in our previous works [30,31]. We consider 400 E. coli protein contact networks (PCN) as the main object of study and we compare them to several models.…”
Section: The Considered Datamentioning
confidence: 99%
“…The analysis of large volumes of data is hampered by many technical problems, including the ones related to the quality and interpretation of associated information. One-class classifier design is an important research endeavour [1], [2] that can be used to tackle problems of anomaly/novelty detection or, more generally, to recognize outliers in incoming data [3]- [8]. Several different methods have been proposed in the literature, including clustering-based techniques, kernel methods, and statistical approaches (see [9] for a recent survey).…”
Section: Introductionmentioning
confidence: 99%
“…We show experimental results on both synthetic and realworld datasets for one-class classification, containing samples represented as feature vectors and labeled graphs. In this paper, in addition to evaluating the method on well-known benchmarks, we also face the challenging problem of protein solubility recognition [3]. Classification of proteins with respect to their solubility degree is a hard yet very important scientific problem, with consequences related to the folding of such macro-molecules [43].…”
Section: Introductionmentioning
confidence: 99%