2003
DOI: 10.1023/b:jcam.0000004603.15856.32
|View full text |Cite
|
Sign up to set email alerts
|

Representation, searching and discovery of patterns of bases in complex RNA structures

Abstract: SummaryWe describe a graph theoretic method designed to perform efficient searches for substructural patterns in nucleic acid structural coordinate databases using a simplified vectorial representation. Two vectors represent each nucleic acid base and the relative positions of bases with respect to one another are described in terms of distances between the defined start and end points of the vectors on each base. These points comprise the nodes and the distances the edges of a graph, and a pattern search can … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2005
2005
2015
2015

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 38 publications
(44 citation statements)
references
References 51 publications
0
44
0
Order By: Relevance
“…Now we reject some of these partial candidates by applying the subset screening criterion (10) with I = {1, 2, 3}; we reject (p, q, r) triples for which (15) because we can be certain that any m-nucleotide candidate for which (p, q, r) correspond to ( (p 1 ,…, p k , q) for (1, 2,…, k, k + 1) provided that (16) so that all pairwise constraints between the k existing nucleotides and the one new nucleotide are met. We then use the subset screening criterion; we reject those candidates for which: (17) Here S * is the corresponding sum for (p 1 ,…, p k ). Thus, when adding the next nucleotide, k pairwise constraints must be checked, and k additional terms must be summed to apply the subset screening criterion.…”
Section: Building Lists Of Partial Candidates and The Screening Algormentioning
confidence: 99%
See 2 more Smart Citations
“…Now we reject some of these partial candidates by applying the subset screening criterion (10) with I = {1, 2, 3}; we reject (p, q, r) triples for which (15) because we can be certain that any m-nucleotide candidate for which (p, q, r) correspond to ( (p 1 ,…, p k , q) for (1, 2,…, k, k + 1) provided that (16) so that all pairwise constraints between the k existing nucleotides and the one new nucleotide are met. We then use the subset screening criterion; we reject those candidates for which: (17) Here S * is the corresponding sum for (p 1 ,…, p k ). Thus, when adding the next nucleotide, k pairwise constraints must be checked, and k additional terms must be summed to apply the subset screening criterion.…”
Section: Building Lists Of Partial Candidates and The Screening Algormentioning
confidence: 99%
“…Harrison and co-workers also apply subgraph isomorphisms to search for motifs in graphs representing RNA 3D structure [17], using methods that were first developed for substructure searching libraries of small molecule structures and then applied to proteins, carbohydrates, and most recently, RNA [4]. For RNA structure searching, each base is represented by two vectors and the whole RNA structure as a labeled graph so the search problem is reduced to finding subgraph isomorphisms representing query motifs in graphs representing RNA 3D structures.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Cluster analysis of root-mean-square deviations between pairs of RNA fragments has been used as the basis for cluster analysis of RNA loop structures to group tetraloops into various families (Huang et al 2005). A graph theoretic approach has been used to search for substructure patterns in RNA structures using a vectorial approach (Harrison et al 2003). Finally, the Structural Classification of RNA (SCOR), database has been developed to organize and classify RNA structural motifs (Klosterman et al 2002;Tamura et al 2004).…”
Section: Tools For Identifying and Classifying Elements And Motifsmentioning
confidence: 99%
“…However, we have also made other contributions, e.g., in the application of chemoinformatics techniques to the representation and searching of biological macromolecules such as protein and RNA structures, [71][72][73][74][75] in the analysis of matched molecular pairs [76] and in charting the historical development of chemoinformatics and of its associated literature. [77][78][79][80] We have also sought to influence the development of the field by means of conferences and educational programmes.…”
Section: Other Contributionsmentioning
confidence: 99%