2014
DOI: 10.2174/1389202915999140328162433
|View full text |Cite
|
Sign up to set email alerts
|

A Brief Review: The Z-curve Theory and its Application in Genome Analysis

Abstract: In theoretical physics, there exist two basic mathematical approaches, algebraic and geometrical methods, which, in most cases, are complementary. In the area of genome sequence analysis, however, algebraic approaches have been widely used, while geometrical approaches have been less explored for a long time. The Z-curve theory is a geometrical approach to genome analysis. The Z-curve is a three-dimensional curve that represents a given DNA sequence in the sense that each can be uniquely reconstructed given th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
30
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 43 publications
(30 citation statements)
references
References 137 publications
(282 reference statements)
0
30
0
Order By: Relevance
“…VWZ method extracts Z-curve descriptors [ 67 ] of k-mer features (for k = 1, …, 6) and uses a partial least squares (PLS) classifier combined with an iterative feature selection procedure to eliminate irrelevant and highly-correlated features. The method was evaluated on two data sets: Dataset-1 contains 576 σ 70 promoters (positive samples) and 836 coding fragments (negative samples) of E. coli; Dataset-2 contains the same 576 σ 70 promoters (positive samples) and 825 non-coding fragments (negative samples) of E. coli.…”
Section: Resultsmentioning
confidence: 99%
“…VWZ method extracts Z-curve descriptors [ 67 ] of k-mer features (for k = 1, …, 6) and uses a partial least squares (PLS) classifier combined with an iterative feature selection procedure to eliminate irrelevant and highly-correlated features. The method was evaluated on two data sets: Dataset-1 contains 576 σ 70 promoters (positive samples) and 836 coding fragments (negative samples) of E. coli; Dataset-2 contains the same 576 σ 70 promoters (positive samples) and 825 non-coding fragments (negative samples) of E. coli.…”
Section: Resultsmentioning
confidence: 99%
“…Early attempts towards graphical representations of DNA were made by Hamori and Ruskin in 1983 [3], Hamori in 1985 [4], and Gates in 1985 [5]. Afterwards, more graphical representations of DNA sequences were well developed by researchers [1,2,[6][7][8][9][10][11][12][13][14][15]. In comparison with DNA, graphical representations of proteins emerged only very recently [2,[16][17][18][19][20][21][22][23][24][25][26][27].…”
Section: Introductionmentioning
confidence: 99%
“…In 2000, Randic et al [8] generalized these 2-D graphical representations to a 3-D graphical representation, in which the center of a cube is chosen as the origin of the Cartesian (x,y,z) coordinate system, and the four corners with coordinates (+1,´1,´1), (´1,+1,´1), (´1,´1,+1), and (+1,+1,+1) are assigned to the four bases. Some other graphical representations of bio-sequences and their applications in the field of biological science and technology can be found in [9][10][11][12][13][14][15][16][17][18][19][20][21][22][23][24].…”
Section: Introductionmentioning
confidence: 99%