2004
DOI: 10.1089/cmb.2004.11.1050
|View full text |Cite
|
Sign up to set email alerts
|

Permutation Pattern Discovery in Biosequences

Abstract: Functionally related genes often appear in each others neighborhood on the genome, however the order of the genes may not be the same. These groups or clusters of genes may have an ancient evolutionary origin or may signify some other critical phenomenon and may also aid in function prediction of genes. Such gene clusters also aid toward solving the problem of local alignment of genes. Similarly, clusters of protein domains, albeit appearing in different orders in the protein sequence, suggest common functiona… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2007
2007
2020
2020

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 28 publications
(20 citation statements)
references
References 21 publications
0
20
0
Order By: Relevance
“…Jumbled pattern matching has numerous applications in the field of bioinformatics, such as alignments of sequences [3], SNP discovery [5], discovery of repeated patterns [21], interpretation of mass spectrometry data [4] and permutation pattern discovery in biosequences [21].…”
Section: Applicationsmentioning
confidence: 99%
See 1 more Smart Citation
“…Jumbled pattern matching has numerous applications in the field of bioinformatics, such as alignments of sequences [3], SNP discovery [5], discovery of repeated patterns [21], interpretation of mass spectrometry data [4] and permutation pattern discovery in biosequences [21].…”
Section: Applicationsmentioning
confidence: 99%
“…In the case of discovery of repeated patterns [21], jumbled matching algorithms can be used to solve the problem of local alignment of genes.…”
Section: Gene Clustringmentioning
confidence: 99%
“…On the one hand, the factor complexity of a given infinite word w is a function ρ w : N → N that maps an integer n to the number of distinct factors of w of length n. On the other hand, the abelian complexity of w is a function ρ ab w : N → N that maps an integer n to the number of distinct Parikh vectors of factors of w of length n. Parikh vectors, which appear in the literature under various names such as compomers [5,6], jumbled patterns [8,7], permutation patterns [9,18,24], commutative closures [19], content vectors [19], to name a few, are vectors that record the frequency of each letter in the factors. So the abelian complexity counts the number of distinct factors of a given length up to letter permutation.…”
Section: Introductionmentioning
confidence: 99%
“…It turns out that the collection of sub-clusters S has an elegant form that can be captured as a nested structure (Eres et al, 2004) or a PQ tree (Landau et al, 2005), a data structure that was originally Computational Biology Center, IBM T.J. Watson Research Center, Yorktown Heights, New York.…”
Section: Introductionmentioning
confidence: 99%