2010
DOI: 10.1016/j.jtbi.2009.10.033
|View full text |Cite
|
Sign up to set email alerts
|

The Burrows–Wheeler similarity distribution between biological sequences based on Burrows–Wheeler transform

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
14
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 25 publications
(14 citation statements)
references
References 57 publications
0
14
0
Order By: Relevance
“…If one sequence which is given the information contained in the other sequence is significantly compressible, the two sequences are considered to be close. There are also some important methods which are based on compression algorithm but do not actually apply the compression, such as Lemple-Ziv complexity and Burrows-Wheeler transform (Otu and Sayood, 2003;Mantaci et al, 2007Mantaci et al, , 2008Yang et al, 2010).…”
Section: Introductionmentioning
confidence: 99%
“…If one sequence which is given the information contained in the other sequence is significantly compressible, the two sequences are considered to be close. There are also some important methods which are based on compression algorithm but do not actually apply the compression, such as Lemple-Ziv complexity and Burrows-Wheeler transform (Otu and Sayood, 2003;Mantaci et al, 2007Mantaci et al, , 2008Yang et al, 2010).…”
Section: Introductionmentioning
confidence: 99%
“…Such capabilities of the BW T have originated the field of Compressed Full-text Self-indices [29,38]. The remarkable properties of the BW T have aroused great interest both from the theoretical and applicative points of view [30,31,35,48,27,7,26,44,17,40].…”
Section: Introductionmentioning
confidence: 99%
“…A class of similarity measures was defined by Mantaci et al [17] over an extension of the Burrows-Wheeler transform for string collections, called eBWT [16]. Later, Yang et al [34,35] recrafted the method by Mantaci et al and introduced the Burrows-Wheeler similarity distribution (BWSD) of two strings S 1 and S 2 based on the BWT of their concatenation. The authors evaluated similarity measures based on the expectation and Shannon entropy of the BWSD to efficiently construct phylogenetic trees for DNA and protein sequences, thus contributing to an alternative to alignment-based similarity measure among biological sequences.…”
Section: Introductionmentioning
confidence: 99%
“…In this article we present two new algorithms to compute the Burrows-Wheeler similarity distribution and we show how to efficiently compute BWSDbased distances among all pairs of strings in a collection. Our algorithms compute the BWT for the concatenation of all strings only once, instead of the pairwise construction of BWTs proposed by Yang [34,35], and use compressed data structures that allow reductions of the running time while still keeping a small memory usage, as shown by a set of experiments with real and artificial datasets. We also present both space-efficient alternatives and parallel versions of our algorithms, that achieved good time/space trade-off and speedup factors in our experiments, thus enabling the evaluation of the measure at larger scales.…”
Section: Introductionmentioning
confidence: 99%