2005
DOI: 10.1007/11496656_17
|View full text |Cite
|
Sign up to set email alerts
|

DNA Compression Challenge Revisited: A Dynamic Programming Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
48
0
3

Year Published

2006
2006
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 75 publications
(51 citation statements)
references
References 8 publications
0
48
0
3
Order By: Relevance
“…Due to space limitations, we present here the most efficient algorithms, including BioCompress-2 (BioC) [12], GenCompress (GenC) [6], DNACompress (DNAC) [7], DNAPack (DNAP) [4], CDNA [15] and GeMNL [14]. Comparison with other DNA compressors can be found on the website: ftp://ftp.infotech.monash.edu.au/software/DNAcompress-XM/ The results of CDNA are reported for only 9 sequences in precision of two decimal places.…”
Section: Sequencementioning
confidence: 99%
See 1 more Smart Citation
“…Due to space limitations, we present here the most efficient algorithms, including BioCompress-2 (BioC) [12], GenCompress (GenC) [6], DNACompress (DNAC) [7], DNAPack (DNAP) [4], CDNA [15] and GeMNL [14]. Comparison with other DNA compressors can be found on the website: ftp://ftp.infotech.monash.edu.au/software/DNAcompress-XM/ The results of CDNA are reported for only 9 sequences in precision of two decimal places.…”
Section: Sequencementioning
confidence: 99%
“…The CTW+LZ algorithm developed by Matsumoto et al [16] encodes significantly long repeats by the substitution method, and encodes short repeats and non repeat areas by context tree weighting [23]. At the cost of time complexity, DNAPack Behzadi and Fessant [4] employs a dynamic programming approach to find repeats. Non-repeat regions are encoded by the best choice from an order 2 Markov model, context tree weighting, and naive 2 bits per symbol methods.…”
Section: Introductionmentioning
confidence: 99%
“…For these sequences which contain small scale of repetitive fragments, this type algorithm can not obtain better performance. On the other side, the reference algorithms [5,6] attract the interesting of researchers, which reason to the high compression rate of this type algorithm. In [5], the compression rate can be 80 when the human genome sequence are encoded by using the corresponding algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…However, for these genome sequences with low repetitive fragments , such as the microbial genome sequence, this type algorithm is not suitable. Actually, for these genome sequences, the context based entropy coding scheme [6][7][8][9][10] is more powerful than other types of algorithms.…”
Section: Introductionmentioning
confidence: 99%
“…To help manage large genomic databases, compression algorithms that capture and efficiently encode this repeated information are employed. Compression algorithms specific to DNA sequences have been around for some time [6,7,8,9,10,11,12]. How-ever, most existing algorithms are unsuitable for compressing large datasets of multiple sequences.…”
Section: Introductionmentioning
confidence: 99%