1994
DOI: 10.1016/0306-4573(94)90014-0
|View full text |Cite
|
Sign up to set email alerts
|

A new challenge for compression algorithms: Genetic sequences

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
143
0
3

Year Published

2000
2000
2013
2013

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 231 publications
(147 citation statements)
references
References 10 publications
1
143
0
3
Order By: Relevance
“…2), 1,000 human genomes contain less than twice the unique information of one genome. Thus, although individual genomes are not very compressible 12,13 , collections of related genomes are extremely compressible [14][15][16][17] .…”
Section: Sublinear Analysis and Compressed Datamentioning
confidence: 99%
See 1 more Smart Citation
“…2), 1,000 human genomes contain less than twice the unique information of one genome. Thus, although individual genomes are not very compressible 12,13 , collections of related genomes are extremely compressible [14][15][16][17] .…”
Section: Sublinear Analysis and Compressed Datamentioning
confidence: 99%
“…As more divergent genomes are added to a database, Many algorithms exist for the compression of genomic data sets purely to reduce the space required for storage and transmission [12][13][14][15]17,18 . Hsi-Yang Fritz et al 18 provide a particularly instructive discussion of the concerns involved.…”
Section: Challenges Of Compressive Algorithmsmentioning
confidence: 99%
“…The compression of DNA sequences is not an easy task. (Grumback and Tahi 1994 [6], Rivals et al 1995 [7]; Chen et al 2000 [8]) DNA sequences consists of only four nucleotides bases {A, C, G, T}. Two bits are enough to store each base.…”
Section: Introductionmentioning
confidence: 99%
“…Our interest in compression here is not for saving file space or communication bandwidth, but in measuring the fit between a model and a sequence. Agarawal and States [11] and Grumbach and Tahi [12] recognised the relevance of compression to pattern discovery in biological sequences. Loewenstern and Yianilos [13] modified a popular file compression algorithm for use with DNA sequences by allowing a certain number of mismatches against past 'contexts'.…”
Section: Introductionmentioning
confidence: 99%