2020
DOI: 10.1371/journal.pone.0232942
|View full text |Cite
|
Sign up to set email alerts
|

Vertical lossless genomic data compression tools for assembled genomes: A systematic literature review

Abstract: The recent decrease in cost and time to sequence and assemble of complete genomes created an increased demand for data storage. As a consequence, several strategies for assembled biological data compression were created. Vertical compression tools implement strategies that take advantage of the high level of similarity between multiple assembled genomic sequences for better compression results. However, current reviews on vertical compression do not compare the execution flow of each tool, which is constituted… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 111 publications
(188 reference statements)
0
4
0
Order By: Relevance
“…Kredens KV et.al. [24] presented a detailed systematic literature survey and Tang T and Li J [25] presented a comparative analysis of the existing compression tools for Genome sequences.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Kredens KV et.al. [24] presented a detailed systematic literature survey and Tang T and Li J [25] presented a comparative analysis of the existing compression tools for Genome sequences.…”
Section: Related Workmentioning
confidence: 99%
“…Input data: 11 24 11 43 24 11 Frequency of Input data is given as (u,v) where u represents the data and v represents the frequency of the data. Huffman Tree for the Input data 11 24 11 43 24 11 having the frequency (11,3), (24,2), (43,1) is shown in Fig. 4.…”
Section: Examplementioning
confidence: 99%
See 1 more Smart Citation
“…The development of short reads and long reads sequencing technologies has greatly reduced the cost of obtaining genomics sequencing data, the price dropping from $5292.390/MB in 2002 to $0.006/MB in 2022 ( Wetterstrand 2023 ). This has propelled rapid advancements in virus tracing, precision diagnosis treatment, and new drug development ( Hernaez et al 2019 , Kredens et al 2020 , Liu and Li 2021 , Sun et al 2023b ). As a result, the growth rate of sequencing data surpasses the Moore’s law ( Schaller 1997 , Hernaez et al 2019 ).…”
Section: Introductionmentioning
confidence: 99%