2020 IEEE International Symposium on Information Theory (ISIT) 2020
DOI: 10.1109/isit44484.2020.9174113

Reconstruction of Strings from their Substrings Spectrum

Cited by 10 publications (7 citation statements)
References 21 publications
“…These results are specifically discussed in remark 6, remark 7, and remark 9. Our work extends the growing list of recent work in string reconstruction problems [14]-[20].…”
Section: Introduction
confidence: 54%
“…Table 1 reports the data for human chromosome 1 up to k = 40. It appears remarkable that already in the range [28–40] for the k values, the genomic coverage of the k-segmentation is almost total (over positions where there is no N), and almost all the k-mers present in the chromosome are involved by k-spectral maximal segments (see the normalized cardinality of set U_k), that is, they have the property to be univocally elongated in the k-spectrum. Essentially, we see that for k = 27 human chromosome 1 may be covered by 4 millions of relatively short (80 bp long) spectral maximal segments, which have of course a high value of multiplicity in the segmentation, whereas for k = 44 about half of the spectral maximal segments (1.985.225) of average length 157 cover the chromosome.…”
Section: Computational Results
confidence: 99%
“…In sequence analysis, the term "spectrum" is used in many contexts and with several meanings (see for example, [30,32,21,19,31]), in particular to tackle the problem of reconstructing a string from the compositions of its substrings. Other terms are used with a similar meaning, as bags of words, where a k-spectrum is represented by a multiplicity vector, where the i-component gives the multiplicity of the k-mer of position i in some prefixed (for example lexicographic) order [23].…”
Section: Introduction
confidence: 99%
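
The multiplicity-vector view of a k-spectrum described in the statement above can be illustrated with a short sketch. This is only an illustration under stated assumptions: the DNA alphabet "ACGT", the lexicographic ordering of k-mers, and the function name kmer_multiplicity_vector are all hypothetical choices made here, not taken from the cited works.

```python
from collections import Counter
from itertools import product


def kmer_multiplicity_vector(s, k, alphabet="ACGT"):
    """Return (kmers, vector): all k-mers over `alphabet` in lexicographic
    order, and the number of occurrences of each one in `s`.

    Hypothetical helper for illustration; assumes `s` only uses `alphabet`.
    """
    # Count every length-k window of s (overlapping windows included).
    counts = Counter(s[i:i + k] for i in range(len(s) - k + 1))
    # Enumerate all |alphabet|^k k-mers in lexicographic order.
    kmers = ["".join(p) for p in product(sorted(alphabet), repeat=k)]
    # Component i is the multiplicity of the i-th k-mer in that order.
    return kmers, [counts[m] for m in kmers]


if __name__ == "__main__":
    kmers, vec = kmer_multiplicity_vector("ACGAC", 2)
    # Print only the k-mers that actually occur in the string.
    print([(m, c) for m, c in zip(kmers, vec) if c > 0])
```

The vector has one component per possible k-mer, so its length grows as 4^k for this alphabet; for large k a sparse mapping from k-mer to count is likely the more practical representation.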
“…The authors of [8] also proposed two encoding schemes of ℓ-repeat free strings; the first one uses a single redundancy symbol and supports ℓ = 2⌈log(n)⌉ + 2, while the second works for substrings of length ℓ = ⌈log n⌉ + ⌈2 log log n⌉ + 5 and its asymptotic rate approaches 1. Extensions of this problem to the setup where the ℓ-profiles are not received error-free were also studied recently [5], [9], [19], [28].…”
Section: Introduction
confidence: 99%
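
The two substring lengths quoted in the statement above can be evaluated numerically, together with a direct check of the repeat-free property. This is a minimal sketch under assumptions not stated in the quoted text: an ℓ-repeat-free string is taken to mean one in which every length-ℓ substring occurs at most once, logarithms are taken base 2 (binary alphabet), and all function names are hypothetical. It does not implement the encoding schemes of [8].

```python
import math


def is_repeat_free(s, ell):
    """True if no length-ell substring of s occurs more than once
    (assumed definition of an ell-repeat-free string)."""
    seen = set()
    for i in range(len(s) - ell + 1):
        window = s[i:i + ell]
        if window in seen:
            return False
        seen.add(window)
    return True


def ceil_log2(n):
    """Exact ceil(log2(n)) for an integer n >= 1, avoiding float rounding."""
    return (n - 1).bit_length()


def ell_single_redundancy(n):
    """ell = 2*ceil(log n) + 2, with log assumed base 2."""
    return 2 * ceil_log2(n) + 2


def ell_rate_one(n):
    """ell = ceil(log n) + ceil(2 log log n) + 5, with log assumed base 2.
    Requires n >= 2 so that log log n is defined."""
    return ceil_log2(n) + math.ceil(2 * math.log2(math.log2(n))) + 5


if __name__ == "__main__":
    for n in (16, 1024):
        print(n, ell_single_redundancy(n), ell_rate_one(n))
    print(is_repeat_free("0110", 2), is_repeat_free("0101", 2))
```

For small n the constant 5 dominates and the second length can exceed the first; the second expression becomes the smaller one only as n grows, since it scales like log n rather than 2 log n.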