2019 IEEE Information Theory Workshop (ITW) 2019
DOI: 10.1109/itw44776.2019.8988939
|View full text |Cite
|
Sign up to set email alerts
|

The Metagenomic Binning Problem: Clustering Markov Sequences

Abstract: The goal of metagenomics is to study the composition of microbial communities, typically using high-throughput shotgun sequencing. In the metagenomic binning problem, we observe random substrings (called contigs) from a mixture of genomes and want to cluster them according to their genome of origin. Based on the empirical observation that genomes of different bacterial species can be distinguished based on their tetranucleotide frequencies, we model this task as the problem of clustering N sequences generated … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…We first define two information-theoretic measures (Moulin and Veeravalli, 2018;Cover and Thomas, 2006;Greenberg and Shomorony, 2019) used in the properties following.…”
Section: Markov Model and Propertiesmentioning
confidence: 99%
See 1 more Smart Citation
“…We first define two information-theoretic measures (Moulin and Veeravalli, 2018;Cover and Thomas, 2006;Greenberg and Shomorony, 2019) used in the properties following.…”
Section: Markov Model and Propertiesmentioning
confidence: 99%
“…In order to develop a mathematically sound TNF-based orientation test, we utilize a probabilistic model for generating a genome with a given TNF. This model is based on a framework previously used to study the information-theoretic limits of metagenomic binning (Greenberg and Shomorony, 2019). Note that an i.i.d.…”
Section: Introductionmentioning
confidence: 99%