2016
DOI: 10.1038/ncomms12797
|View full text |Cite
|
Sign up to set email alerts
|

Sequence element enrichment analysis to determine the genetic basis of bacterial phenotypes

Abstract: Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity hampers the use of traditional SNP-based methods for identifying all genetic associations with phenotypic variation. Here we introduce a computationally scalable and widely applicable statistical method (SEER) for the identification of sequence elements that are significantly enriched in a phenotype of interest. SEER is applicable to tens of thousands of genomes by counting variable-length k-mers using a distrib… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
160
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
3
2
2
2

Relationship

2
7

Authors

Journals

citations
Cited by 183 publications
(160 citation statements)
references
References 50 publications
0
160
0
Order By: Relevance
“…Finally, we used k-mer distances 30 , mash 28 and andi 29 to create distance matrices. andi counts the number of mismatches between equally spaced maximal exact matches between a pair of sequences.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, we used k-mer distances 30 , mash 28 and andi 29 to create distance matrices. andi counts the number of mismatches between equally spaced maximal exact matches between a pair of sequences.…”
Section: Resultsmentioning
confidence: 99%
“…Hamming distance between informative k-mers using a subsample of 1% of counted k-mers from assemblies 30 .…”
Section: Methodsmentioning
confidence: 99%
“…[33, 34]). However, as for association methods [28, 35], the true impact of genetic features is confounded by their phylogenetic distribution and population structure. Therefore, approaches to distinguish causal resistance genes from all correlated markers require additional experimental study.…”
Section: Discussionmentioning
confidence: 99%
“…Such correlations can indicate that the genes involved have epistatic effects on fitness or that their presence or absence is a result of similar selective pressures. Finally, the output of Panaroo seamlessly interfaces with pyseer (v1.3.0), a bacterial GWAS package (24,50). pyseer includes a wide range of methods for performing association studies allowing for phenotypic associations to be found with gene or structural presence/absence patterns.…”
Section: Structural Variationmentioning
confidence: 99%