2020
DOI: 10.1101/2020.01.28.922989
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Producing Polished Prokaryotic Pangenomes with the Panaroo Pipeline

Abstract: Population-level comparisons of prokaryotic genomes must take into account the substantial differences in gene content, resulting from frequent horizontal gene transfer, gene duplication and gene loss. However, the automated annotation of prokaryotic genomes is imperfect, and errors due to fragmented assemblies, contamination, diverse gene families and mis-assemblies accumulate over the population, leading to profound consequences when analysing the set of all genes found in a species. Here we introduce Panaro… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

3
137
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
5

Relationship

3
7

Authors

Journals

citations
Cited by 118 publications
(147 citation statements)
references
References 50 publications
3
137
0
Order By: Relevance
“…Plasmids were annotated using Prokka (version 1.14.6) [39]. Pangenome analysis used Panaroo (version 1.2.2) [40]. Core genes, soft core genes, shell genes and cloud genes are those found in [100, 99], (99, 95], (95, 15], and (15, 0] percent of sequences respectively.…”
Section: Discussionmentioning
confidence: 99%
“…Plasmids were annotated using Prokka (version 1.14.6) [39]. Pangenome analysis used Panaroo (version 1.2.2) [40]. Core genes, soft core genes, shell genes and cloud genes are those found in [100, 99], (99, 95], (95, 15], and (15, 0] percent of sequences respectively.…”
Section: Discussionmentioning
confidence: 99%
“…Additionally, the pan-genome functional analysis was carried out 120 utilizing KEGG, COG metabolic and functional pathways, which were all visualised with LibreOffice Calc plot functions. Panaroo (26) and PopPUNK (27) were also employed for pangenome investigation of gene profiles and clustering core and accessory genomes.…”
Section: Bioinformatics Analysismentioning
confidence: 99%
“…We did this using a custom mapping, variant calling, and local realignment around indels pipeline using bwa-MEM (Li, 2013), samtools mpileup (Li, 2011), and MUSCLE (Edgar, 2004), and then used the resulting whole-genome pseudo alignments to infer phylogenies for each cluster, using gubbins (Croucher et al, 2014), and RAxML, its underlying dependency (Stamatakis, 2014). In order to construct a phylogeny for the entire collection, and also the collection combined with global isolates, we used the newly published panaroo pan-genome pipeline (Tonkin-Hill et al, 2020) to infer a set of core genes for the entire collection. We concatenated alignments of all the core genes, and then used IQ-TREE (Nguyen et al, 2014), with the substitution model, GTR+F+I+G4, inferred by ModelFinder (Kalyaanamoorthy et al, 2017), to infer phylogenies for both the entire Burkina Faso collection and the Burkina Faso collection plus global isolates.…”
Section: R a F Tmentioning
confidence: 99%