2017
DOI: 10.1101/195826
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SAMSA2: A standalone metatranscriptome analysis pipeline

Abstract: BackgroundComplex microbial communities are an area of rapid growth in biology. Metatranscriptomics allows one to investigate the gene activity in an environmental sample via high-throughput sequencing. Metatranscriptomic experiments are computationally intensive because the experiments generate a large volume of sequence data and the sequences must be compared with many references.

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(20 citation statements)
references
References 23 publications
0
20
0
Order By: Relevance
“…Metagenomic data were analyzed using Kraken2 and Bracken to identify species-level shifts in community composition (Lu et al, 2017;Wood and Salzberg, 2014). Metatranscriptomic reads were analyzed at the whole-community level with the HMP Unified metabolic access network 2 (HUMAnN2) (Franzosa et al, 2018) and simple annotation of Metatranscriptomes by sequence analysis 2 (SAMSA2) (Westreich et al, 2018) pipelines to identify differentially abundant pathways, subsystems, and transcripts. Lastly, a previously published pipeline (Deng et al, 2018) was utilized to align metatranscriptomic reads to the genomes of individual bacterial species to identify species-specific transcriptional changes among highly abundant members of the microbiota in response to different classes of antibiotics (Figure 1A).…”
Section: Resultsmentioning
confidence: 99%
“…Metagenomic data were analyzed using Kraken2 and Bracken to identify species-level shifts in community composition (Lu et al, 2017;Wood and Salzberg, 2014). Metatranscriptomic reads were analyzed at the whole-community level with the HMP Unified metabolic access network 2 (HUMAnN2) (Franzosa et al, 2018) and simple annotation of Metatranscriptomes by sequence analysis 2 (SAMSA2) (Westreich et al, 2018) pipelines to identify differentially abundant pathways, subsystems, and transcripts. Lastly, a previously published pipeline (Deng et al, 2018) was utilized to align metatranscriptomic reads to the genomes of individual bacterial species to identify species-specific transcriptional changes among highly abundant members of the microbiota in response to different classes of antibiotics (Figure 1A).…”
Section: Resultsmentioning
confidence: 99%
“…RNA-Seq, in principle, enables studies of both the composition of active microbial communities and the biological functions being expressed by the constituent members. While several pipelines for the analysis of metatranscriptome data have been published (34)(35)(36)(37), including evaluation of how the results from such data compare to those of amplicon-or whole-genome shotgun-based methods, these studies primarily focused on bacterial communities or relatively low-complexity species mixes or used the total RNA pool (21,37,38). Other studies have used both RNA-Seq and amplicon sequencing together to describe microbial communities from a taxonomic and functional perspective (18,39).…”
Section: Discussionmentioning
confidence: 99%
“…An argument against using point-and-click computational computational pipelines is the inability to assign appropriate cut-offs for genes of interest [7]. An advantage of creating one's own pipeline or modifying one that is open source, such as SAMSA2 [31], is the ability for users to define e-value thresholds and use customized databases. SAMSA2, although built for metatranscriptomes, also works to assess gene abundances in metagenomes with the omission of the rRNA removal step, which is unnecessary for metagenomes.…”
Section: Discussionmentioning
confidence: 99%
“…The SAMSA2 pipeline [55] was modified for metagenomics analysis with the following steps: 1) reads mapping to the human genome were first removed with BMTagger [23], 2) paired-end reads were merged using PEAR [56], 3) sequence adaptor contamination and low quality bases were removed using Trimmomatic [57] and 4) the quality reads were then annotated against a protein reference database using DIAMOND, a high-throughput squence aligner [10]. DIAMOND is highly sensitive and runs at a speed that is up to 20,000 times faster than BLASTX and up to 2,500 times faster with the "sensitive" option.…”
Section: Metagenomic Sequence Analysismentioning
confidence: 99%