2020
DOI: 10.1101/2020.09.09.290247
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

rSWeeP: A R/Bioconductor package deal with SWeeP sequences representation

Abstract: The rSWeeP package is an R implementation of the SWeeP model, designed to handle Big Data. rSweeP meets to the growing demand for efficient methods of heuristic representation in the field of Bioinformatics, on platforms accessible to the entire scientific community. We explored the implementation of rSWeeP using a dataset containing 31,386 viral proteomes, performing phylogenetic and principal component analysis. As a case study we analyze the viral strains closest to the SARS-CoV, responsible for the current… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…Protein sequences were concatenated (with border delimiters) into proteomes which were represented in vectors using the SWeeP tool (Spaced Words Projection) (De Pierri et al, 2020 ). The R version of the SWeeP tool, used for the proteome vectorization, is available in the Bioconductor Platform 4 for R version 3.12 (Fernandes et al, 2020 ). Finally, we made the vector projection of the Brazilian genomes (coded in DNA) in the SWeeP tool in Matlab® (De Pierri et al, 2020 ) with its default parameters.…”
Section: Methodsmentioning
confidence: 99%
“…Protein sequences were concatenated (with border delimiters) into proteomes which were represented in vectors using the SWeeP tool (Spaced Words Projection) (De Pierri et al, 2020 ). The R version of the SWeeP tool, used for the proteome vectorization, is available in the Bioconductor Platform 4 for R version 3.12 (Fernandes et al, 2020 ). Finally, we made the vector projection of the Brazilian genomes (coded in DNA) in the SWeeP tool in Matlab® (De Pierri et al, 2020 ) with its default parameters.…”
Section: Methodsmentioning
confidence: 99%
“…Protein sequences were concatenated (with border delimiters) into proteomes which were represented in vectors using the SWeeP tool (Spaced Words Projection) (14). The R version of SWeeP tool, used for the proteome vectorization, is available in the Bioconductor Platform 4 for R version 3.12 (25). Finally, we made the vector projection of the Brazilian genomes (coded in DNA) in the SWeeP tool in Matlab (14) with its default parameters.…”
Section: Sequences Vectorial Representationmentioning
confidence: 99%