2021
DOI: 10.1101/2021.04.15.439956
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Time-series trend of pandemic SARS-CoV-2 variants visualized using batch-learning self-organizing map for oligonucleotide compositions

Abstract: To confront the global threat of coronavirus disease 2019, a massive number of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome sequences have been decoded, with the results promptly released through the GISAID database. Based on variant types, eight clades have already been defined in GISAID, but the diversity can be far greater. Owing to the explosive increase in available sequences, it is important to develop new technologies that can easily grasp the whole picture of the big-sequence… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
3
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(6 citation statements)
references
References 26 publications
3
3
0
Order By: Relevance
“…The comparison of March 2020, when almost all clades first appeared, with January 2021 showed that their sequences presented differential locations, and August 2020 showed an intermediate pattern. The existence of such time-series changes shows that the separation on the BLSOM has biological significance, as previously reported by Abe et al (2021).…”
Section: Blsom Of Short Oligonucleotidessupporting
confidence: 80%
See 2 more Smart Citations
“…The comparison of March 2020, when almost all clades first appeared, with January 2021 showed that their sequences presented differential locations, and August 2020 showed an intermediate pattern. The existence of such time-series changes shows that the separation on the BLSOM has biological significance, as previously reported by Abe et al (2021).…”
Section: Blsom Of Short Oligonucleotidessupporting
confidence: 80%
“…When studying the 2009 swine-derived flu pandemic (H1N1/2009), we detected directional time-series changes in the oligonucleotide composition due to possible adaptations to the new host, namely, humans (Iwasaki et al, 2011a), and these findings have shown that near-future prediction was possible, albeit partially (Iwasaki et al, 2013a). We found similar time-series changes in the oligonucleotide composition for ebolavirus, MERS coronavirus (Wada et al, 2016(Wada et al, , 2017 and SARS-CoV-2 (Wada et al, 2020b;Iwasaki et al, 2021;Abe et al, 2021). In the present paper, SARS-CoV-2 genomes, which are of great interest to society, were used as an operative example to illustrate how this unsupervised explainable AI supports efficient knowledge discovery from big sequence data.…”
Section: Introductionsupporting
confidence: 60%
See 1 more Smart Citation
“…When studying the 2009 swine-derived flu pandemic (H1N1/2009), we detected directional timeseries changes in the oligonucleotide composition due to possible adaptations to a new host, namely humans (Iwasaki et al, 2011a), and these findings have shown that near-future prediction was possible, albeit partially (Iwasaki et al, 2013b). We found similar time-series changes in the oligonucleotide composition for Zaire ebolavirus, MERS coronavirus (Wada et al, 2016(Wada et al, , 2017 and SARS-CoV-2 (Ikemura et al, 2020;Wada et al, 2020b;Abe et al, 2021;Iwasaki et al, 2021).…”
Section: Introductionmentioning
confidence: 56%
“…We have recently analyzed time-series changes in short and long oligonucleotide compositions in a large number of SARS-CoV-2 genomes and found many oligonucleotides that are expanding rapidly in the virus population, which allowed us to predict candidate advantageous mutations for growth in human cells (Ikemura et al, 2020;Wada et al, 2020b;Iwasaki et al, 2021). Furthermore, the oligonucleotide BLSOM can classify the virus sequences into not only the known clades but also their subgroups (Abe et al, 2021). After the above-mentioned publications, a large number of sequences of this virus have continued to accumulate and will exceed one million.…”
Section: Analyses Of Sars-cov-2mentioning
confidence: 99%