2022
DOI: 10.1038/s41597-022-01348-9
|View full text |Cite
|
Sign up to set email alerts
|

CoV2K model, a comprehensive representation of SARS-CoV-2 knowledge and data interplay

Abstract: Since the outbreak of the COVID-19 pandemic, many research organizations have studied the genome of the SARS-CoV-2 virus; a body of public resources have been published for monitoring its evolution. While we experience an unprecedented richness of information in this domain, we also ascertained the presence of several information quality issues. We hereby propose CoV2K, an abstract model for explaining SARS-CoV-2-related concepts and interactions, focusing on viral mutations, their co-occurrence within variant… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 29 publications
0
10
0
Order By: Relevance
“…The VCM focuses on such structuring function for the purposes of characterizing viral genomic sequences. The model has already being successfully used for designing several search and visualization systems [14][15][16] and has been linked to a Phenotype Data Dictionary [17] and a knowledge base of SARS-CoV-2 mutations' impacts [18,19]. A broader application and adoption of the VCM within the context of virology research (by both life science domain experts and information systems' developers), however, requires that a more ontology-oriented approach is embraced, allowing the unambiguous identification of entities in the context of heterogeneous information.…”
Section: Introductionmentioning
confidence: 99%
“…The VCM focuses on such structuring function for the purposes of characterizing viral genomic sequences. The model has already being successfully used for designing several search and visualization systems [14][15][16] and has been linked to a Phenotype Data Dictionary [17] and a knowledge base of SARS-CoV-2 mutations' impacts [18,19]. A broader application and adoption of the VCM within the context of virology research (by both life science domain experts and information systems' developers), however, requires that a more ontology-oriented approach is embraced, allowing the unambiguous identification of entities in the context of heterogeneous information.…”
Section: Introductionmentioning
confidence: 99%
“…CoVEffect brings a number of tangible results to the scientific community, which we here describe. Immediate integrated use of our resulting annotated database was made within our CoV2K [ 22 ] system by updating the AA_changes, Variant, and Effect entities. Other data-driven analysis resources developed by our group (such as VirusViz [ 78 ] and ViruClust [ 79 ]) could immediately benefit from the addition of structured tuples connecting mutations and effects.…”
Section: Discussionmentioning
confidence: 99%
“…Effects are chosen from a taxonomy, that is, a controlled vocabulary of terms, including, for example, transmissibility, disease severity, resistance to antiviral drugs, or change in the protein kinetics (flexibility or stability properties). We previously proposed an initial version of this vocabulary [ 22 , 62 ], which has now evolved into a complete list of effects organized by category (“epidemiology,” “immunology,” “viral kinetics and dynamics,” or “diagnosis, prevention, and treatments”). The full list can be found the AdditionalFile1-effects-taxonomy [ 63 ].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, the B.1.1 lineage [26] [27] arose in early 2020 which contained a double mutation in Ncap, Arg203Lys and Gly204Arg, [28] which spread throughout Europe and beyond. Further examples include Alpha (B.1.1.7) [29], Delta (B.1.617) [30] and Omicron (B.1.1.529) [31] amongst others [32] [33] [27]. Considerable ongoing sequence surveillance seems certain to identify further new variants [34] [35] [36] [37] [38].…”
Section: Introductionmentioning
confidence: 99%