2023
DOI: 10.1101/2023.05.13.540634
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Unrestricted Versus Regulated Open Data Governance: A Bibliometric Comparison of SARS-CoV-2 Nucleotide Sequence Databases

Nathanael Sheehan,
Federico Botta,
Sabina Leonelli

Abstract: We analyse ongoing efforts to share genomic data about SARS-COV-2 through a comparison of the characteristics of the Global Initiative on Sharing All Influenza Data and the Covid-19 Data Portal with respect to the representativeness and governance of the research data therein. We focus on data and metadata on genetic sequences posted on the two infrastructures in the period between January 2020 and January 2023, thus capturing a period of acute response to the COVID-19 pandemic. Through a variety of data scien… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 81 publications
0
2
0
Order By: Relevance
“…It puts a premium on fast-paced research scaffolded by high-tech infrastructurefactors which, however, are not always or necessarily marks of quality and longterm reliability (Chen et al 2019, Leonelli 2021. It can result in the exclusion from data sharing initiatives of researchers who are not based at prestigious academic institutions, which in turn reduces the diversity and range of data available online, as well as the types of expertise viewed as significant when evaluating that evidence base (Sheehan et al 2023). These are significant ethical issues in terms of who is included and recognized as a participant in research; but far from being solely an ethical problem, this form of bias has also epistemic implications, since it substantially reduces both the diversity of data sources available to biomedical researchers globally and the amount of expertiseand particularly localized, regional expertiseput to the service of data modelling and interpretation.…”
Section: The Access Wars: Covid-19 Data Sharingmentioning
confidence: 99%
“…It puts a premium on fast-paced research scaffolded by high-tech infrastructurefactors which, however, are not always or necessarily marks of quality and longterm reliability (Chen et al 2019, Leonelli 2021. It can result in the exclusion from data sharing initiatives of researchers who are not based at prestigious academic institutions, which in turn reduces the diversity and range of data available online, as well as the types of expertise viewed as significant when evaluating that evidence base (Sheehan et al 2023). These are significant ethical issues in terms of who is included and recognized as a participant in research; but far from being solely an ethical problem, this form of bias has also epistemic implications, since it substantially reduces both the diversity of data sources available to biomedical researchers globally and the amount of expertiseand particularly localized, regional expertiseput to the service of data modelling and interpretation.…”
Section: The Access Wars: Covid-19 Data Sharingmentioning
confidence: 99%
“…Research on the genetic changes and mutations of respiratory viruses involved in such mixed infections is lacking, opening up unexplored opportunities for genetic variation. On the other hand, 6 million whole-genome sequences (WGS) of SARS-CoV-2 have been generated in EpiCoV by the Global Initiative for sharing of all influenza data (GISAID) [21]. This makes it possible to accurately calculate the evolution and transmission of SARS-CoV-2 [22,23].…”
Section: Introductionmentioning
confidence: 99%