2020
DOI: 10.1371/journal.pone.0230416

The citation advantage of linking publications to research data

Abstract: Efforts to make research results open and reproducible are increasingly reflected by journal policies encouraging or mandating authors to provide data availability statements. As a consequence of this, there has been a strong uptake of data availability statements in recent literature. Nevertheless, it is still unclear what proportion of these statements actually contain well-formed links to data, for example via a URL or permanent identifier, and if there is an added value in providing such links. We consider…

Cited by 229 publications (207 citation statements)
References: 59 publications
“…existing public or released with publication) was associated with 60.8% more citations per year than their private-data-only counterparts (95% CI: 28.1%-110.2%). Such a large effect is highly surprising, considering that prior studies in other fields [13,6,3] have found that releasing one's data is associated with only a 10%-30% increase in citations, where our study found a much larger effect from simply using public data. In our view, this illustrates the outsized importance of data in machine learning research, and suggests that the medical image computing field is highly catalyzed by the public release of imaging data.…”
Section: Papers Using Public Data Were Cited More Than 60% More (contrasting)
confidence: 67%
“…Very recently, Colavizza et al [3] conducted a text mining and citation analysis of more than half a million papers published by PLOS and BMC that were also part of the PubMed Open Access Collection. These journals are interesting cases because they each recently enacted policies requiring authors to include a Data Availability Statement (DAS) belonging to one of three categories: (1) "data is available on request", (2) "data is contained within the article or supplementary material", and (3) "data is in a public repository and here is the link".…”
Section: Related Work (mentioning)
confidence: 99%
“…An advantage of our approach compared to other studies that focus on specific journals that have a dedicated data availability statement (Federer et al 2018; Colavizza et al 2020) is the broader applicability. As our algorithm screens all parts of a publication, Open Data statements can be detected anywhere in the text.…”
Section: Discussion (mentioning)
confidence: 99%
“…On the level of individual researchers, Open Data can lead to increased visibility, resource efficiency (Pronk 2019; Milham et al 2018), and possibly a citation advantage (Colavizza et al 2020; Piwowar and Vision 2013). However, there are also limits to the open sharing of research data in the biomedical domain, especially when it comes to patient data that are subject to data protection regulations.…”
Section: Introduction (mentioning)
confidence: 99%