2018
DOI: 10.1145/3183558
|View full text |Cite
|
Sign up to set email alerts
|

Building the universal archive of source code

Abstract: A global collaborative project for the benefit of all.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
59
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 49 publications
(59 citation statements)
references
References 3 publications
0
59
0
Order By: Relevance
“…Software Heritage (SWH) was started in 2015 to collect, preserve and share the source code of all software ever written, together with its full development history [1]. As of today, it has already collected almost 6 billions unique source code files coming from over 85 million software origins that are regularly harvested.…”
Section: Reproducible Researchmentioning
confidence: 99%
“…Software Heritage (SWH) was started in 2015 to collect, preserve and share the source code of all software ever written, together with its full development history [1]. As of today, it has already collected almost 6 billions unique source code files coming from over 85 million software origins that are regularly harvested.…”
Section: Reproducible Researchmentioning
confidence: 99%
“…In this paper we describe an already operational system of identifiers that satisfies all these extra properties: high granularity, integrity, and no middleman. We argue that, when used in conjunction with the long-term archival provided by Software Heritage [2], such a system provides a suitable…”
Section: B References For Reuse and Reproducibilitymentioning
confidence: 99%
“…In order to develop an identifier system for billions of source code artifacts archived for the long-term in Software Heritage [2] we use a data model based on Merkle DAG (Direct Acyclic Graph) [3]. Nodes and edges are connected using hashing functions and represent the history of software development as captured by modern version control systems.…”
Section: Data Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…• technical solutions allowing creators of software to automatically publish releases of their software to a data repository, which provides a DOI and landing page for the software publication, e.g., the Zenodo repository automating archival of software releases on GitHub (guides.github.com/activities/citablecode/); • reliance on software archives harvesting open source code repositories, and providing unique identifiers for artifacts, e.g., Software Heritage [7].…”
Section: Challenges For the Retrieval Of Research Citation Graphsmentioning
confidence: 99%