2020
DOI: 10.48550/arxiv.2005.11981
Preprint

The OpenCitations Data Model

Abstract: A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies in data. Adoption of a single data model would facilitate data integration tasks regardless of the data supplier or context application. In this paper we present the OpenCitations Data Model (OCDM), a generic data model for describing bibliographic entities and citations, de…

Cited by 1 publication (2 citation statements)
References 20 publications
“…This section introduces the benchmark datasets OC-782K and AMiner-534K which are created for evaluating the LAND framework. OC-782K is a subset of the Scientometrics KG [20] which is built in compliance with the OpenCitations Data Model (OCDM) [7]. On the other hand, AMiner-534K is a KG generated from a well-established benchmark dataset 2 for AND made available by AMiner in [31].…”
Section: Creation Of the Scholarly KGs (mentioning)
confidence: 99%
“…This data model contains three types of entities: fabio:Expression, which represents articles, books, conference papers, and other academic works, fabio:Journal for representing journal venues (if the related fabio:Expression is a journal article), and authors which are described as foaf:Agent. The data model is an abstraction of the OCDM [7] and is created for two reasons: i) for collecting triples only related to the entities of interest (e.g. bibliographic resources, venues, and authors), ii) create an abstract representation of Scientometrics-OC in order to perform representation learning more efficiently.…”
Section: The OC-782K Knowledge Graph (mentioning)
confidence: 99%
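
The second citation statement describes reducing a graph to triples about three entity types (fabio:Expression, fabio:Journal, foaf:Agent). A minimal sketch of that kind of filtering is given below, assuming triples are plain `(subject, predicate, object)` string tuples. The function name `filter_triples`, the `example.org` URIs, and the sample data are hypothetical and are not taken from the citing paper or from OCDM; only the class names and their standard namespace URIs come from the vocabularies mentioned.

```python
# Namespace URIs for the prefixes named in the citation statement.
FABIO = "http://purl.org/spar/fabio/"
FOAF = "http://xmlns.com/foaf/0.1/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# The three entity types the statement lists as "entities of interest".
KEPT_CLASSES = {FABIO + "Expression", FABIO + "Journal", FOAF + "Agent"}


def filter_triples(triples):
    """Keep only triples whose subject is typed as one of KEPT_CLASSES.

    Assumption: this subject-based rule is one simple reading of
    "collecting triples only related to the entities of interest";
    the citing paper may apply a different rule.
    """
    triples = list(triples)
    kept_subjects = {
        s for s, p, o in triples if p == RDF_TYPE and o in KEPT_CLASSES
    }
    return [t for t in triples if t[0] in kept_subjects]


# Hypothetical sample graph (example.org URIs are placeholders).
EX = "http://example.org/"
sample = [
    (EX + "paper1", RDF_TYPE, FABIO + "Expression"),
    (EX + "paper1", EX + "title", "A paper"),
    (EX + "journal1", RDF_TYPE, FABIO + "Journal"),
    (EX + "author1", RDF_TYPE, FOAF + "Agent"),
    (EX + "id1", RDF_TYPE, EX + "Identifier"),
    (EX + "id1", EX + "value", "10.1234/abc"),
]

result = filter_triples(sample)
```

In this sketch the two triples about `id1` are dropped because its type is not among the three kept classes, while every triple whose subject is a kept entity is retained.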