2016
DOI: 10.1007/978-3-319-46547-0_20
|View full text |Cite
|
Sign up to set email alerts
|

A Collection of Benchmark Datasets for Systematic Evaluations of Machine Learning on the Semantic Web

Abstract: Abstract.Resource type: Datasets Permanent URL: http://w3id.org/sw4ml-datasets In the recent years, several approaches for machine learning on the Semantic Web have been proposed. However, no extensive comparisons between those approaches have been undertaken, in particular due to a lack of publicly available, acknowledged benchmark datasets. In this paper, we present a collection of 22 benchmark datasets of different sizes.Such a collection of datasets can be used to conduct quantitative performance testing a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
67
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
1
1

Relationship

3
6

Authors

Journals

citations
Cited by 65 publications
(67 citation statements)
references
References 14 publications
0
67
0
Order By: Relevance
“…Datasets We evaluate our model on four datasets 3 in Resource Description Framework (RDF) format (Ristoski, de Vries, and Paulheim 2016): AIFB, MUTAG, BGS, and AM. Relations in these datasets need not necessarily encode directed subject-object relations, but are also used to encode the presence, or absence, of a specific feature for a given entity.…”
Section: Entity Classification Experimentsmentioning
confidence: 99%
See 1 more Smart Citation
“…Datasets We evaluate our model on four datasets 3 in Resource Description Framework (RDF) format (Ristoski, de Vries, and Paulheim 2016): AIFB, MUTAG, BGS, and AM. Relations in these datasets need not necessarily encode directed subject-object relations, but are also used to encode the presence, or absence, of a specific feature for a given entity.…”
Section: Entity Classification Experimentsmentioning
confidence: 99%
“…The exact statistics of the datasets can be found in Table 1. For a more detailed description of the datasets the reader is referred to Ristoski, de Vries, and Paulheim (2016). We remove relations that were used to create entity labels: employs and affiliation for AIFB, isMutagenic for MUTAG, hasLithogenesis for BGS, and objectCategory and material for AM.…”
Section: Entity Classification Experimentsmentioning
confidence: 99%
“…We use the entity embeddings on ve di erent datasets from di erent domains, for the tasks of classi cation and regression [34].…”
Section: Datasetsmentioning
confidence: 99%
“…More details about the evaluation datasets and how the datasets were generated are presented in [28].…”
Section: Datasetsmentioning
confidence: 99%