2019
DOI: 10.1007/s13222-019-00313-y
|View full text |Cite
|
Sign up to set email alerts
|

Using the Semantic Web as a Source of Training Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
14
0
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(15 citation statements)
references
References 16 publications
0
14
0
1
Order By: Relevance
“…Among the over 98 billion RDF n-quads mentioned above, nearly 10% are related to products, and are described by schema.org vocabularies. As examples, previous studies (Meusel et al (2015); Bizer et al (2019)) showed that among all product offers, 95% had an n-quad related to their names, 65% had one for their description, 35% had one for their brand, and less than 10% had one for their category. We argue that such data can be used to create language resources that are potentially useful for a wide range of product data mining tasks.…”
Section: Introductionmentioning
confidence: 93%
See 2 more Smart Citations
“…Among the over 98 billion RDF n-quads mentioned above, nearly 10% are related to products, and are described by schema.org vocabularies. As examples, previous studies (Meusel et al (2015); Bizer et al (2019)) showed that among all product offers, 95% had an n-quad related to their names, 65% had one for their description, 35% had one for their brand, and less than 10% had one for their category. We argue that such data can be used to create language resources that are potentially useful for a wide range of product data mining tasks.…”
Section: Introductionmentioning
confidence: 93%
“…Therefore, an RDF n-quad can be considered as describing a 'fact' that are instantiations of concepts, properties and relationships. Studies have shown that such data can be used to train models for various NLP tasks, such as event extraction (Foley et al (2015)) and entity linking (Bizer et al (2019)).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…While originally applied in the context of relation extraction from text, it has been used for the task of augmenting a knowledge base from semi-structured web data, including web tables [4,10]. Bizer et al [2] make use of schema.org annotations extracted from 43 thousand e-shops to distantly supervise a deep neural network for product matching. To generate training pairs, they make use of generic product identifies that are often provided along the annotations.…”
Section: Related Workmentioning
confidence: 99%
“…La propia web de Schema.org cuantifica el despliegue de su uso en 10 millones de sitios web. Esto supone alrededor de menos del 1% de la Web, pero que es usado en cerca del 30% de los dominios(Bizer;Primpeli;Peeters, 2019).…”
unclassified