2020
DOI: 10.1007/s00778-020-00611-y
|View full text |Cite
|
Sign up to set email alerts
|

RDF graph summarization for first-sight structure discovery

Abstract: To help users get familiar with large RDF graphs, RDF summarization techniques can be used. In this work, we study quotient summaries of RDF graphs, that is: graph summaries derived from a notion of equivalence among RDF graph nodes. We make the following contributions: (i) four novel summaries which are often small and easy-to-comprehend, in the style of E-R diagrams; (ii) efficient (amortized linear-time) algorithms for computing these summaries either from scratch, or incrementally, reflecting additions to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
39
0

Year Published

2020
2020
2025
2025

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 26 publications
(39 citation statements)
references
References 32 publications
0
39
0
Order By: Relevance
“…Structural graph summaries are used for various different tasks such as cardinality estimations for queries in graph databases [46], data exploration [6,43,48,53], data visualization [25], vocabulary term recommendations [51], related entity retrieval [16], and query answering in data search [27]. The distinguishing characteristic of structural graph summaries is that they partition the set of vertices in a graph based on equivalences of subgraphs [10].…”
Section: Introductionmentioning
confidence: 99%
“…Structural graph summaries are used for various different tasks such as cardinality estimations for queries in graph databases [46], data exploration [6,43,48,53], data visualization [25], vocabulary term recommendations [51], related entity retrieval [16], and query answering in data search [27]. The distinguishing characteristic of structural graph summaries is that they partition the set of vertices in a graph based on equivalences of subgraphs [10].…”
Section: Introductionmentioning
confidence: 99%
“…In particular, according to what has been reported in [43], it is approximately 6 times larger than the more significant dataset processed by DistLODStat (i.e., 200 GB). Other profiling approaches, such as [20], experimented with real and synthetic graphs of up to 36.5 GB (approx 32 times smaller than makg), while [17] is evaluated on 6 datasets where the biggest one has the size of 56 MB (approx 21.125 times smaller than makg).…”
Section: Abstat-hd Versus Related Workmentioning
confidence: 99%
“…16 Moreover, it uses 5 external types from the fabio ontology 17 (Book, BookChapter, ConferencePaper, JournalArticle, and PatentDocument) and 25 external properties (from ontologies fabio, purl, 18 cito, 19 dbpedia, etc.). The Microsoft Academic Knowledge Graph maintainers have published also the schema 20 as an easier way to visualize relations among types and datatypes. From this schema, a user can easily notice that the KG makes use of two owl:equivalentClass: one between makg:FieldOfStudy and fabio:SubjectDiscipline and the other between makg:Paper and fabio:Work.…”
Section: Potential Errors Detected In the Makgmentioning
confidence: 99%
See 1 more Smart Citation
“…RDF summarization can be performed using a wide range of different techniques based in different dimensions of the target graph. Even if most of the current techniques rely at some point on concepts of node importance or relevance, some techniques, such as pattern-minning methods [ 42 ] or quotient summaries [ 43 , 44 ] may not use importance metrics.…”
Section: Related Workmentioning
confidence: 99%