Proceedings of the 2017 ACM International Conference on Management of Data 2017
DOI: 10.1145/3035918.3035949
|View full text |Cite
|
Sign up to set email alerts
|

Extracting and Analyzing Hidden Graphs from Relational Databases

Abstract: Analyzing interconnection structures among underlying entities or objects in a dataset through the use of graph analytics can provide tremendous value in many application domains. However, graphs are not the primary representation choice for storing most data today, and in order to have access to these analyses, users are forced to manually extract data from their data stores, construct the requisite graphs, and then load them into some graph engine in order to execute their graph analysis task. Moreover, in m… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
2
2
2

Relationship

1
5

Authors

Journals

citations
Cited by 27 publications
(28 citation statements)
references
References 47 publications
0
28
0
Order By: Relevance
“…Creating the edges here requires doing a self-join on the AuthorPublication table followed by a DISTINCT. A key challenge here is that the size of the self-join is often much larger than the size of the original AuthorPublication table [16]. Although this approach (Approach 3) is likely to be the most attractive in practice, there hasn't been much systematic work on understanding it; one of our goals here is to systematically explore it and discuss the key challenges that come up.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…Creating the edges here requires doing a self-join on the AuthorPublication table followed by a DISTINCT. A key challenge here is that the size of the self-join is often much larger than the size of the original AuthorPublication table [16]. Although this approach (Approach 3) is likely to be the most attractive in practice, there hasn't been much systematic work on understanding it; one of our goals here is to systematically explore it and discuss the key challenges that come up.…”
Section: Resultsmentioning
confidence: 99%
“…Ringo [9] also provides operators for converting from an in-memory relational table representation to a graph representation, but is not intended as a layer on top of an RDBMS. Finally, in our prior work on GraphGen, we focused on the problem of exploring hidden graphs within relational databases [17] and dealing with large intermediate results that get generated in the process [16].…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In case of very large graphs, this basic setting becomes inefficient and mechanisms to speedup or guide search become essential. In literature various techniques including parallelization [13], declaratively specifying graph extraction tasks over database schemas [14], and introduction of mode declarations [15] have been proposed to speed-up graph based concept discovery systems or reduce the search space. In this study, we empirically evaluate performance of three bivariate statistical methods, namely frequency ratio (FR), hazard index (HI), and weight of evidence (WoE), as heuristics to guide search in relational pathfinding-based concept discovery systems.…”
Section: Introductionmentioning
confidence: 99%