Proceedings of the 2009 International Database Engineering &Amp; Applications Symposium on - IDEAS '09 2009
DOI: 10.1145/1620432.1620434
|View full text |Cite
|
Sign up to set email alerts
|

Efficient discovery of join plans in schemaless data

Abstract: We describe a method of inferring join plans for a set of relation instances, in the absence of any metadata, such as attribute domains, attribute names, or constraints (e.g., keys or foreign keys). Our method enumerates the possible join plans in order of likelihood, based on the compatibility of a pair of columns and their suitability as join attributes (i.e. their appropriateness as keys). We outline two variants of the approach. The first variant is accurate but potentially time-consuming, especially for l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2017
2017

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…The data flow can be divided into phases as illustrated in Figure 2. First, the data is read from a data storage, such as files or relational database, and then constructed into a domain model (1). Domain model offers primitives for concepts such as truck, driver, and request.…”
Section: Data Flow In a Vehicle Routing Systemmentioning
confidence: 99%
See 1 more Smart Citation
“…The data flow can be divided into phases as illustrated in Figure 2. First, the data is read from a data storage, such as files or relational database, and then constructed into a domain model (1). Domain model offers primitives for concepts such as truck, driver, and request.…”
Section: Data Flow In a Vehicle Routing Systemmentioning
confidence: 99%
“…To illustrate, one part of the dataset could consist of ordinary files that pertain to drivers and vehicles, and the other deliveries and locations. Finding semantic links between the relations in these datasets is what we call join inference, which in turn is based on foreign key discovery [1,41]. We propose join inference as a model that can learn the semantic links between a set of relations.…”
Section: Input Data → Domain Modelmentioning
confidence: 99%