2015
DOI: 10.1017/s0269888915000107
|View full text |Cite
|
Sign up to set email alerts
|

Federated query processing on linked data: a qualitative survey and open challenges

Abstract: A large number of data providers publish and connect their structured data on the Web as linked data. Thus, the Web of data becomes a global data space. In this paper, we initially give an overview of query processing approaches used in this interlinked and distributed environment, and then focus on federated query processing on linked data. We provide a detailed and clear insight on data source selection, join methods and query optimization methods of existing query federation engines. Furthermore, we present… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 29 publications
(24 citation statements)
references
References 54 publications
(116 reference statements)
0
24
0
Order By: Relevance
“…within a single repository [2, 10,128,117]. In a similar manner to a distributed database, given a query Q and a set of datasets (the sources), the query engine first selects the datasets relevant to the query [121,133] and then chooses between different approaches: aggregating the datasets locally, using distributed processing as in Hadoop [143], or a federated approach [107].…”
Section: Common Search Architecturesmentioning
confidence: 99%
“…within a single repository [2, 10,128,117]. In a similar manner to a distributed database, given a query Q and a set of datasets (the sources), the query engine first selects the datasets relevant to the query [121,133] and then chooses between different approaches: aggregating the datasets locally, using distributed processing as in Hadoop [143], or a federated approach [107].…”
Section: Common Search Architecturesmentioning
confidence: 99%
“…To fill this gap, in this paper we focus on methods, supported by special indexes, for performing tasks and measurements that involve more than two datasets. Such indexes and measurements are important: (a) for obtaining complete information about one particular URI (or set of URIs) with their provenance, (b) for obtaining measurements that are useful for dataset discovery and selection [6,17,18], (c) for assessing the connectivity between any set of datasets for quality checking and for monitoring their evolution over time [14], (d) for constructing visualizations [3] that provide more informative overviews and could also aid dataset discovery. Overall, the aforementioned tasks can assist data scientists since according to several studies they currently spend most of their time in collecting and preparing unruly digital data.…”
Section: Introductionmentioning
confidence: 99%
“…This can be explained as dividing a query into subqueries and sending it to semantic query endpoints. This allows the easy integration of linked data from different semantic query endpoints, in a standardized data model (RDF) [53]. This is one of the powerful features of the semantic web and geospatial semantic web, and it facilitates an easier way to build applications with a suitable combination of heterogeneous data from various resources.…”
Section: Geospatially Linked Data In Digital Gazetteersmentioning
confidence: 99%