Cache-Based Multi-Query Optimization for Data-Intensive Scalable Computing Frameworks

Michiardi, Pietro; Carra, Damiano; Migliorini, Sara

doi:10.1007/s10796-020-09995-2

Cited by 16 publications

(10 citation statements)

References 43 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Thus, common subexpressions are evaluated once. This approach was subsequently extended to include query result caches, materialized/cached views, intermediate query results, and query rewriting, which have been extensively studied for relational database systems [11], [14], [15], [28]- [31], [34], [35] and streaming processing systems [19]- [21]. Group processing algorithms have proven to be effective in multiple applications involving high-load conditions [7], [8], [15], [19]- [21], [27]- [31], [34]- [37], [52], [53].…”

Section: A Farthest Neighbor Search Algorithmsmentioning

confidence: 99%

Group Processing of Multiple k-Farthest Neighbor Queries in Road Networks

Cho

Attique

2020

IEEE Access

View full text Add to dashboard Cite

Advances in mobile technologies and map-based applications enables users to utilize sophisticated spatial queries, including k-nearest neighbor and shortest path queries. Often, location-based servers are used to handle multiple simultaneous queries because of the popularity of map-based applications. This study focuses on the efficient processing of multiple concurrent k-farthest neighbor (kFN) queries in road networks. For a positive integer k, query point q, and set of data points P , a kFN query returns k data points farthest from the query point q. For addressing multiple concurrent spatial queries, traditional locationbased servers based on one-query-at-a-time processing are unsuitable owing to high redundant computation costs. Therefore, we propose a group processing of multiple kFN (GMP) algorithm to process multiple kFN queries in road networks. The proposed GMP algorithm uses group computation to avoid the redundant computation of network distances between the query and data points. The experiments using real-world roadmaps demonstrate the proposed solution's effectiveness and efficiency. INDEX TERMS Spatial databases, group processing, multiple k-farthest neighbor query, road network.

show abstract

Section: A Farthest Neighbor Search Algorithmsmentioning

confidence: 99%

Group Processing of Multiple k-Farthest Neighbor Queries in Road Networks

Cho

Attique

2020

IEEE Access

View full text Add to dashboard Cite

show abstract

“…These multi-query optimization techniques later expanded to involve query rewriting, query result caches, materialized views, and intermediate query results for relational database systems [ 28 , 29 , 30 , 31 , 32 , 33 , 34 , 35 , 36 ] and streaming processing systems [ 37 , 38 , 39 ]. Many applications involving high-load conditions have proven that batch processing algorithms can significantly reduce the query processing time for multiple simultaneous queries [ 19 , 30 , 31 , 32 , 33 , 34 , 35 , 36 , 37 , 38 , 39 , 40 , 41 , 42 , 43 ]. Furthermore, multi-query optimization techniques have received significant attention in spatial databases.…”

Section: Related Workmentioning

confidence: 99%

A Unified Approach to Spatial Proximity Query Processing in Dynamic Spatial Networks

Cho¹

2021

Sensors

View full text Add to dashboard Cite

Nearest neighbor (NN) and range (RN) queries are basic query types in spatial databases. In this study, we refer to collections of NN and RN queries as spatial proximity (SP) queries. At peak times, location-based services (LBS) need to quickly process SP queries that arrive simultaneously. Timely processing can be achieved by increasing the number of LBS servers; however, this also increases service costs. Existing solutions evaluate SP queries sequentially; thus, such solutions involve unnecessary distance calculations. This study proposes a unified batch algorithm (UBA) that can effectively process SP queries in dynamic spatial networks. With the proposed UBA, the distance between two points is indicated by the travel time on the shortest path connecting them. The shortest travel time changes frequently depending on traffic conditions. The goal of the proposed UBA is to avoid unnecessary distance calculations for nearby SP queries. Thus, the UBA clusters nearby SP queries and exploits shared distance calculations for query clusters. Extensive evaluations using real-world roadmaps demonstrated the superiority and scalability of UBA compared with state-of-the-art sequential solutions.

show abstract

“…efficient data models, data processing pipelines and architectures to integrate standard and big data sources (Jovanovic et al 2020) as well as to improve resource utilization and aggregate performance in shared environments (Michiardi et al 2020); predictive analytics to forecast product demand in the fashion industry (Gardino et al 2020) and techniques to deal with the lack of annotated data for sensor-based human activity recognition (Prabono et al 2020); text data processing to assess the performance of text storage systems through a generic benchmark (Truicȃ et al 2020) and innovative solutions to deal with specific use cases such as the legal domain (Bordino et al 2020); novel approaches for mining social media to support intelligent transportation systems (Vallejos et al 2020) and digging deep the IoT scenario (Ustek-Spilda et al 2020); -solutions to deal with privacy issues in distance learning systems (Preuveneers et al 2020).…”

Section: Special Issue Contentmentioning

confidence: 99%

“…To gain a more efficient resource utilization and better aggregate performance in shared environments, where queries are concurrently submitted by multiple users, Multi-Query Optimization (MQO) techniques are adopted in paper (Michiardi et al 2020). The proposed system extends the SparkSQL Catalyst optimizer to provide a general approach to MQO for distributed computing frameworks that support a relational API.…”

Section: Efficient Data Models Data Processing Pipelines and Architementioning

confidence: 99%

Breakthroughs on Cross-Cutting Data Management, Data Analytics, and Applied Data Science

et al. 2020

View full text Add to dashboard Cite

Cache-Based Multi-Query Optimization for Data-Intensive Scalable Computing Frameworks

Cited by 16 publications

References 43 publications

Group Processing of Multiple k-Farthest Neighbor Queries in Road Networks

Group Processing of Multiple k-Farthest Neighbor Queries in Road Networks

A Unified Approach to Spatial Proximity Query Processing in Dynamic Spatial Networks

Breakthroughs on Cross-Cutting Data Management, Data Analytics, and Applied Data Science

Contact Info

Product

Resources

About