Spatial cloud computing: how can the geospatial sciences use and help shape cloud computing?

Yang, Chaowei; Goodchild, Michael F.; Huang, Qunying; Nebert, Doug; Raskin, R.; Xu, Yang; Bambacus, Myra; Fay, Daniel

doi:10.1080/17538947.2011.587547

Cited by 309 publications

(172 citation statements)

References 55 publications

Supporting

Mentioning

168

Contrasting

Unclassified

Order By: Relevance

“…For our approach, the parallel sub-queries generated from a WFS query can be converted to map-tasks while the partial results of the map-tasks can be summarized by a reduce-task. Even though the Map-reduce model of Cloud computing introduces large runtime overhead, it can provide distributed computing capability in elastic and on demand manners by virtualizing and pooling computing resources [29]. By providing -computing as a service‖ for end users in a -pay-as-you-go‖ mode, cloud computing may be more convenient and budget and energy consumption efficient for improve the performance of the WFS systems of heavy workload.…”

Section: Discussionmentioning

confidence: 99%

Towards Improving Query Performance of Web Feature Services (WFS) for Disaster Response

Zhang

Zhao

2013

IJGI

View full text Add to dashboard Cite

While OGC's WFS facilitates disseminating heterogeneous spatial data over the Web and allows feature-level geospatial information sharing and synchronization, performance issues challenge the efficient and effective utilization of WFS for disaster response. Literature shows that obtaining spatial information becomes very slow when querying WFS systems from large geospatial databases over the Internet. Solutions on how to improve the WFS system performance so that spatial data can be delivered to disaster responders within a reasonable amount of time are needed. This paper proposes a parallel approach based on Voronoi diagram indexing and data/task parallelism for improving the query performance of WFS systems for disaster applications. Experimental results show that the parallel approach can significantly improve the response time needed to process the spatial queries from a massive volume of spatial data for disaster response.

show abstract

Section: Discussionmentioning

confidence: 99%

Towards Improving Query Performance of Web Feature Services (WFS) for Disaster Response

Zhang

Zhao

2013

IJGI

View full text Add to dashboard Cite

show abstract

“…In the face of such massive spatial data, the performance of traditional spatial join algorithms encounters a serious bottleneck. There is a growing consensus that improvements in high-performance computation will pave the new direction for distributed spatial analysis [7]. By deploying a high-performance spatial join computing framework, this study demonstrates that it is promising to leverage cutting-edge computing power for large-scale spatial relationship analysis.…”

Section: Introductionmentioning

confidence: 99%

A New Design of High-Performance Large-Scale GIS Computing at a Finer Spatial Granularity: A Case Study of Spatial Join with Spark for Sustainability

et al. 2016

View full text Add to dashboard Cite

Sustainability research faces many challenges as respective environmental, urban and regional contexts are experiencing rapid changes at an unprecedented spatial granularity level, which involves growing massive data and the need for spatial relationship detection at a faster pace. Spatial join is a fundamental method for making data more informative with respect to spatial relations. The dramatic growth of data volumes has led to increased focus on high-performance large-scale spatial join. In this paper, we present Spatial Join with Spark (SJS), a proposed high-performance algorithm, that uses a simple, but efficient, uniform spatial grid to partition datasets and joins the partitions with the built-in join transformation of Spark. SJS utilizes the distributed in-memory iterative computation of Spark, then introduces a calculation-evaluating model and in-memory spatial repartition technology, which optimize the initial partition by evaluating the calculation amount of local join algorithms without any disk access. We compare four in-memory spatial join algorithms in SJS for further performance improvement. Based on extensive experiments with real-world data, we conclude that SJS outperforms the Spark and MapReduce implementations of earlier spatial join approaches. This study demonstrates that it is promising to leverage high-performance computing for large-scale spatial join analysis. The availability of large-sized geo-referenced datasets along with the high-performance computing technology can raise great opportunities for sustainability research on whether and how these new trends in data and technology can be utilized to help detect the associated trends and patterns in the human-environment dynamics.

show abstract

“…It provides scalable storage to manage and organize continuously increasing geospatial data, elastically changing its processing capability to data parallel computing with an effective paradigm to integrate many heterogeneous geocomputing resources [12,13]. Thus far, the advancements of cloud-enabled geocomputing involve dealing with the intensities of data, computation, concurrent access, and spatiotemporal patterns [14].…”

Section: Cloud-based Big Geo-data Processingmentioning

confidence: 99%

An Effective NoSQL-Based Vector Map Tile Management Approach

Wan

Huang

Xia

2016

IJGI

View full text Add to dashboard Cite

Within a digital map service environment, the rapid growth of Spatial Big-Data is driving new requirements for effective mechanisms for massive online vector map tile processing. The emergence of Not Only SQL (NoSQL) databases has resulted in a new data storage and management model for scalable spatial data deployments and fast tracking. They better suit the scenario of high-volume, low-latency network map services than traditional standalone high-performance computer (HPC) or relational databases. In this paper, we propose a flexible storage framework that provides feasible methods for tiled map data parallel clipping and retrieval operations within a distributed NoSQL database environment. We illustrate the parallel vector tile generation and querying algorithms with the MapReduce programming model. Three different processing approaches, including local caching, distributed file storage, and the NoSQL-based method, are compared by analyzing the concurrent load and calculation time. An online geological vector tile map service prototype was developed to embed our processing framework in the China Geological Survey Information Grid. Experimental results show that our NoSQL-based parallel tile management framework can support applications that process huge volumes of vector tile data and improve performance of the tiled map service.

show abstract

Spatial cloud computing: how can the geospatial sciences use and help shape cloud computing?

Cited by 309 publications

References 55 publications

Towards Improving Query Performance of Web Feature Services (WFS) for Disaster Response

Towards Improving Query Performance of Web Feature Services (WFS) for Disaster Response

A New Design of High-Performance Large-Scale GIS Computing at a Finer Spatial Granularity: A Case Study of Spatial Join with Spark for Sustainability

An Effective NoSQL-Based Vector Map Tile Management Approach

Contact Info

Product

Resources

About