2015 IEEE International Conference on Big Data (Big Data) 2015
DOI: 10.1109/bigdata.2015.7363983
|View full text |Cite
|
Sign up to set email alerts
|

SciSpark: Applying in-memory distributed computing to weather event detection and tracking

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 36 publications
(22 citation statements)
references
References 5 publications
0
22
0
Order By: Relevance
“…In addition to the research highlights we presented in the previous sections, there are other research works which have been done using Apache Spark as a core engine for solving data problems in machine learning and data mining [5,36], graph processing [16], genomic analysis [60,65], time series data [71], smart grid data [73], spatial data processing [87], scientific computations of satellite data [67], large-scale biological sequence alignment [97] and data discretization [68]. There are also some recent works on using Apache Spark for deep learning [46,64].…”
Section: Related Researchmentioning
confidence: 99%
“…In addition to the research highlights we presented in the previous sections, there are other research works which have been done using Apache Spark as a core engine for solving data problems in machine learning and data mining [5,36], graph processing [16], genomic analysis [60,65], time series data [71], smart grid data [73], spatial data processing [87], scientific computations of satellite data [67], large-scale biological sequence alignment [97] and data discretization [68]. There are also some recent works on using Apache Spark for deep learning [46,64].…”
Section: Related Researchmentioning
confidence: 99%
“…Persistence Ordering: e second problem deals with ensuring that the execution order of dependent HTM transactions is correctly re ected in PM following crash recovery. As an example, consider the dependent transactions A, B, C in Listings 1, 2 & 3. e HTM will serialize their execution in some order: say A, B and C. e values of the transaction variables following the execution of A are given by the vector V 1 = [w, x, y, z] = [1, 1, 0, 0]; a er the execution of B the vector becomes V 2 = [2, 1, 2, 0] and nally following C it is 1,2,3]. Under normal operation the write backs of variables to PM from di erent transactions may become arbitrarily interleaved.…”
Section: Challenges Of Persistent Htm Transactionsmentioning
confidence: 99%
“…In parallel, current bibliography trends highlight the adoption of Spark for both key processing steps in large-scale imaging problems [10,33,11,34,35], as well as for parallelizing dedicated machine learning and optimization algorithms [36,37,38,39]. Specifically, with regard to imaging data management over Spark, SciSpark [10,33] pre-processes structured scientific data in network Common Format (netCDF) and Hierarchical Data Format (HDF). The result is a distributed com-puting array structure suitable for supporting iterative scientific algorithms for multidimensional data, with applications on Earth Observation and climate data for weather event detection.…”
Section: The Positioning Of Apache Spark In the Distributed Learning mentioning
confidence: 99%