2014
DOI: 10.32614/rj-2014-012
|View full text |Cite
|
Sign up to set email alerts
|

RStorm: Developing and Testing Streaming Algorithms in R

Abstract: Streaming data, consisting of indefinitely evolving sequences, are becoming ubiquitous in many branches of science and in various applications. Computer scientists have developed streaming applications such as Storm and the S4 distributed stream computing platform 1 to deal with data streams. However, in current production packages testing and evaluating streaming algorithms is cumbersome. This paper presents RStorm for the development and evaluation of streaming algorithms analogous to these production packag… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 12 publications
0
2
0
Order By: Relevance
“…RStorm (Kaptein 2013) provides an environment to prototype bolts in R. Spouts are represented as data frames. Bolts developed in RStorm can currently not directly be used in Storm, but this is planned for the future (Kaptein 2014).…”
Section: Distributed Computing Frameworkmentioning
confidence: 99%
“…RStorm (Kaptein 2013) provides an environment to prototype bolts in R. Spouts are represented as data frames. Bolts developed in RStorm can currently not directly be used in Storm, but this is planned for the future (Kaptein 2014).…”
Section: Distributed Computing Frameworkmentioning
confidence: 99%
“…Practically, it has to be noted that at this moment not many off-the-shelf statistical packages are available to actually analyze data streams. The currently available software, for instance (and not exhaustive) Apache Storm (Toshniwal et al, 2014) Apache Spark (Karau, Konwinski, Wendell, & Zaharia, 2015), RStorm (Kaptein, 2014), S4 (Neumeyer, Robbins, Nair, & Kesari, 2010), RapidMiner (Hofmann & Klinkenberg, 2013), KNIME (Berthold et al, 2009), and MOA (Bifet, Holmes, Kirkby, & Pfahringer, 2010), often require extensive programming knowledge and focus mainly on the infrastructure of analyzing large datasets. There is still a large gap between the methods and software developed by computer scientists, and those that can be used by social scientists to analyze their data streams using models that they are accustomed to.…”
Section: Considerations Analyzing Big Data and Data Streamsmentioning
confidence: 99%