2017
DOI: 10.1109/tkde.2017.2751606
|View full text |Cite
|
Sign up to set email alerts
|

Automating Characterization Deployment in Distributed Data Stream Management Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 24 publications
(21 citation statements)
references
References 24 publications
0
18
0
Order By: Relevance
“…Past works focused on different aspects of the problem: [38] investigates the scheduling problem satisfying load balancing and cost considerations; [68] employs design-time knowledge and benchmarking method to deal with scheduling on heterogeneous clusters; Re-Stream [83] focuses on energy-efficient resource scheduling; [93] addresses adaptive scheduling. Others focus on automatically optimizing and tuning workloads using various techniques such as cost-based (e.g., [43,50,87]) or ML-based (e.g., [16,89,90]).…”
Section: Cloud Resource Management and Tuningmentioning
confidence: 99%
“…Past works focused on different aspects of the problem: [38] investigates the scheduling problem satisfying load balancing and cost considerations; [68] employs design-time knowledge and benchmarking method to deal with scheduling on heterogeneous clusters; Re-Stream [83] focuses on energy-efficient resource scheduling; [93] addresses adaptive scheduling. Others focus on automatically optimizing and tuning workloads using various techniques such as cost-based (e.g., [43,50,87]) or ML-based (e.g., [16,89,90]).…”
Section: Cloud Resource Management and Tuningmentioning
confidence: 99%
“…Users will submit their applications to the DSPS in the form of a query plan of certain format. The query plan is immediately converted into a directed acyclic graph (DAG) consisting of tasks which are running on the DSPS [8]. DSPS include three main data processing levels and they are:…”
Section: Distributed Stream Processing Systems (Dsps)mentioning
confidence: 99%
“…OrientStream [117] is a framework that uses incremental machine learning approaches for modeling and predicting the resource usage (i.e., CPU, memory, latency, and throughput) of workloads in distributed stream processing systems. Specifically, OrientStream uses an ensemble of four incremental learning models-namely, Naive Bayes, HoeffdingTree, Online bagging, and Nearest neighbors, which are trained using a variety of data-, plan-, operator-, and cluster-level features.…”
Section: Stream Processing Systemsmentioning
confidence: 99%