2019
DOI: 10.1016/j.future.2018.12.002
|View full text |Cite
|
Sign up to set email alerts
|

LADRA: Log-based abnormal task detection and root-cause analysis in big data processing with Spark

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
12
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 29 publications
(12 citation statements)
references
References 6 publications
0
12
0
Order By: Relevance
“…Along with these similar works, there are some researchers using statistical and machine learning methods for root-cause analysis. The authors of [30] introduce a Regression Neural Network (RNN) based algorithm to troubleshoot the causes of stragglers by processing Spark logs. More algorithms such as the associated tree and fuzzy data envelopment analysis [31] and Reinforcement Learning [32] are applied for finding the reasons of stragglers in Hadoop and Spark.…”
Section: Related Workmentioning
confidence: 99%
“…Along with these similar works, there are some researchers using statistical and machine learning methods for root-cause analysis. The authors of [30] introduce a Regression Neural Network (RNN) based algorithm to troubleshoot the causes of stragglers by processing Spark logs. More algorithms such as the associated tree and fuzzy data envelopment analysis [31] and Reinforcement Learning [32] are applied for finding the reasons of stragglers in Hadoop and Spark.…”
Section: Related Workmentioning
confidence: 99%
“…The anomalies are detected on a per task basis, by comparing the tasks to the average and standard deviation of task time in each stage. However, since the statistical approach does not provide good precision on classifying the root cause, the authors extend their work in a follow up paper [6]. In the extension, the authors develop a system that analyzes Spark application execution logs to identify resource bottlenecks, e.g., processor, memory, disk, and network, causing abnormally high execution times.…”
Section: Related Workmentioning
confidence: 99%
“…Monitoring data may not be always accessible from the user side since the monitoring tools are hard to install and tune. Hence, some studies focus on the offline strategy by analyzing logs instead of monitoring Lu et al [5]. Cluster managers, e.g., YARN in Vavilapalli et al [6], Isard et al [7], Verma et al [8], have different focuses.…”
Section: Introductionmentioning
confidence: 99%