Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics 2019
DOI: 10.1145/3322798.3329254
|View full text |Cite
|
Sign up to set email alerts
|

Performance Prediction for Data Transfers in LCLS Workflow

Abstract: In this work, we study the use of decision tree-based models to predict the transfer rates in different parts of the data pipeline that sends experiment data from Linac Coherent Light Source (LCLS) at SLAC National Accelerator Laboratory (SLAC) to National Energy Research Scientific Computing Center (NERSC). The system monitoring the data pipeline collects a number of characteristics such as the file size, source file system, start time and so on, all of which are known at the start of the file transfer. Howev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
3
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 3 publications
0
3
0
Order By: Relevance
“…A number of works have addressed performance characterization of workflows running on NERSC resources from different aspects. Previous work [9] focuses on the data transfer part of the workflow and uses decision based techniques to predict the transfer rate of different file sizes from LCLS to NERSC. The decision is based on previously collected performance metrics such as transfer duration transfer start time, source file system, etc.…”
Section: B Related Workmentioning
confidence: 99%
“…A number of works have addressed performance characterization of workflows running on NERSC resources from different aspects. Previous work [9] focuses on the data transfer part of the workflow and uses decision based techniques to predict the transfer rate of different file sizes from LCLS to NERSC. The decision is based on previously collected performance metrics such as transfer duration transfer start time, source file system, etc.…”
Section: B Related Workmentioning
confidence: 99%
“…the average file size is greater than 40 GB, which takes around 55 seconds on average to move in, for a scientific application of Advanced Light Source (ALS) [1]. Linac Coherent Light Source (LCLS) at SLAC National Accelerator Laboratory produces terabytes of data for a single experiment, which is delivered to the National Energy Research Scientific Computing Center (NERSC) computing facility to process [5].…”
mentioning
confidence: 99%
“…Another crucial dimension for large data transfers is the accurate network performance prediction, since estimating data transfer time is vital for workflow scheduling and resource allocation. In that regard, the connection log would be a helpful resource to infer the current and future network performance, such as for change point and anomaly detection [1,10] and for throughput and packet loss prediction [4,5].…”
mentioning
confidence: 99%