Proceedings of International Symposium on Grids &Amp; Clouds 2022 — PoS(ISGC2022) 2022
DOI: 10.22323/1.415.0009
|View full text |Cite
|
Sign up to set email alerts
|

Caching for dataset-based workloads with heterogeneous file sizes

Abstract: Caching can effectively reduce the cost of serving content and improve the user experience. In this paper, we explore the benefits of caching for existing scientific workloads, taking the Worldwide LHC (Large Hadron Collider) Computing Grid as an example. It is a globally distributed system that stores and processes multiple hundred petabytes of data and serves the needs of thousands of scientists around the globe. Scientific computation differs from other applications like video streaming as file sizes vary f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…Another key distinction lies in the optimization goal. While most of the scientific research papers deal with cached objects of the same size and, therefore, focus on optimizing File Miss Ratio (FMR) [7], our data involves considerably large file sizes with a wide distribution (figure 1). Hence, we are primarily interested in optimizing Byte Miss Ratio (BMR) -a metric that takes into account the impact of file size on cache performance.…”
Section: Motivationmentioning
confidence: 99%
See 2 more Smart Citations
“…Another key distinction lies in the optimization goal. While most of the scientific research papers deal with cached objects of the same size and, therefore, focus on optimizing File Miss Ratio (FMR) [7], our data involves considerably large file sizes with a wide distribution (figure 1). Hence, we are primarily interested in optimizing Byte Miss Ratio (BMR) -a metric that takes into account the impact of file size on cache performance.…”
Section: Motivationmentioning
confidence: 99%
“…However, it raises the question of whether a better BMR can be achieved through the adoption of alternative cache eviction policies. To explore the performance boundaries, we examine PFOO-U.Bytes [7], representing a tighter lower bound for BHR than an infinite-size cache [9]. In figure 3 it is marked as "Optimum (Lower bound)".…”
Section: Previous Workmentioning
confidence: 99%
See 1 more Smart Citation