49th International Conference on Parallel Processing - ICPP 2020
DOI: 10.1145/3404397.3404472
|View full text |Cite
|
Sign up to set email alerts
|

DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2021
2021
2025
2025

Publication Types

Select...
6
1
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(9 citation statements)
references
References 12 publications
0
9
0
Order By: Relevance
“…DIESEL [35] and DIESEL+ [36] cache the metadata entirely in memory and cache the loaded data in memory. They double the training performance.…”
Section: Related Workmentioning
confidence: 99%
“…DIESEL [35] and DIESEL+ [36] cache the metadata entirely in memory and cache the loaded data in memory. They double the training performance.…”
Section: Related Workmentioning
confidence: 99%
“…These characteristics result in limited data reuse opportunities. This is in stark contrast to common benchmark models and datasets which systematically ingest the entire dataset across multiple epochs, and systems that optimize for these characteristics [24,29,36,47].…”
Section: Industry-scale Recsys Trainingmentioning
confidence: 97%
“…Similarly, DeepIO [52] and DLFS [53] leverage hardware support in the form of RDMA and NVMeOF to provide randomized minibatches from storage. DIESEL [47] co-designs storage and caching to provide efficient randomized minibatches for small files. Wang et al [48] and Kumar et al [30] explore mitigating data stalls on TPUs.…”
Section: Related Workmentioning
confidence: 99%
“…Other recent works target key DSI components, focusing on benchmark vision and NLP models. CoorDL [57], Quiver [47], and DIESEL [77] are caches that optimize for single-server training, HP tuning jobs, and small files, respectively. DeepIO [87] and DLFS [88] leverage hardware (RDMA and NVMeOF) to provide randomized minibatches from storage.…”
Section: Related Workmentioning
confidence: 99%