Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems 2022
DOI: 10.1145/3503222.3507735
|View full text |Cite
|
Sign up to set email alerts
|

NASPipe: high performance and reproducible pipeline parallel supernet training via causal synchronous parallelism

Abstract: Supernet training, a prevalent and important paradigm in Neural Architecture Search, embeds the whole DNN architecture search space into one monolithic supernet, iteratively activates a subset of the supernet (i.e., a subnet) for fitting each batch of data, and searches a high-quality subnet which meets specific requirements.Although training subnets in parallel on multiple GPUs is desirable for acceleration, there inherently exists a race hazard that concurrent subnets may access the same DNN layers. Existing… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(1 citation statement)
references
References 29 publications
0
1
0
Order By: Relevance
“…Especially for methods that explore large solution spaces such as the neural architecture search (NAS) [3,4], the problem becomes even more significant. This problem mandates the use of model parallelism [5,6], which creates substantial throughput loss with inevitable pipeline bubbles.…”
Section: Introductionmentioning
confidence: 99%
“…Especially for methods that explore large solution spaces such as the neural architecture search (NAS) [3,4], the problem becomes even more significant. This problem mandates the use of model parallelism [5,6], which creates substantial throughput loss with inevitable pipeline bubbles.…”
Section: Introductionmentioning
confidence: 99%