2020
DOI: 10.48550/arxiv.2005.02563
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions

Abstract: High quality AI solutions require joint optimization of AI algorithms and their hardware implementations. In this work, we are the first to propose a fully simultaneous, Efficient Differentiable DNN (deep neural network) architecture and implementation cosearch (EDD) methodology. We formulate the co-search problem by fusing DNN search variables and hardware implementation variables into one solution space, and maximize both algorithm accuracy and hardware implementation quality. The formulation is differentiab… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(27 citation statements)
references
References 22 publications
0
27
0
Order By: Relevance
“…Nevertheless, a common drawback in these RL-based algorithm-hardware codesign approaches is that they demand significant number of GPU hours for search of both algorithm and hardware-defining parameters, which is unfeasible for real-life applications. To reduce the search cost, differentiable NAS (DNA) has been used in algorithm and hardware co-design [13]. However, it has been demonstrated in [16] that the NNs found by DNA can only achieve similar accuracy to the NNs generated by random search.…”
Section: A Algorithm and Hardware Co-designmentioning
confidence: 99%
See 3 more Smart Citations
“…Nevertheless, a common drawback in these RL-based algorithm-hardware codesign approaches is that they demand significant number of GPU hours for search of both algorithm and hardware-defining parameters, which is unfeasible for real-life applications. To reduce the search cost, differentiable NAS (DNA) has been used in algorithm and hardware co-design [13]. However, it has been demonstrated in [16] that the NNs found by DNA can only achieve similar accuracy to the NNs generated by random search.…”
Section: A Algorithm and Hardware Co-designmentioning
confidence: 99%
“…1) Hardware Design Space: This paper adopts an example design which uses a single configurable processing unit to process different layers. Although there are other designs, such as the streaming design [13], [15] with layer-wise reconfigurability, they usually require a large amount of on-chip memory to cache all the intermediate results, which restricts the model size of the NN and limits the neural architecture space. In this paper, we adopted the single processing engine design, such that our search space encompasses larger CNNs.…”
Section: A Design Spacementioning
confidence: 99%
See 2 more Smart Citations
“…DNN Algorithm and Accelerator Co-exploration. Exploring the networks and the corresponding accelerators in a joint manner [1,31,39,40,50,92] has shown great potential towards maximizing both accuracy and efficiency. Recent works have extended NAS to jointly search DNN accelerators in addition to DNN structures.…”
Section: Related Workmentioning
confidence: 99%