2023
DOI: 10.3390/ai4040047
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning Performance Characterization on GPUs for Various Quantization Frameworks

Muhammad Ali Shafique,
Arslan Munir,
Joonho Kong

Abstract: Deep learning is employed in many applications, such as computer vision, natural language processing, robotics, and recommender systems. Large and complex neural networks lead to high accuracy; however, they adversely affect many aspects of deep learning performance, such as training time, latency, throughput, energy consumption, and memory usage in the training and inference stages. To solve these challenges, various optimization techniques and frameworks have been developed for the efficient performance of d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
references
References 49 publications
0
0
0
Order By: Relevance