56th Annual IEEE/ACM International Symposium on Microarchitecture 2023
DOI: 10.1145/3613424.3614303
|View full text |Cite
|
Sign up to set email alerts
|

TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs

Haotian Tang,
Shang Yang,
Zhijian Liu
et al.

Abstract: Sparse convolution plays a pivotal role in emerging workloads, including point cloud processing in AR/VR, autonomous driving, and graph understanding in recommendation systems. Since the computation pattern is sparse and irregular, specialized high-performance kernels are required. Existing GPU libraries offer two dataflow types for sparse convolution. The gather-GEMM-scatter dataflow is easy to implement but not optimal in performance, while the dataflows with overlapped computation and memory access (e.g. im… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
references
References 43 publications
0
0
0
Order By: Relevance