Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022
DOI: 10.24963/ijcai.2022/153
|View full text |Cite
|
Sign up to set email alerts
|

Dite-HRNet: Dynamic Lightweight High-Resolution Network for Human Pose Estimation

Abstract: Accurate and real-time LiDAR semantic segmentation is necessary for advanced autonomous driving systems. To guarantee a fast inference speed, previous methods utilize the highly optimized 2D convolutions to extract features on the range view (RV), which is the most compact representation of the LiDAR point clouds. However, these methods often suffer from lower accuracy for two reasons: 1) the information loss during the projection from 3D points to the RV, 2) the semantic ambiguity when 3D points labels are as… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 37 publications
(15 citation statements)
references
References 0 publications
0
15
0
Order By: Relevance
“…Lite-HRNet 4 introduces a conditional channel weighting module consisting of channel attention mechanisms to substitute the computationally expensive pointwise convolutions in the shuffle block. Dite-HRNet 5 imparts dynamic attributes to the convolutional kernels, improving network computational efficiency and enhancing long-range spatial dependencies. However, the attention module adopted by the dynamic split convolution in Dite-HRNet closely resembles its proposed global context modeling module.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Lite-HRNet 4 introduces a conditional channel weighting module consisting of channel attention mechanisms to substitute the computationally expensive pointwise convolutions in the shuffle block. Dite-HRNet 5 imparts dynamic attributes to the convolutional kernels, improving network computational efficiency and enhancing long-range spatial dependencies. However, the attention module adopted by the dynamic split convolution in Dite-HRNet closely resembles its proposed global context modeling module.…”
Section: Related Workmentioning
confidence: 99%
“…However, its heightened complexity presents challenges for implementation on resource-constrained devices. Recent research endeavors 4 7 have been devoted to enhancing the efficiency of networks. Small HRNet 1 reduces the depth and width of HRNet 3 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Top-down pose estimation algorithms [15,16] belong to two-stage networks, which locate the specific position and size of instances through object detection, and then perform fine-grained estimation using single-person pose estimation methods to obtain the keypoints of the entire image instance. Mask RCNN [17] extracts image features, generates a series of candidate regions through Region Proposal Network (RPN), and then performs classification and keypoint prediction for each candidate region.…”
Section: Top-downmentioning
confidence: 99%
“…LiteHRNet [20] applies the shuffle block in ShuffleNet to HRNet to produce stronger performance in lightweight networks, while improving model inference speed. Dite-HRNet [15] proposes dynamic multi-scale context blocks and dynamic global context blocks to reduce model parameters while ensuring model accuracy. UULPN [21] designs a lightweight bottleneck module with a reparameterization structure to improve model accuracy while maintaining the same computational cost.…”
Section: Top-downmentioning
confidence: 99%