Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images

Liu, Shichang; Ma, Ming; Li, Haiyang; Ning, Hanyang; Wang, Min

doi:10.3390/s22218331

Cited by 1 publication

(2 citation statements)

References 30 publications


“…Marlin et al [97] (2023.6) also employ MAE for learning facial features, with a focus on facial details and facial features. PoseMask [98] (2022.10) applies MAE for pose estimation in classroom scenarios, using heatmaps as reference masks to estimate poses in crowded or occluded scenes. Furthermore, Sheng et al [99] (2022.10) use MAE for feature extraction of gestures in Spatial-Temporal Motion Maps (STMM), improving gesture recognition accuracy.…”

Section: B Real-world and Unmodified Images (mentioning)

confidence: 99%


Masked Autoencoders in Computer Vision: A Comprehensive Survey

Zhou, Liu

2023

IEEE Access


The masked autoencoder (MAE) is a Transformer-based deep learning method. Originally applied to images, it has since been extended to video, audio, and other temporal prediction tasks. In computer vision, MAE performs well in classification, prediction, and object detection tasks. In terms of specific applications, MAE has achieved notable results in medicine, geography, 3D point clouds, and machine troubleshooting. Since its introduction at the end of 2021, more than 300 related preprints have appeared, and MAE featured prominently at top-tier computer vision conferences during 2022 and 2023. Given MAE's current popularity and its prospects for future development, we conduct a relatively comprehensive survey of MAE, mainly covering articles that have been officially published so far. We review and classify the improvements to MAE and present representative applications in computer vision. Finally, we discuss possible future research directions and development areas based on the characteristics of MAE, hoping that our work can serve as a reference for future work on MAE.

INDEX TERMS: Computer vision survey, MAE, masked autoencoders, masked image modeling.


“…This means that even if the reconstructed results differ from the original, they still possess a certain level of coherence and can connect with contextual information. Therefore, MAE has significant applications in image generation [98], [104], [110], and also performs well in tasks that require temporal coherence [4], [135].…”

Section: B Applications (mentioning)

confidence: 99%
