2020 IEEE Winter Conference on Applications of Computer Vision (WACV) 2020
DOI: 10.1109/wacv45572.2020.9093511
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised and Semi-Supervised Domain Adaptation for Action Recognition from Drones

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
40
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 68 publications
(40 citation statements)
references
References 37 publications
0
40
0
Order By: Relevance
“…Task-specific: many aerial human detection datasets are aimed for specific tasks such as sport [23], search and rescue [94], synthetic data from game engines [132], and multiview [113]. AVI [127] is for violent recognition from aerial videos.…”
Section: Datasets For Aerial Action Recognitionmentioning
confidence: 99%
See 2 more Smart Citations
“…Task-specific: many aerial human detection datasets are aimed for specific tasks such as sport [23], search and rescue [94], synthetic data from game engines [132], and multiview [113]. AVI [127] is for violent recognition from aerial videos.…”
Section: Datasets For Aerial Action Recognitionmentioning
confidence: 99%
“…3D CNNs: 3D CNNs are still the most popular networks for aerial action recognition. Among the modern networks, I3D [19] has been widely adopted for aerial action recognition [23], [27], [80], [98], [132]. C3D [139] has also been utilized for aerial action recognition [24], [98].…”
Section: Two-stream Cnnsmentioning
confidence: 99%
See 1 more Smart Citation
“…recently. Early attempts align distributions across the source and target domains using hand-crafted features [3,67], while recent deep learning based methods [18,37,9,6,10,36] leverage the insight from UDA on image classification and extend it to the video case. For instance, approaches [6,37] utilize adversarial feature alignment [14,54] and propose a temporal version with attention modules.…”
Section: Related Workmentioning
confidence: 99%
“…Existing methods have made significant progress in imagebased tasks, such as classification [33,14,54,42], semantic segmentation [16,53,56,31,38] and object detection [8,43,24,17]. While several works have sought to extend this success to video-based tasks like action recognition by aligning appearance (e.g., RGB) features through adversarial learning [6,9,37], challenges persist in video adaptation tasks due to the greater complexity of the video data. Moreover, different from the image data, domain shifts in videos for action recognition often involve more complicated environments, which increases the difficulty for adaptation.…”
Section: Introductionmentioning
confidence: 99%