Learning-based risk assessment and motion estimation by vision for unmanned aerial vehicle landing in an unvisited area

Cheng, Hsiu Wen; Chen, Tsung Lin; Tien, Chung Hao

doi:10.1117/1.jei.28.6.063011

Cited by 4 publications

(1 citation statement)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Currently, motion estimation approaches are more commonly accomplished via deep learning, which has rapidly been adopted for VO applications such as scene tracking or optical flow for unmanned aerial vehicle (UAV) and robotics navigation, [14][15][16] many of which incorporate Fourier-domain or PC methods. 17-21 Supervised 22 and unsupervised 23 learning approaches have been demonstrated to predict the speed, depth, and position of system objects, even with a monocular imaging system.…”

Section: Introductionmentioning

confidence: 99%

Sequentially trained, shallow neural networks for real-time 3D odometry

Rodriguez,

Muminov,

Vuong

2023

Artificial Intelligence for Security and Defence Applications

View full text Add to dashboard Cite

Fourier-domain correlation approaches have been successful in a variety of image comparison approaches but fail when the scenes, patterns, or objects in the images are distorted. Here, we utilize the sequential training of shallow neural networks on Fourier-preprocessed video to infer 3-D movement. The bio-inspired pipeline learns x, y, and z-direction movement from high-frame-rate, low-resolution, Fourier-domain preprocessed inputs (either cross power spectra or phase correlation data). Our pipeline leverages the high sensitivity of Fourier methods in a manner that is resilient to the parallax distortion of a forward-facing camera. Via sequential training over several path trajectories, models generalize to predict the 3-D movement in unseen trajectory environments. Models with no hidden layer are less accurate initially but converge faster with sequential training over different flightpaths. Our results show important considerations and trade-offs between input data preprocessing (compression) and model complexity (convergence).

show abstract