Lam Huynh scite author profile

Recovering the scene depth from a single image is an ill-posed problem that requires additional priors, often referred to as monocular depth cues, to disambiguate different 3D interpretations. In recent works, those priors have been learned in an end-to-end manner from large datasets by using deep neural networks. In this paper, we propose guiding depth estimation to favor planar structures that are ubiquitous especially in indoor environments. This is achieved by incorporating a non-local coplanarity constraint to the network with a novel attention mechanism called depth-attention volume (DAV). Experiments on two popular indoor datasets, namely NYU-Depth-v2 and ScanNet, show that our method achieves state-of-the-art depth estimation results while using only a fraction of the number of parameters needed by the competing methods.

show abstract

StressNAS: Affect State and Stress Detection Using Neural Architecture Search

Huynh¹,

Nguyen

et al. 2021

View full text Add to dashboard Cite

Smartwatches have rapidly evolved towards capabilities to accurately capture physiological signals. As an appealing application, stress detection attracts many studies due to its potential benefits to human health. It is propitious to investigate the applicability of deep neural networks (DNN) to enhance human decision-making through physiological signals. However, manually engineering DNN proves a tedious task especially in stress detection due to the complex nature of this phenomenon. To this end, we propose an optimized deep neural network training scheme using neural architecture search merely using wrist-worn data from WESAD. Experiments show that our approach outperforms traditional ML methods by 8.22% and 6.02% in the three-state and two-state classifiers, respectively, using the combination of WESAD wrist signals. Moreover, the proposed method can minimize the need for humandesign DNN while improving performance by 4.39% (three-state) and 8.99% (binary). CCS CONCEPTS• Computing methodologies → Machine learning; • Applied computing → Life and medical sciences.

show abstract

Boosting Monocular Depth Estimation with Lightweight 3D Point Fusion

Huynh

Nguyen-Ha

Matas

et al. 2021

View full text Add to dashboard Cite

Predicting Novel Views Using Generative Adversarial Query Network

Nguyen-Ha

Huynh

Rahtu

et al. 2019

View full text Add to dashboard Cite

The problem of predicting a novel view of the scene using an arbitrary number of observations is a challenging problem for computers as well as for humans. This paper introduces the Generative Adversarial Query Network (GAQN), a general learning framework for novel view synthesis that combines Generative Query Network (GQN) and Generative Adversarial Networks (GANs). The conventional GQN encodes input views into a latent representation that is used to generate a new view through a recurrent variational decoder. The proposed GAQN builds on this work by adding two novel aspects: First, we extend the current GQN architecture with an adversarial loss function for improving the visual quality and convergence speed. Second, we introduce a featurematching loss function for stabilizing the training procedure. The experiments demonstrate that GAQN is able to produce high-quality results and faster convergence compared to the conventional approach.Keywords: novel view synthesis · generative adversarial query network · mean feature matching loss

show abstract

Lightweight Monocular Depth with a Novel Neural Architecture Search Method

Huynh

Nguyen-Ha

Matas

et al. 2022

View full text Add to dashboard Cite

This paper presents a novel neural architecture search method, called LiDNAS, for generating lightweight monocular depth estimation models. Unlike previous neural architecture search (NAS) approaches, where finding optimized networks is computationally demanding, the introduced novel Assisted Tabu Search leads to efficient architecture exploration. Moreover, we construct the search space on a pre-defined backbone network to balance layer diversity and search space size. The LiDNAS method outperforms the state-of-the-art NAS approach, proposed for disparity and depth estimation, in terms of search efficiency and output model performance. The LiDNAS optimized models achieve result superior to compact depth estimation state-of-the-art on NYU-Depth-v2, KITTI, and Scan-Net, while being 7%-500% more compact in size, i.e the number of model parameters.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Lam Huynh

Guiding Monocular Depth Estimation Using Depth-Attention Volume

StressNAS: Affect State and Stress Detection Using Neural Architecture Search

Boosting Monocular Depth Estimation with Lightweight 3D Point Fusion

Predicting Novel Views Using Generative Adversarial Query Network

Lightweight Monocular Depth with a Novel Neural Architecture Search Method

Contact Info

Product

Resources

About