Xin Tan scite author profile

Abstract-Reinforcement learning provides a powerful and flexible framework for automated acquisition of robotic motion skills. However, applying reinforcement learning requires a sufficiently detailed representation of the state, including the configuration of task-relevant objects. We present an approach that automates state-space construction by learning a state representation directly from camera images. Our method uses a deep spatial autoencoder to acquire a set of feature points that describe the environment for the current task, such as the positions of objects, and then learns a motion skill with these feature points using an efficient reinforcement learning method based on local linear models. The resulting controller reacts continuously to the learned feature points, allowing the robot to dynamically manipulate objects in the world with closed-loop control. We demonstrate our method with a PR2 robot on tasks that include pushing a free-standing toy block, picking up a bag of rice using a spatula, and hanging a loop of rope on a hook at various positions. In each task, our method automatically learns to track task-relevant objects and manipulate their configuration with the robot's arm.

show abstract

Re-ID Driven Localization Refinement for Person Search

Han

Zhong

et al. 2019

129

View full text Add to dashboard Cite

Person search aims at localizing and identifying a query person from a gallery of uncropped scene images. Different from person re-identification (re-ID), its performance also depends on the localization accuracy of a pedestrian detector. The state-of-the-art methods train the detector individually, and the detected bounding boxes may be suboptimal for the following re-ID task. To alleviate this issue, we propose a re-ID driven localization refinement framework for providing the refined detection boxes for person search. Specifically, we develop a differentiable ROI transform layer to effectively transform the bounding boxes from the original images. Thus, the box coordinates can be supervised by the re-ID training other than the original detection task. With this supervision, the detector can generate more reliable bounding boxes, and the downstream re-ID model can produce more discriminative embeddings based on the refined person localizations. Extensive experimental results on the widely used benchmarks demonstrate that our proposed method performs favorably against the stateof-the-art person search methods.

show abstract

Hierarchical Modeling of Global Context for Document-Level Neural Machine Translation

Tan¹,

Zhang²,

Xiong³

et al. 2019

View full text Add to dashboard Cite

Document-level machine translation (MT) remains challenging due to the difficulty in efficiently using document context for translation. In this paper, we propose a hierarchical model to learn the global context for documentlevel neural machine translation (NMT). This is done through a sentence encoder to capture intra-sentence dependencies and a document encoder to model document-level intersentence consistency and coherence. With this hierarchical architecture, we feedback the extracted global document context to each word in a top-down fashion to distinguish different translations of a word according to its specific surrounding context. In addition, since largescale in-domain document-level parallel corpora are usually unavailable, we use a twostep training strategy to take advantage of a large-scale corpus with out-of-domain parallel sentence pairs and a small-scale corpus with in-domain parallel document pairs to achieve the domain adaptability. Experimental results on several benchmark corpora show that our proposed model can significantly improve document-level translation performance over several strong NMT baselines.

show abstract

Molecular and phenotypic spectrum of Noonan syndrome in Chinese patients

Yao

Tan

et al. 2019

Clinical Genetics

View full text Add to dashboard Cite

Noonan syndrome (NS) is a common autosomal dominant/recessive disorder. No large‐scale study has been conducted on NS in China, which is the most populous country in the world. Next‐generation sequencing (NGS) was used to identify pathogenic variants in patients that exhibited NS‐related phenotypes. We assessed the facial features and clinical manifestations of patients with pathogenic or likely pathogenic variants in the RAS‐MAPK signaling pathway. Gene‐related Chinese NS facial features were described using artificial intelligence (AI).NGS identified pathogenic variants in 103 Chinese patients in eight NS‐related genes: PTPN11 (48.5%), SOS1 (12.6%), SHOC2 (11.7%), KRAS (9.71%), RAF1 (7.77%), RIT1 (6.8%), CBL (0.97%), NRAS (0.97%), and LZTR1 (0.97%). Gene‐related facial representations showed that each gene was associated with different facial details. Eight novel pathogenic variants were detected and clinical features because of specific genetic variants were reported, including hearing loss, cancer risk due to a PTPN11 pathogenic variant, and ubiquitous abnormal intracranial structure due to SHOC2 pathogenic variants. NGS facilitates the diagnosis of NS, especially for patients with mild/moderate and atypical symptoms. Our study describes the genotypic and phenotypic spectra of NS in China, providing new insights into distinctive clinical features due to specific pathogenic variants.

show abstract

Rethinking Efficient Lane Detection via Curve Modeling

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xin Tan

Deep spatial autoencoders for visuomotor learning

Re-ID Driven Localization Refinement for Person Search

Hierarchical Modeling of Global Context for Document-Level Neural Machine Translation

Molecular and phenotypic spectrum of Noonan syndrome in Chinese patients

Rethinking Efficient Lane Detection via Curve Modeling

Contact Info

Product

Resources

About