Yongqiang Dou scite author profile

Yongqiang Dou

4Publications

13Citation Statements Received

82Citation Statements Given

How they've been cited

How they cite others

Affiliations

Beijing Jingshida Electromechanical Equipment Research Institute, State Forestry and Grassland Administration

Publications

Order By: Most citations

VIMA: General Robot Manipulation with Multimodal Prompts

Jiang¹,

Gupta²,

Zhang³

et al. 2022

Preprint

View full text Add to dashboard Cite

Prompt-based learning has emerged as a successful paradigm in natural language processing, where a single general-purpose language model can be instructed to perform any task specified by input prompts. Yet task specification in robotics comes in various forms, such as imitating one-shot demonstrations, following language instructions, and reaching visual goals. They are often considered different tasks and tackled by specialized models. This work shows that we can express a wide spectrum of robot manipulation tasks with multimodal prompts, interleaving textual and visual tokens. We design a transformer-based generalist robot agent, VIMA, that processes these prompts and outputs motor actions autoregressively. To train and evaluate VIMA, we develop a new simulation benchmark with thousands of procedurally-generated tabletop tasks with multimodal prompts, 600K+ expert trajectories for imitation learning, and four levels of evaluation protocol for systematic generalization. VIMA achieves strong scalability in both model capacity and data size. It outperforms prior SOTA methods in the hardest zero-shot generalization setting by up to 2.9× task success rate given the same training data. With 10× less training data, VIMA still performs 2.7× better than the top competing approach. We open-source all code, pretrained models, dataset, and simulation benchmark at https://vimalabs.github.io.

show abstract

Research on Sliding Characteristics of Planetary Roller Screw Mechanism

Chen

Shi

Dou

et al. 2022

View full text Add to dashboard Cite

Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection

Dou

Yang

et al. 2021

View full text Add to dashboard Cite

Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection

Dou

Yang

et al. 2020

Preprint

View full text Add to dashboard Cite

It becomes urgent to design effective anti-spoofing algorithms for vulnerable automatic speaker verification systems due to the advancement of high-quality playback devices. Current studies mainly treat anti-spoofing as a binary classification problem between bonafide and spoofed utterances, while lack of indistinguishable samples makes it difficult to train a robust spoofing detector. In this paper, we argue that for antispoofing, it needs more attention for indistinguishable samples over easily-classified ones in the modeling process, to make correct discrimination a top priority. Therefore, to mitigate the data discrepancy between training and inference, we propose to leverage a balanced focal loss function as the training objective to dynamically scale the loss based on the traits of the sample itself. Besides, in the experiments, we select three kinds of features that contain both magnitude-based and phase-based information to form complementary and informative features. Experimental results on the ASVspoof2019 dataset demonstrate the superiority of the proposed methods by comparison between our systems and top-performing ones. Systems trained with the balanced focal loss perform significantly better than conventional cross-entropy loss. With complementary features, our fusion system with only three kinds of features outperforms other systems containing five or more complex single models by 22.5% for min-tDCF and 7% for EER, achieving a min-tDCF and an EER of 0.0124 and 0.55% respectively. Furthermore, we present and discuss the evaluation results on real replay data apart from the simulated ASVspoof2019 data, indicating that research for anti-spoofing still has a long way to go.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yongqiang Dou

VIMA: General Robot Manipulation with Multimodal Prompts

Research on Sliding Characteristics of Planetary Roller Screw Mechanism

Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection

Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection

Contact Info

Product

Resources

About