Miaosen Wang scite author profile

Miaosen Wang

5 Publications

23 Citation Statements Received

71 Citation Statements Given

Affiliations

Google (United States)

Publications

Adversarial Training for Community Question Answer Selection Based on Multi-Scale Matching

Xiao, Khabsa, Wang, et al. (2019). AAAI.

Community-based question answering (CQA) websites represent an important source of information. As a result, the problem of matching the most valuable answers to their corresponding questions has become an increasingly popular research topic. We frame this task as a binary (relevant/irrelevant) classification problem, and present an adversarial training framework to alleviate the label imbalance issue. We employ a generative model to iteratively sample a subset of challenging negative samples to fool our classification model. Both models are alternately optimized using the REINFORCE algorithm. The proposed method is completely different from previous ones, in which negative samples in the training set are directly used or uniformly down-sampled. Further, we propose using Multi-scale Matching, which explicitly inspects the correlation between words and n-grams at different levels of granularity. We evaluate the proposed method on the SemEval 2016 and SemEval 2017 datasets and achieve state-of-the-art or similar performance.

How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions

Chu, Chen, Jing, et al. (2020). AAAI.

We present a large-scale dataset for the task of rewriting an ill-formed natural language question to a well-formed one. Our multi-domain question rewriting (MQR) dataset is constructed from human-contributed Stack Exchange question edit histories. The dataset contains 427,719 question pairs which come from 303 domains. We provide human annotations for a subset of the dataset as a quality estimate. When moving from ill-formed to well-formed questions, the question quality improves by an average of 45 points across three aspects. We train sequence-to-sequence neural models on the constructed dataset and obtain an improvement of 13.2% in BLEU-4 over baseline methods built from other data resources. We release the MQR dataset to encourage research on the problem of question rewriting.

Adversarial Training for Community Question Answer Selection Based on Multi-scale Matching

Xiao, Wang, Wang, et al. (2018). Preprint.

Characterizing and Supporting Question Answering in Human-to-Human Communication

Yang, Awadallah, Khabsa, et al. (2018).

How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions

Chu, Chen, et al. (2019). Preprint.
