Wenyue Hua scite author profile

Wenyue Hua

5Publications

6Citation Statements Received

53Citation Statements Given

How they've been cited

How they cite others

Affiliations

Rutgers Sexual and Reproductive Health and Rights

Publications

Order By: Most citations

A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression

Sun¹,

Hua²,

Liu³

et al. 2020

View full text Add to dashboard Cite

Existing OIE (Open Information Extraction) algorithms are independent of each other such that there exist lots of redundant works; the featured strategies are not reusable and not adaptive to new tasks. This paper proposes a new pipeline to build OIE systems, where an Open-domain Information eXpression (OIX) task is proposed to provide a platform for all OIE strategies. The OIX is an OIE friendly expression of a sentence without information loss. The generation procedure of OIX contains shared works of OIE algorithms so that OIE strategies can be developed on the platform of OIX as inference operations focusing on more critical problems. Based on the same platform of OIX, the OIE strategies are reusable, and people can select a set of strategies to assemble their algorithm for a specific task so that the adaptability may be significantly increased. This paper focuses on the task of OIX and propose a solution -Open Information Annotation (OIA). OIA is a predicate-function-argument annotation for sentences. We label a data set of sentence-OIA pairs and propose a dependency-based rule system to generate OIA annotations from sentences. The evaluation results reveal that learning the OIA from a sentence is a challenge owing to the complexity of natural language sentences, and it is worthy of attracting more attention from the research community.

show abstract

LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension

Hua¹,

Zhang²,

Chen³

et al. 2022

Preprint

View full text Add to dashboard Cite

EntQA: Entity Linking as Question Answering

Zhang

Hua

Stratos

2021

Preprint

View full text Add to dashboard Cite

A conventional approach to entity linking is to first find mentions in a given document and then infer their underlying entities in the knowledge base. A well-known limitation of this approach is that it requires finding mentions without knowing their entities, which is unnatural and difficult. We present a new model that does not suffer from this limitation called EntQA, which stands for Entity linking as Question Answering. EntQA first proposes candidate entities with a fast retrieval module, and then scrutinizes the document to find mentions of each candidate with a powerful reader module. Our approach combines progress in entity linking with that in open-domain question answering and capitalizes on pretrained models for dense entity retrieval and reading comprehension. Unlike in previous works, we do not rely on a mention-candidates dictionary or large-scale weak supervision. EntQA achieves strong results on the GERBIL benchmarking platform.

show abstract

Discover, Explanation, Improvement: Automatic Slice Detection Framework for Natural Language Processing

Hua¹,

Jin²,

Song³

et al. 2022

Preprint

View full text Add to dashboard Cite

Current natural language processing (NLP) models such as BERT and RoBERTa have achieved high overall performance, but they often make systematic errors due to bias or certain difficult features to learn. Thus research on slice detection models (SDM) which automatically identifies underperforming groups of datapoints has gradually caught more attention, which aims at both understanding model behaviors and providing insights for future model training and designing. However, there is little systematic research on SDM and quantitative evaluation of its assessment for NLP models. Our paper fills this gap by proposing "Discover, Explanation, Improvement (DEI)" framework that discovers coherent and underperforming groups of datapoints and unites datapoints of each slice under human-understandable concepts; it also provides comprehensive evaluation tasks and the corresponding quantitative metrics, which enable convenient comparison for future works. Results show that our framework can accurately select error-prone datapoints with informative semantic features that summarize error patterns, based on which it directly improves model performance by an average of 2.85 points without tuning any parameters.

show abstract

System 1 + System 2 = Better World: Neural-Symbolic Chain of Logic Reasoning

Hua¹,

Zhang²

2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Wenyue Hua

A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression

LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension

EntQA: Entity Linking as Question Answering

Discover, Explanation, Improvement: Automatic Slice Detection Framework for Natural Language Processing

System 1 + System 2 = Better World: Neural-Symbolic Chain of Logic Reasoning

Contact Info

Product

Resources

About