Findings of the Association for Computational Linguistics: ACL 2022
DOI: 10.18653/v1/2022.findings-acl.150

Dict-BERT: Enhancing Language Model Pre-training with Dictionary

Abstract: Pre-trained language models (PLMs) aim to learn universal language representations by conducting self-supervised training tasks on large-scale corpora. Since PLMs capture word semantics in different contexts, the quality of word representations depends heavily on word frequency, which usually follows a heavy-tailed distribution in the pre-training corpus. Thus, the embeddings of rare words on the tail are usually poorly optimized. In this work, we focus on enhancing language model pre-training by leveraging definitions of rare words in dictionaries.
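To make the idea concrete, here is a minimal sketch of augmenting a pre-training example with dictionary definitions of its rare words before standard masked language modeling. The frequency table, rarity threshold, DICTIONARY lookup, and [SEP]-style concatenation are illustrative assumptions, not the paper's exact recipe.

```python
from collections import Counter

# Hypothetical corpus word counts and dictionary entries (illustrative only).
WORD_FREQ = Counter({"the": 120000, "model": 45000, "remark": 900, "anserine": 3})
RARE_THRESHOLD = 10  # words seen fewer than this many times count as "rare"
DICTIONARY = {"anserine": "of or relating to geese; silly or foolish"}

def augment_with_definitions(sentence: str) -> str:
    """Append dictionary definitions of the sentence's rare words."""
    tokens = [t.strip(".,;:!?").lower() for t in sentence.split()]
    rare = [t for t in tokens if WORD_FREQ.get(t, 0) < RARE_THRESHOLD and t in DICTIONARY]
    if not rare:
        return sentence
    definitions = " [SEP] ".join(f"{w}: {DICTIONARY[w]}" for w in rare)
    # The augmented string would then go through ordinary MLM pre-training,
    # so the rare word co-occurs with its definition.
    return f"{sentence} [SEP] {definitions}"

print(augment_with_definitions("The model misread the anserine remark."))
# -> "The model misread the anserine remark. [SEP] anserine: of or relating to geese; silly or foolish"
```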

Cited by 33 publications (36 citation statements)
References 22 publications

“…Another research line aims to further boost the performance of large-scale LLMs by leveraging their self-generated content. Some works use the self-generated text as context to help the model answer questions, such as eliciting intermediate rationales as CoT (Kojima et al., 2023) or generating background articles for reading comprehension (Yu et al., 2023). Others instruct LLMs to generate demonstrations for ICL during inference, such as prompting LLMs to generate reliable QA pairs as self-prompted in-context demonstrations.…”
Section: Model Enhancement via LLM Generation (mentioning)
confidence: 99%
“…Incorporating external knowledge is essential for many NLG tasks to augment the limited textual information (Yu et al., 2022c; Dong et al., 2021; Yu et al., 2022b). Some recent work explored using graph neural networks (GNNs) to reason over multi-hop relational knowledge graph (KG) paths (Zhou et al., 2018; Jiang et al., 2019; Zhang et al., 2020a; Wu et al., 2020; Yu et al., 2022a; Zeng et al., 2021).…”
Section: Knowledge Graph for Text Generation (mentioning)
confidence: 99%
“…Finally, we devise a denoising auto-encoder-style learning objective and train the network to reconstruct selectively masked sentence parts. Our use of symbolic knowledge (Yu et al., 2021) of IEs to aid the learning of their embeddings results in the model needing a significantly smaller amount of data (∼60 MB) than that required for LM pre-training (∼160 GB of text for BART).…”
Section: All At Sea (mentioning)
confidence: 99%