Bridging the Gap between Language Models and Cross-Lingual Sequence Labeling

Shou, Linjun; Gong, Mali; Pei, Jian; Jiang, Daxin

doi:10.18653/v1/2022.naacl-main.139

Cited by 3 publications

(3 citation statements)

References 31 publications


“…Furthermore, we collect Orca in few-shot settings, challenging models to learn unseen domains with few samples. Although one can question previous full-data CMRC datasets also could support few-shot training for CMRC models, we argue that it could lead to unclear comparisons due to the inconsistent settings of different works on these datasets (Chen et al, 2022b). In contrast, we present a single standard benchmark for thorough comparisons.…”

Section: B Related Work (mentioning)

confidence: 94%

Orca: A Few-shot Benchmark for Chinese Conversational Machine Reading Comprehension

Chen, Li, et al. 2023

Findings of the Association for Computational Linguistics: EMNLP 2023


The Conversational Machine Reading Comprehension (CMRC) task aims to answer questions in conversations, which has been a hot research topic because of its wide applications. However, existing CMRC benchmarks in which each conversation is coupled with a static passage are inconsistent with real scenarios. In this regard, it is hard to evaluate model's comprehension ability towards real scenarios. In this work, we propose the first Chinese CMRC benchmark Orca and further provide zero-shot/few-shot settings to evaluate model's generalization ability towards diverse domains. We collect 831 hot-topic driven conversations with 4,742 turns in total. Each turn of a conversation is assigned with a response-related passage, aiming to evaluate model's comprehension ability more reasonably. The topics of conversations are collected from social media platform and cover 33 domains, trying to be consistent with real scenarios. Importantly, answers in Orca are all well-annotated natural responses rather than specific spans or short phrases in previous datasets. We implement two strong frameworks to tackle the challenge in Orca. The results indicate there is substantial room for improvement for strong baselines such as ChatGPT on our CMRC benchmark. Our codes and datasets are available at: https://github.com/nuochenpku/Orca.


“…Based on these challenging datasets, a great number of end-to-end approaches have been proposed, including BiDAF (Seo et al, 2016), DCN (Xiong et al, 2016), R-Net (Wang et al, 2017). In MRC tasks, attention mechanism (Dong et al, 2020a; Gao et al, 2020; Zhu et al, 2020; Chen et al, 2022) have become an essential part to capture dependencies without considering their distance in the input/output sequences. Recently, some works show that well pre-trained models are powerful and convenient for downstream tasks, such as R-Trans (Liu et al, 2019a), DCMN+, ALBERT (Lan et al, 2020) and GF-Net (Lee and Kim, 2020), which facilitate us to take pre-trained models as our backbone encoder.…”

Section: Related Work (mentioning)

confidence: 99%

Exploring and Exploiting Multi-Granularity Representations for Machine Reading Comprehension

You¹

2022

Preprint

Self Cite


Recently, the attention-enhanced multi-layer encoder, such as Transformer, has been extensively studied in Machine Reading Comprehension (MRC). To predict the answer, it is common practice to employ a predictor to draw information only from the final encoder layer which generates the coarse-grained representations of the source sequences, i.e., passage and question. The analysis shows that the representation of source sequence becomes more coarse-grained from fine-grained as the encoding layer increases. It is generally believed that with the growing number of layers in deep neural networks, the encoding process will gather relevant information for each location increasingly, resulting in more coarse-grained representations, which adds the likelihood of similarity to other locations (referring to homogeneity). Such phenomenon will mislead the model to make wrong judgement and degrade the performance. In this paper, we argue that it would be better if the predictor could exploit representations of different granularity from the encoder, providing different views of the source sequences, such that the expressive power of the model could be fully utilized. To this end, we propose a novel approach called Adaptive Bidirectional Attention-Capsule Network (ABA-Net), which adaptively exploits the source representations of different levels to the predictor. Furthermore, due to the better representations are at the core for boosting MRC performance, the capsule network and self-attention module are carefully designed as the building blocks of our encoders, which provides the capability to explore the local and global representations, respectively. Experimental results on three benchmark datasets, i.e., SQuAD 1.0, SQuAD 2.0 and COQA, demonstrate the effectiveness of our approach. In particular, we set the new state-of-the-art performance on the SQuAD 1.0 dataset.


“…Much of this popularity can be attributed to the release of many annotated and publicly available datasets (Rajpurkar et al, 2016; Trischler et al, 2016; Chen et al, 2022a; You et al, 2022; Chen et al, 2023a). Formally, these MRC efforts can be classified into two most popular streams from the answer type perspective: span-extraction (Rajpurkar et al, 2016; Trischler et al, 2016; Cui et al, 2019; Chen et al, 2022b; You et al, 2021a) and multiple choices (Lai et al, 2017; Zellers et al, 2018; Wang et al, 2020). The former requires the model to locate the text span in the given passage as the answer, e.g., SQuAD (Rajpurkar et al, 2016) and NewQA (Trischler et al, 2016).…”

Section: Introduction (mentioning)

confidence: 99%

Natural Response Generation for Chinese Reading Comprehension

Chen, Li, Bao, et al. 2023

Findings of the Association for Computational Linguistics: EMNLP 2023


Machine reading comprehension (MRC) is an important area of conversation agents and draws a lot of attention. However, there is a notable limitation to current MRC benchmarks: The labeled answers are mostly either spans extracted from the target corpus or the choices of the given candidates, ignoring the natural aspect of high-quality responses. As a result, MRC models trained on these datasets can not generate human-like responses in real QA scenarios. To this end, we construct a new dataset called Penguin to promote the research of MRC, providing a training and test bed for natural response generation to real scenarios. Concretely, Penguin consists of 200k training data with high-quality fluent, and well-informed responses. Penguin is the first benchmark towards natural response generation in Chinese MRC on a relatively large scale. To address the challenges in Penguin, we develop two strong baselines: end-to-end and two-stage frameworks. Following that, we further design Prompt-BART: fine-tuning the pre-trained generative language models with a mixture of prefix prompts in Penguin. Extensive experiments validated the effectiveness of this design. Our benchmark and codes are available at https://github.com/nuochenpku/Penguin.
