Rethinking Self-Attention: Towards Interpretability in Neural Parsing

Mrini, Khalil; Dernoncourt, Franck; Tran, Quan Hung; Bui, Trung; Chang, Walter; Nakashole, Ndapa

doi:10.18653/v1/2020.findings-emnlp.65

Cited by 70 publications

(49 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We use the same setting as in Section 4. Mrini et al (2020) is not identical to the dataset in previous work such as Zhang et al (2020) and Wang and Tu (2020). ‡ : For reference, we confirmed with the authors of He and Choi (2020) that they used a different data pre-processing script with previous work.…”

Section: Comparison With Embedding Weighting and Ensemble Approachesmentioning

confidence: 97%

Automated Concatenation of Embeddings for Structured Prediction

Wang¹,

Jiang²,

Bach³

et al. 2021

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer

View full text Add to dashboard Cite

Section: Comparison With Embedding Weighting and Ensemble Approachesmentioning

confidence: 97%

Automated Concatenation of Embeddings for Structured Prediction

Wang¹,

Jiang²,

Bach³

et al. 2021

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer

View full text Add to dashboard Cite

“…We extract instances of clarifications from a resource of revision edits called wikiHowToImprove . Specifically, we used a state-of-the-art a constituency parser (Mrini et al, 2020) to preprocess all revisions from wikiHow- ToImprove and applied a set of rule-based filters to identify specific types of edits (see Table 2).…”

Section: Data Collectionmentioning

confidence: 99%

UnImplicit Shared Task Report: Detecting Clarification Requirements in Instructional Text

Roth¹,

Anthonio²

2021

Proceedings of the 1st Workshop on Understanding Implicit and Underspecified Language

View full text Add to dashboard Cite

This paper describes the data, task setup, and results of the shared task at the First Workshop on Understanding Implicit and Underspecified Language (UnImplicit). The task requires computational models to predict whether a sentence contains aspects of meaning that are contextually unspecified and thus require clarification. Two teams participated and the best scoring system achieved an accuracy of 68%.

show abstract

“…several times alredy i found my self driving in the middle of the crossing in red light luckily at the moment no fines. hehehe :) pykester bank (PTB) (Marcus et al, 1994), and the Englishlanguage parser of Mrini et al (2020), which is the state of the art on the parse trees of the PTB.…”

Section: Setupmentioning

confidence: 99%

Recursive Tree-Structured Self-Attention for Answer Sentence Selection

Mrini¹,

Farcas²,

Nakashole³

2021

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer

Self Cite

View full text Add to dashboard Cite

Syntactic structure is an important component of natural language text. Recent topperforming models in Answer Sentence Selection (AS2) use self-attention and transfer learning, but not syntactic structure. Tree structures have shown strong performance in tasks with sentence pair input like semantic relatedness. We investigate whether tree structures can boost performance in AS2. We introduce the Tree Aggregation Transformer: a novel recursive, tree-structured self-attention model for AS2. The recursive nature of our model is able to represent all levels of syntactic parse trees with only one additional self-attention layer. Without transfer learning, we establish a new state of the art on the popular TrecQA and WikiQA benchmark datasets. Additionally, we evaluate our method on four Community Question Answering datasets, and find that tree-structured representations have limitations with noisy user-generated text. We conduct probing experiments to evaluate how our models leverage tree structures across datasets. Our findings show that the ability of treestructured models to successfully absorb syntactic information is strongly correlated with a higher performance in AS2.

show abstract

Rethinking Self-Attention: Towards Interpretability in Neural Parsing

Cited by 70 publications

References 35 publications

Automated Concatenation of Embeddings for Structured Prediction

Automated Concatenation of Embeddings for Structured Prediction

UnImplicit Shared Task Report: Detecting Clarification Requirements in Instructional Text

Recursive Tree-Structured Self-Attention for Answer Sentence Selection

Contact Info

Product

Resources

About