Findings of the Association for Computational Linguistics: EMNLP 2020
DOI: 10.18653/v1/2020.findings-emnlp.89

Transition-based Parsing with Stack-Transformers

Abstract: Modeling the parser state is key to good performance in transition-based parsing. Recurrent Neural Networks considerably improved the performance of transition-based systems by modelling the global state, e.g. stack-LSTM parsers, or local state modeling of contextualized features, e.g. Bi-LSTM parsers. Given the success of Transformer architectures in recent parsing systems, this work explores modifications of the sequence-to-sequence Transformer architecture to model either global or local parser states in tr…
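A minimal sketch of the general idea the abstract describes, assuming the parser state is exposed to the model by restricting which source positions individual attention heads may attend to (one head for the stack, one for the buffer). This is an illustration only, not the authors' implementation; all names (e.g. build_head_masks) are hypothetical.

```python
# Illustrative sketch only: mask attention so that one head sees only source
# positions currently on the parser's stack and another only those in the buffer.
import torch

def build_head_masks(num_src_tokens, stack_positions, buffer_positions):
    """Boolean masks of shape [2, num_src_tokens]: row 0 for a 'stack' head,
    row 1 for a 'buffer' head."""
    masks = torch.zeros(2, num_src_tokens, dtype=torch.bool)
    masks[0, stack_positions] = True
    masks[1, buffer_positions] = True
    return masks

# Example parser state over 6 source tokens: positions 0 and 2 are on the stack,
# positions 3-5 are still in the buffer.
masks = build_head_masks(6, stack_positions=[0, 2], buffer_positions=[3, 4, 5])

# Convert to an additive bias for scaled dot-product attention:
# 0 where attention is allowed, -inf where it is blocked.
attn_bias = torch.zeros(masks.shape).masked_fill(~masks, float("-inf"))
print(attn_bias)
```

The masks would be recomputed as the transition system pushes and pops tokens, so the dedicated heads track the evolving parser state.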

Cited by 32 publications (25 citation statements). References 24 publications.
“…Then, the alignments are "tuned" with a parser oracle to select the candidates that correspond to the oracle parse that is most similar to the gold AMR. Some AMR parsers (Naseem et al., 2019; Fernandez Astudillo et al., 2020) […]”
[Figure 1 caption from the citing paper: AMR and alignments for the sentence "Most of the students want to visit New York when they graduate." Alignments are differentiated by colors: blue (subgraphs), green (duplicate subgraphs), and orange (relations).]
Section: Related Work (mentioning; confidence: 99%)
“…Given an input sentence S, we first use a pretrained transformer-based AMR parser (Fernandez Astudillo et al., 2020) to obtain the AMR graph for S. We then use RoBERTa to encode each sentence to identify entity mentions and event triggers as candidate nodes. After that, we map each candidate node to AMR nodes and enforce message passing using a GAT-based semantic graph aggregator to capture global inter-dependency between candidate nodes.…”
Section: Our Approach (mentioning; confidence: 99%)
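The excerpt does not spell out how candidate nodes are mapped onto AMR nodes. A hedged sketch of one plausible mapping, assuming the parser returns a (start, end) token-span alignment for every AMR node; function and variable names are illustrative, not the cited paper's code.

```python
# Hedged sketch: map candidate spans (entity mentions, event triggers) to AMR
# nodes by token-span overlap, assuming inclusive (start, end) alignments.
def map_candidates_to_amr(candidate_spans, node_alignments):
    """candidate_spans: list of (start, end) token indices, inclusive.
    node_alignments: dict mapping AMR node id -> (start, end) aligned span.
    Returns: dict mapping each candidate span to the overlapping AMR node ids."""
    mapping = {}
    for c_start, c_end in candidate_spans:
        mapping[(c_start, c_end)] = [
            node_id
            for node_id, (n_start, n_end) in node_alignments.items()
            if n_start <= c_end and c_start <= n_end  # spans overlap
        ]
    return mapping

# Example: the trigger "visit" at token 6 maps to the node aligned to that token.
print(map_candidates_to_amr([(6, 6)], {"visit-01": (6, 6), "person": (3, 4)}))
```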
“…We employ a transformer-based AMR parser (Fernandez Astudillo et al., 2020) pre-trained on AMR 3.0 annotations to generate an AMR graph G^a = (V^a, E^a) with an alignment between AMR nodes and word spans in an input sentence S. Each node v^a_i = (m^a_i, n^a_i) ∈ V^a represents an AMR concept or predicate, and we use m^a_i and n^a_i to denote the starting and ending indices of such a node in the original sentence. For AMR edges, we use e^a_{i,j} to denote the specific relation type between nodes v^a_i and v^a_j in AMR annotations.…”
Section: AMR Parsing (mentioning; confidence: 99%)
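A minimal data-structure sketch of the aligned graph G^a = (V^a, E^a) described in this excerpt, assuming inclusive token indices; class and field names are hypothetical.

```python
# Each node carries its concept label and its aligned span (m^a_i, n^a_i);
# edges map a node-index pair (i, j) to the AMR relation type e^a_{i,j}.
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

@dataclass
class AMRNode:
    concept: str  # AMR concept or predicate label, e.g. "want-01"
    start: int    # m^a_i: first aligned token index
    end: int      # n^a_i: last aligned token index

@dataclass
class AMRGraph:
    nodes: List[AMRNode] = field(default_factory=list)               # V^a
    edges: Dict[Tuple[int, int], str] = field(default_factory=dict)  # (i, j) -> relation

g = AMRGraph(
    nodes=[AMRNode("want-01", 4, 4), AMRNode("visit-01", 6, 6)],
    edges={(0, 1): ":ARG1"},  # want-01 :ARG1 visit-01
)
print(g.edges[(0, 1)])
```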
“…Finally, our approach relates to other works that propose ways of incorporating structural information into Transformer-based models. This includes the use of dependency or tree structure for constraining self-attention patterns (Strubell et al., 2018; Wang et al., 2019), guiding cross-attention (Chen et al., 2018; Astudillo et al., 2020), modelling syntactic distance (Du et al., 2020), using syntactic information to guide the computation flow in the model (Shen et al., 2021), or through knowledge distillation (Kuncoro et al., 2020). Our structured masking in parsing-as-language-modeling approach is close in spirit to methods that modify the attention mechanism according to syntactic connections (Astudillo et al., 2020); this work, however, primarily aims to study the impact of structural guidance on syntactic generalization.…”
Section: Related Work (mentioning; confidence: 99%)