Learning Algebraic Recombination for Compositional Generalization

Liu, Chenyao; An, Shengnan; Lin, Zeqi; Liu, Qian; Chen, Bei; Lou, Jian–Guang; Wen, Lijie; Zheng, Nanning; Zhang, Dongmei

doi:10.18653/v1/2021.findings-acl.97

“…For SCAN, NQG-T5 is one of several specialized models that achieves 100% accuracy across multiple splits (Chen et al, 2020;Nye et al, 2020;Herzig and Berant, 2021). For COGS, we show results from LeAR (Liu et al, 2021), the previously reported state-of-the-art on COGS. 18 We also report new results for NQG-T5 on COGS.…”

Section: Baselinesmentioning

confidence: 72%

“…When we use CSL to generate additional training data for T5 (T5+CSL-Aug.), the performance of T5 improves to nearly solving T5-3B. 18 We do not show LeAR results for SCAN and GeoQuery as Liu et al (2021) did not report results for SCAN and reported GeoQuery results using a different template split and a different evaluation metric.…”

Section: Resultsmentioning

confidence: 98%

Improving Compositional Generalization with Latent Structure and Data Augmentation

Qiu¹,

Shaw²,

Pasupat³

et al. 2022

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua

View full text Add to dashboard Cite

Generic unstructured neural networks have been shown to struggle on out-of-distribution compositional generalization. Compositional data augmentation via example recombination has transferred some prior knowledge about compositionality to such black-box neural models for several semantic parsing tasks, but this often required task-specific engineering or provided limited gains.We present a more powerful data recombination method using a model called Compositional Structure Learner (CSL). CSL is a generative model with a quasi-synchronous context-free grammar backbone, which we induce from the training data. We sample recombined examples from CSL and add them to the fine-tuning data of a pre-trained sequence-tosequence model (T5). This procedure effectively transfers most of CSL's compositional bias to T5 for diagnostic tasks, and results in a model even stronger than a T5-CSL ensemble on two real world compositional generalization tasks. This results in new state-ofthe-art performance for these challenging semantic parsing tasks requiring generalization to both natural language variation and novel compositions of elements. * Equal contribution. † Work done as part of the Google AI Residency program. 1 Also commonly referred to as elements or concepts.

show abstract

“…For SCAN, NQG-T5 Shaw et al ( 2021) is one of several specialized models that achieves 100% accuracy across multiple splits Nye et al, 2020;. We also report new results for NQG-T5 on COGS, and show results from LeAR (Liu et al, 2021), the previously reported state-of-the-art on COGS. For these synthetic datasets, the induced grammars have high coverage, making the CSL model highly effective for data augmentation.…”

Section: Resultsmentioning

confidence: 51%

Improving Compositional Generalization with Latent Structure and Data Augmentation

Qiu¹,

Shaw²,

Pasupat³

et al. 2021

Preprint

View full text Add to dashboard Cite

Generic unstructured neural networks have been shown to struggle on out-of-distribution compositional generalization. Compositional data augmentation via example recombination has transferred some prior knowledge about compositionality to such black-box neural models for several semantic parsing tasks, but this often required task-specific engineering or provided limited gains.We present a more powerful data recombination method using a model called Compositional Structure Learner (CSL). CSL is a generative model with a quasi-synchronous context-free grammar backbone, which we induce from the training data. We sample recombined examples from CSL and add them to the fine-tuning data of a pre-trained sequence-tosequence model (T5). This procedure effectively transfers most of CSL's compositional bias to T5 for diagnostic tasks, and results in a model even stronger than a T5-CSL ensemble on two real world compositional generalization tasks. This results in new state-ofthe-art performance for these challenging semantic parsing tasks requiring generalization to both natural language variation and novel compositions of elements. * Equal contribution. † Work done as part of the Google AI Residency program. 1 Also commonly referred to as elements or concepts.

show abstract

“…On structural generalization in particular, the accuracy of all these models is below 10%, with the exception of Zheng and Lapata (2021), who achieve 39% on PP recursion. By contrast, the compositional model of Liu et al (2021) and the model of Qiu et al (2022), which uses compositional data augmentation, achieve accuracies upwards of 98% on the full generalization set.…”

Section: Compositional Generalization In Cogsmentioning

confidence: 94%

“…For instance, Shaw et al (2021) describe a synchronous grammar induction approach that achieves perfect accuracy on SCAN (Lake and Baroni, 2018), but has very low accuracy on corpora of naturally occurring text such as GeoQuery (Zelle and Mooney, 1996) and Spider (Yu et al, 2018). Similarly, the compositional LeAR parser (Liu et al, 2021) solves COGS with near-perfect accuracy and performs very well on other synthetic datasets, but has not been evaluated on corpora of naturally occurring text. This points to a fundamental tension between broad-coverage semantic parsing on natural text and the ability to generalize compositionally from structurally limited synthetic training sets (see also Shaw et al, 2021).…”

Section: Introductionmentioning

confidence: 99%

Compositional generalization with a broad-coverage semantic parser

Weißenhorn¹,

Donatelli²,

Koller³

2022

Proceedings of the 11th Joint Conference on Lexical and Computational Semantics

View full text Add to dashboard Cite

We show how the AM parser, a compositional semantic parser (Groschwitz et al., 2018), can solve compositional generalization on the COGS dataset. It is the first semantic parser that achieves high accuracy on both naturally occurring language and the synthetic COGS dataset. We discuss implications for corpus and model design for learning human-like generalization. Our results suggest that compositional generalization can be best achieved by building compositionality into semantic parsers.

show abstract

Learning Algebraic Recombination for Compositional Generalization

Cited by 19 publications

References 27 publications

Improving Compositional Generalization with Latent Structure and Data Augmentation

Improving Compositional Generalization with Latent Structure and Data Augmentation

Improving Compositional Generalization with Latent Structure and Data Augmentation

Compositional generalization with a broad-coverage semantic parser

Contact Info

Product

Resources

About