Compositional Generalization without Trees using Multiset Tagging and Latent Permutations
| Authors |
|
|---|---|
| Publication date | 2023 |
| Host editors |
|
| Book title | The 61st Conference of the Association for Computational Linguistics |
| Book subtitle | ACL 2023 : Proceedings of the Conference : July 9-14, 2023 |
| ISBN |
|
| ISBN (electronic) |
|
| Event | 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 |
| Volume | Issue number | 1 |
| Pages (from-to) | 14488-14506 |
| Number of pages | 19 |
| Publisher | Association for Computational Linguistics |
| Organisations |
|
| Abstract |
Seq2seq models have been shown to struggle with compositional generalization in semantic parsing, i.e. generalizing to unseen compositions of phenomena that the model handles correctly in isolation. We phrase semantic parsing as a two-step process: we first tag each input token with a multiset of output tokens. Then we arrange the tokens into an output sequence using a new way of parameterizing and predicting permutations. We formulate predicting a permutation as solving a regularized linear program and we backpropagate through the solver. In contrast to prior work, our approach does not place a priori restrictions on possible permutations, making it very expressive. Our model outperforms pretrained seq2seq models and prior work on realistic semantic parsing tasks that require generalization to longer examples. We also outperform non-tree-based models on structural generalization on the COGS benchmark. For the first time, we show that a model without an inductive bias provided by trees achieves high accuracy on generalization to deeper recursion depth. |
| Document type | Conference contribution |
| Note | With supplementary video. |
| Language | English |
| Published at | https://doi.org/10.18653/v1/2023.acl-long.810 |
| Other links | https://www.scopus.com/pages/publications/85174403067 |
| Downloads |
2023.acl-long.810
(Final published version)
|
| Supplementary materials | |
| Permalink to this page | |