Capturing Argument Interaction in Semantic Role Labeling with Capsule Networks
| Authors | |
|---|---|
| Publication date | 2019 |
| Host editors |
|
| Book title | 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing |
| Book subtitle | EMNLP-IJCNLP 2019 : proceedings of the conference : November 3-7, 2019, Hong Kong, China |
| ISBN (electronic) |
|
| Event | 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing |
| Pages (from-to) | 5415–5425 |
| Publisher | Stroudsburg, PA: The Association for Computational Linguistics |
| Organisations |
|
| Abstract |
Semantic role labeling (SRL) involves extracting propositions (i.e.
predicates and their typed arguments) from natural language sentences.
State-of-the-art SRL models rely on powerful encoders (e.g., LSTMs) and
do not model non-local interaction between arguments. We propose a new
approach to modeling these interactions while maintaining efficient
inference. Specifically, we use Capsule Networks (Sabour et al., 2017):
each proposition is encoded as a tuple of capsules, one capsule
per argument type (i.e. role). These tuples serve as embeddings of
entire propositions. In every network layer, the capsules interact with
each other and with representations of words in the sentence. Each
iteration results in updated proposition embeddings and updated
predictions about the SRL structure. Our model substantially outperforms
the non-refinement baseline model on all 7 CoNLL-2019 languages and
achieves state-of-the-art results on 5 languages (including English) for
dependency SRL. We analyze the types of mistakes corrected by the
refinement procedure. For example, each role is typically (but not
always) filled with at most one argument. Whereas enforcing this
approximate constraint is not useful with the modern SRL system,
iterative procedure corrects the mistakes by capturing this intuition in
a flexible and context-sensitive way.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.18653/v1/D19-1544 |
| Downloads |
D19-1544
(Final published version)
|
| Permalink to this page | |