What does Attention in Neural Machine Translation Pay Attention to?

Authors
Publication date 2017
Host editors
  • G. Kondrak
  • T. Watanabe
Book title The Eighth International Joint Conference on Natural Language Processing
Book subtitle proceedings of the Conference : November 27-December 1, 2017, Taipei, Taiwan
ISBN
  • 9781948087001
Event The 8th International Joint Conference on Natural Language Processing
Volume | Issue number 1
Pages (from-to) 30-39
Publisher Asian Federation of Natural Language Processing
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Attention in neural machine translation provides the possibility to encode relevant parts of the source sentence at each translation step. As a result, attention is also considered to be an alignment model. However, there is no work that specifically studies attention and analyses what attention models actually learn. Thus, the question remains how attention is similar to, or different from, traditional alignment. In this paper, we provide a detailed analysis of attention and compare it to traditional alignment. We answer the question of whether attention is only capable of modelling translational equivalence or whether it captures more information. We show that attention differs from alignment in some cases and captures useful information beyond alignments.
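The abstract describes attention as a per-step distribution over source positions, which is why it is often read as a soft alignment. As a quick illustration only (not the paper's implementation), the sketch below computes additive, Bahdanau-style attention weights for a single decoder step; all variable names (W_q, W_k, v, encoder_states, decoder_state) are illustrative assumptions.

```python
# Minimal sketch: additive attention weights over source encoder states,
# illustrating attention as a soft alignment. Not the paper's implementation.
import numpy as np

def attention_weights(decoder_state, encoder_states, W_q, W_k, v):
    """Return a probability distribution over source positions (soft alignment)."""
    # score_i = v^T tanh(W_q s_t + W_k h_i) for each source state h_i
    scores = np.array([
        v @ np.tanh(W_q @ decoder_state + W_k @ h) for h in encoder_states
    ])
    # Softmax turns the scores into attention weights that sum to 1.
    exp = np.exp(scores - scores.max())
    return exp / exp.sum()

# Toy usage: 5 source positions, hidden size 4 (random values for illustration).
rng = np.random.default_rng(0)
d = 4
encoder_states = rng.normal(size=(5, d))
decoder_state = rng.normal(size=d)
W_q, W_k, v = rng.normal(size=(d, d)), rng.normal(size=(d, d)), rng.normal(size=d)
weights = attention_weights(decoder_state, encoder_states, W_q, W_k, v)
print(weights, weights.sum())  # the most attended source position acts like an alignment point
```

In this reading, the argmax of the weight vector plays the role of an alignment link, while the full distribution can spread mass over other source words, which is the behaviour the paper analyses.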
Document type Conference contribution
Language English
Published at http://www.aclweb.org/anthology/I17-1004