Dynamic Data Selection for Neural Machine Translation

Open Access
Authors
  • M. van der Wees
  • A. Bisazza
  • C. Monz
Publication date 2017
Host editors
  • M. Palmer
  • R. Hwa
  • S. Riedel
Book title The Conference on Empirical Methods in Natural Language Processing
Book subtitle Proceedings of the conference: EMNLP 2017: September 9–11, 2017, Copenhagen, Denmark
ISBN (electronic)
  • 9781945626838
Event 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017
Pages (from-to) 1400–1410
Publisher Stroudsburg, PA: Association for Computational Linguistics
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Intelligent selection of training data has proven a successful technique to simultaneously increase training efficiency and translation performance for phrase-based machine translation (PBMT). With the recent increase in popularity of neural machine translation (NMT), we explore in this paper to what extent and how NMT can also benefit from data selection. While state-of-the-art data selection (Axelrod et al., 2011) consistently performs well for PBMT, we show that gains are substantially lower for NMT. Next, we introduce ‘dynamic data selection’ for NMT, a method in which we vary the selected subset of training data between different training epochs. Our experiments show that the best results are achieved when applying a technique we call ‘gradual fine-tuning’, with improvements up to +2.6 BLEU over the original data selection approach and up to +3.1 BLEU over a general baseline.
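The record carries only the abstract, but the selection procedure it describes can be sketched. The Python snippet below is a minimal illustration, not the authors' implementation: it assumes Moore-Lewis-style cross-entropy difference (CED) scoring (Axelrod et al., 2011) with toy unigram language models standing in for proper n-gram LMs, and a hypothetical halving schedule for the subset size per epoch. The names UnigramLM, ced_scores, and gradual_fine_tuning_subsets are invented here for illustration.

```python
import math
import random
from collections import Counter

class UnigramLM:
    """Add-one-smoothed unigram LM; a toy stand-in for the n-gram
    language models used in Moore-Lewis-style scoring (hypothetical helper)."""
    def __init__(self, sentences):
        self.counts = Counter(w for s in sentences for w in s.split())
        self.total = sum(self.counts.values())
        self.vocab = len(self.counts) + 1  # +1 for unseen words

    def cross_entropy(self, sentence):
        words = sentence.split()
        if not words:
            return 0.0
        log_p = sum(math.log((self.counts[w] + 1) / (self.total + self.vocab))
                    for w in words)
        return -log_p / len(words)

def ced_scores(corpus, in_domain_lm, general_lm):
    """Cross-entropy difference (Axelrod et al., 2011): lower = more in-domain."""
    return [in_domain_lm.cross_entropy(s) - general_lm.cross_entropy(s)
            for s in corpus]

def gradual_fine_tuning_subsets(corpus, scores, epochs=4,
                                start_frac=1.0, decay=0.5):
    """Yield one training subset per epoch: each epoch keeps a smaller,
    more in-domain top slice of the CED-ranked corpus.
    The halving schedule (decay=0.5) is an assumption, not the paper's exact setting."""
    # Rank sentences from most to least in-domain (ascending CED score).
    ranked = [s for _, s in sorted(zip(scores, corpus), key=lambda p: p[0])]
    frac = start_frac
    for epoch in range(epochs):
        k = max(1, int(round(frac * len(ranked))))
        subset = list(ranked[:k])
        random.shuffle(subset)  # reshuffle the retained slice for this epoch
        yield epoch, subset
        frac *= decay
```

In this reading, the shrinking subsets approximate a curriculum that starts from broadly general data and converges toward the most in-domain sentences, which is what distinguishes 'gradual fine-tuning' from selecting a single fixed subset before training.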
Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/D17-1147