descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Oct 2020Embargo end date: 01 Jan 2020 English Publisher:Elsevier BVJournal:Knowledge-Based Systems, volume 206, page 106,401 (issn: 0950-7051,

Authors: Artetxe Zurutuza, Mikel; Labaka Intxauspe, Gorka; Casas, Noe; Agirre Bengoa, Eneko;

doi: 10.1016/j.knosys.2020.106401 , 10.48550/arxiv.2002.12867

arXiv: http://arxiv.org/abs/2002.12867

handle: 10810/70234

Do all roads lead to Rome? Understanding the role of initialization in iterative back-translation

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Back-translation provides a simple yet effective approach to exploit monolingual corpora in Neural Machine Translation (NMT). Its iterative variant, where two opposite NMT models are jointly trained by alternately using a synthetic parallel corpus generated by the reverse model, plays a central role in unsupervised machine translation. In order to start producing sound translations and provide a meaningful training signal to each other, existing approaches rely on either a separate machine translation system to warm up the iterative procedure, or some form of pre-training to initialize the weights of the model. In this paper, we analyze the role that such initialization plays in iterative back-translation. Is the behavior of the final system heavily dependent on it? Or does iterative back-translation converge to a similar solution given any reasonable initialization? Through a series of empirical experiments over a diverse set of warmup systems, we show that, although the quality of the initial system does affect final performance, its effect is relatively small, as iterative back-translation has a strong tendency to convergence to a similar solution. As such, the margin of improvement left for the initialization method is narrow, suggesting that future research should focus more on improving the iterative mechanism itself.

Related Organizations

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Computation and Language (cs.CL), Machine Learning (cs.LG)

1 Research products, page 1 of 1

fairseq software on GitHub
IsRelatedTo

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Top 10%

Average

Green

bronze

Fields of Science (4) View all

natural sciences

Fields of Science

natural sciences

View all

Do all roads lead to Rome? Understanding the role of initialization in iterative back-translation

Do all roads lead to Rome? Understanding the role of initialization in iterative back-translation

1 Research products, page 1 of 1

fairseq software on GitHub