The LIG system for the English-Czech Text Translation Task of IWSLT 2019

descriptionPublicationkeyboard_double_arrow_right Article , Conference object , Preprint , Other literature type 01 Jan 2019Embargo end date: 01 Jan 2019Publisher:arXivJournal:CoRR, volume abs/1911.02898

Authors: Loïc Vial; Benjamin Lecouteux; Didier Schwab; Hang Le 0001; Laurent Besacier;

doi: 10.48550/arxiv.1911.02898 , 10.5281/zenodo.3525528 , 10.5281/zenodo.3525529

arXiv: 1911.02898

The LIG system for the English-Czech Text Translation Task of IWSLT 2019

- Summary
- Subjects
- Metrics

Abstract

In this paper, we present our submission for the English to Czech Text Translation Task of IWSLT 2019. Our system aims to study how pre-trained language models, used as input embeddings, can improve a specialized machine translation system trained on few data. Therefore, we implemented a Transformer-based encoder-decoder neural system which is able to use the output of a pre-trained language model as input embeddings, and we compared its performance under three configurations: 1) without any pre-trained language model (constrained), 2) using a language model trained on the monolingual parts of the allowed English-Czech data (constrained), and 3) using a language model trained on a large quantity of external monolingual data (unconstrained). We used BERT as external pre-trained language model (configuration 3), and BERT architecture for training our own language model (configuration 2). Regarding the training data, we trained our MT system on a small quantity of parallel text: one set only consists of the provided MuST-C corpus, and the other set consists of the MuST-C corpus and the News Commentary corpus from WMT. We observed that using the external pre-trained BERT improves the scores of our system by +0.8 to +1.5 of BLEU on our development set, and +0.97 to +1.94 of BLEU on the test set. However, using our own language model trained only on the allowed parallel data seems to improve the machine translation performances only when the system is trained on the smallest dataset.

IWSLT 2019

Related Organizations

University of Grenoble
France
French National Centre for Scientific Research
France
Grenoble Alpes University
France

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average