Machine translation evaluation with neural networks

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Sep 2017Embargo end date: 01 Jan 2017 English Publisher:Elsevier BVJournal:Computer Speech & Language, volume 45, pages 180-200 (issn: 0885-2308,

Copyright policy )

Authors: Francisco Guzmán; Shafiq R. Joty; Lluís Màrquez; Preslav Nakov;

doi: 10.1016/j.csl.2016.12.005 , 10.48550/arxiv.1710.02095

arXiv: 1710.02095

Machine translation evaluation with neural networks

- Summary
- Subjects
- Metrics

Abstract

We present a framework for machine translation evaluation using neural networks in a pairwise setting, where the goal is to select the better translation from a pair of hypotheses, given the reference translation. In this framework, lexical, syntactic and semantic information from the reference and the two hypotheses is embedded into compact distributed vector representations, and fed into a multi-layer neural network that models nonlinear interactions between each of the hypotheses and the reference, as well as between the two hypotheses. We experiment with the benchmark datasets from the WMT Metrics shared task, on which we obtain the best results published so far, with the basic network configuration. We also perform a series of experiments to analyze and understand the contribution of the different components of the network. We evaluate variants and extensions, including fine-tuning of the semantic embeddings, and sentence-based representations modeled with convolutional and recurrent neural networks. In summary, the proposed framework is flexible and generalizable, allows for efficient learning and scoring, and provides an MT evaluation metric that correlates with human judgments, and is on par with the state of the art.

Machine Translation, Reference-based MT Evaluation, Deep Neural Networks, Distributed Representation of Texts, Textual Similarity

Related Organizations

Hamad bin Khalifa University
Qatar
Qatar Computing Research Institute
Qatar
Qatar Foundation
Qatar

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, I.2.7, 68T50, Computation and Language (cs.CL)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	19
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

19

Top 10%

Green

bronze

Fields of Science (4) View all

natural sciences

Fields of Science

natural sciences

View all