Filters
Year range
Source (7)
- Publication . Conference object . Other literature type . Article . Contribution for newspaper or weekly magazine . Preprint . 2018Open AccessAuthors:Rachel Bawden; Rico Sennrich; Alexandra Birch; Barry Haddow;Rachel Bawden; Rico Sennrich; Alexandra Birch; Barry Haddow;Publisher: Association for Computational LinguisticsCountries: Switzerland, United Kingdom, FranceProject: SNSF | Dating structural fabric ... (105212), EC | SUMMA (688139), SNSF | Rich Context in Neural Ma... (169888), EC | HimL (644402), EC | TraMOOC (644333)
For machine translation to tackle discourse phenomena, models must have access to extra-sentential linguistic context. There has been recent interest in modelling context in neural machine translation (NMT), but models have been principally evaluated with standard automatic metrics, poorly adapted to evaluating discourse phenomena. In this article, we present hand-crafted, discourse test sets, designed to test the models' ability to exploit previous source and target sentences. We investigate the performance of recently proposed multi-encoder NMT models trained on subtitles for English to French. We also explore a novel way of exploiting context from the previous sentence. Despite gains using BLEU, multi-encoder models give limited improvement in the handling of discourse phenomena: 50% accuracy on our coreference test set and 53.5% for coherence/cohesion (compared to a non-contextual baseline of 50%). A simple strategy of decoding the concatenation of the previous and current sentence leads to good performance, and our novel strategy of multi-encoding and decoding of two sentences leads to the best performance (72.5% for coreference and 57% for coherence/cohesion), highlighting the importance of target-side context. Final version of paper to appear in Proceedings of NAACL 2018
Substantial popularitySubstantial popularity In top 1%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.
1 Research products, page 1 of 1
Loading
- Publication . Conference object . Other literature type . Article . Contribution for newspaper or weekly magazine . Preprint . 2018Open AccessAuthors:Rachel Bawden; Rico Sennrich; Alexandra Birch; Barry Haddow;Rachel Bawden; Rico Sennrich; Alexandra Birch; Barry Haddow;Publisher: Association for Computational LinguisticsCountries: Switzerland, United Kingdom, FranceProject: SNSF | Dating structural fabric ... (105212), EC | SUMMA (688139), SNSF | Rich Context in Neural Ma... (169888), EC | HimL (644402), EC | TraMOOC (644333)
For machine translation to tackle discourse phenomena, models must have access to extra-sentential linguistic context. There has been recent interest in modelling context in neural machine translation (NMT), but models have been principally evaluated with standard automatic metrics, poorly adapted to evaluating discourse phenomena. In this article, we present hand-crafted, discourse test sets, designed to test the models' ability to exploit previous source and target sentences. We investigate the performance of recently proposed multi-encoder NMT models trained on subtitles for English to French. We also explore a novel way of exploiting context from the previous sentence. Despite gains using BLEU, multi-encoder models give limited improvement in the handling of discourse phenomena: 50% accuracy on our coreference test set and 53.5% for coherence/cohesion (compared to a non-contextual baseline of 50%). A simple strategy of decoding the concatenation of the previous and current sentence leads to good performance, and our novel strategy of multi-encoding and decoding of two sentences leads to the best performance (72.5% for coreference and 57% for coherence/cohesion), highlighting the importance of target-side context. Final version of paper to appear in Proceedings of NAACL 2018
Substantial popularitySubstantial popularity In top 1%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.