Found an issue? Give us feedback

https://dx.doi.org/1...arrow_drop_down

https://dx.doi.org/10.18720/sp...

Other literature type . 2020

Data sources: Datacite

descriptionPublicationkeyboard_double_arrow_right Other literature type 01 Jan 2020 Russian Publisher:Ð¡Ð°Ð½ÐºÑ-ÐÐµÑÐµÑÐ±ÑÑÐ³ÑÐºÐ¸Ð¹ Ð¿Ð¾Ð»Ð¸ÑÐµÑÐ½Ð¸ÑÐµÑÐºÐ¸Ð¹ ÑÐ½Ð¸Ð²ÐµÑÑÐ¸ÑÐµÑ ÐÐµÑÑÐ° ÐÐµÐ»Ð¸ÐºÐ¾Ð³Ð¾

doi: 10.18720/spbpu/3/2020/vr/vr20-1432

- Summary
- Subjects
- Metrics

Abstract

In the context of an increasingly networked world, the availability of highquality translations is critical for success in the context of the growing international competition. Massive worldwide companies as well as medium sized companies are required to provide well translated, high quality technical documentation for their customers not only to be successful in the market but also to meet legal regulations and to avoid lawsuits. Therefore, this thesis focuses on the evaluation of translation quality, specifically regarding technical documents, and answers two central questions: How can the translation quality of technical documents be calculated, given the original document is available? How can the translation quality of technical documents be assessed, given the original document is not available? These questions are answered using state-of-the-art machine learning algorithms and translation evaluation metrics in the context of a knowledge discovery process. The evaluations are done on a sentence level and recombined on a document level by binarily categorizing sentences as computerized translation and specialized translation. The research is based on a database including 22,327 sentences and 32 translation evaluation attributes, which are used for optimizations of five different machine learning approaches. An optimization method consisting of 795,000 evaluations shows a calculation accuracy of up to 72.24% for the binary classification. Based on the established sentence-based classification systems, documents are classified using recombination of the affiliated sentences and a background for rating document quality is established. Therefore, the taken approach absolutely creates a Ñategorization and assessment approach.

Keywords

word error rates, Ð¼Ð°ÑÐ¸Ð½Ð½ÑÐ¹ Ð¿ÐµÑÐµÐ²Ð¾Ð´, Ð¸Ð·Ð²Ð»ÐµÑÐµÐ½Ð¸Ðµ Ð´Ð°Ð½Ð½ÑÑ, ÐºÐ¾ÑÑÑÐ¸ÑÐ¸ÐµÐ½Ñ Ð¾ÑÐ¸Ð±Ð¾Ðº Ð² ÑÐ»Ð¾Ð²Ð°Ñ, RapidMiner, data mining, Ð¸ÑÐºÑÑÑÑÐ²ÐµÐ½Ð½Ð°Ñ Ð½ÐµÐ¹ÑÐ¾Ð½Ð½Ð°Ñ ÑÐµÑÑ, artificial neural network, machine translation

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now

ÐžÑ†ÐµÐ½ÐºÐ° ÐºÐ°Ñ‡ÐµÑÑ‚Ð²Ð° Ñ‚ÐµÑ Ð½Ð¸Ñ‡ÐµÑÐºÐ¾Ð¹ Ð´Ð¾ÐºÑƒÐ¼ÐµÐ½Ñ‚Ð°Ñ†Ð¸Ð¸ Ñ Ð¿Ð¾Ð¼Ð¾Ñ‰ÑŒÑŽ Ð¼Ð°ÑˆÐ¸Ð½Ð½Ð¾Ð³Ð¾ Ð¾Ð±ÑƒÑ‡ÐµÐ½Ð¸Ñ

ÐžÑ†ÐµÐ½ÐºÐ° ÐºÐ°Ñ‡ÐµÑÑ‚Ð²Ð° Ñ‚ÐµÑ Ð½Ð¸Ñ‡ÐµÑÐºÐ¾Ð¹ Ð´Ð¾ÐºÑƒÐ¼ÐµÐ½Ñ‚Ð°Ñ†Ð¸Ð¸ Ñ Ð¿Ð¾Ð¼Ð¾Ñ‰ÑŒÑŽ Ð¼Ð°ÑˆÐ¸Ð½Ð½Ð¾Ð³Ð¾ Ð¾Ð±ÑƒÑ‡ÐµÐ½Ð¸Ñ

ÐžÑ†ÐµÐ½ÐºÐ° ÐºÐ°Ñ‡ÐµÑÑ‚Ð²Ð° Ñ‚ÐµÑ Ð½Ð¸Ñ‡ÐµÑÐºÐ¾Ð¹ Ð´Ð¾ÐºÑƒÐ¼ÐµÐ½Ñ‚Ð°Ñ†Ð¸Ð¸ Ñ Ð¿Ð¾Ð¼Ð¾Ñ‰ÑŒÑŽ Ð¼Ð°ÑˆÐ¸Ð½Ð½Ð¾Ð³Ð¾ Ð¾Ð±ÑƒÑ‡ÐµÐ½Ð¸Ñ

ÐžÑ†ÐµÐ½ÐºÐ° ÐºÐ°Ñ‡ÐµÑÑ‚Ð²Ð° Ñ‚ÐµÑ Ð½Ð¸Ñ‡ÐµÑÐºÐ¾Ð¹ Ð´Ð¾ÐºÑƒÐ¼ÐµÐ½Ñ‚Ð°Ñ†Ð¸Ð¸ Ñ Ð¿Ð¾Ð¼Ð¾Ñ‰ÑŒÑŽ Ð¼Ð°ÑˆÐ¸Ð½Ð½Ð¾Ð³Ð¾ Ð¾Ð±ÑƒÑ‡ÐµÐ½Ð¸Ñ