Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Using Machine Learning Methods for Evaluating the Quality of Technical Documents
Linnaeus University, Faculty of Technology, Department of Computer Science.
Linnaeus University, Faculty of Technology, Department of Computer Science.
2016 (English)Independent thesis Advanced level (degree of Master (One Year)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

In the context of an increasingly networked world, the availability of high quality translations is critical for success in the context of the growing international competition. Large international companies as well as medium sized companies are required to provide well translated, high quality technical documentation for their customers not only to be successful in the market but also to meet legal regulations and to avoid lawsuits. Therefore, this thesis focuses on the evaluation of translation quality, specifically concerning technical documentation, and answers two central questions:

  • How can the translation quality of technical documents be evaluated, given the original document is available?
  • How can the translation quality of technical documents be evaluated, given the original document is not available?

These questions are answered using state-of-the-art machine learning algorithms and translation evaluation metrics in the context of a knowledge discovery process. The evaluations are done on a sentence level and recombined on a document level by binarily classifying sentences as automated translation and professional translation. The research is based on a database containing 22, 327 sentences and 32 translation evaluation attributes, which are used for optimizations of five different machine learning approaches. An optimization process consisting of 795, 000 evaluations shows a prediction accuracy of up to 72.24% for the binary classification. Based on the developed sentence-based classifi- cation systems, documents are classified using recombination of the affiliated sentences and a framework for rating document quality is introduced. Therefore, the taken approach successfully creates a classification and evaluation system.

Place, publisher, year, edition, pages
2016. , 95 p.
Keyword [en]
machine translation evaluation, machine learning
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:lnu:diva-52087OAI: oai:DiVA.org:lnu-52087DiVA: diva2:920202
Supervisors
Examiners
Available from: 2016-05-12 Created: 2016-04-14 Last updated: 2018-01-10Bibliographically approved

Open Access in DiVA

fulltext(2195 kB)189 downloads
File information
File name FULLTEXT01.pdfFile size 2195 kBChecksum SHA-512
3035ddb6864b56329b5f3af85748a586ff8aa0fdb30c68fa11fa63e4a11dc6be0d3438edefa0d3ac9f321e50b24c2fc6a91fa59433778c12c952fe93d2202ba1
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Luckert, MichaelSchaefer-Kehnert, Moritz
By organisation
Department of Computer Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 189 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 196 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf