Change search
ReferencesLink to record
Permanent link

Direct link
Improving machine translation quality prediction with syntactic tree kernels
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology. (Datorlingvistik)
2011 (English)In: Proceedings of the 15th conference of the European Association for Machine Translation (EAMT 2011) / [ed] Mikel L. Forcada, Heidi Depraetere, Vincent Vandeghinste, European Association for Machine Translation (EAMT), 2011, 233-240 p.Conference paper (Refereed)
Abstract [en]

We investigate the problem of predicting the quality of a given Machine Translation (MT) output segment as a binary classification task. In a study with four different data sets in two text genres and two language pairs, we show that the performance of a Support Vector Machine (SVM) classifier can be improved by extending the feature set with implicitly defined syntactic features in the form of tree kernels over syntactic parse trees. Moreover, we demonstrate that syntax tree kernels achieve surprisingly high performance levels even without additional features, which makes them suitable as a low-effort initial building block for an MT quality estimation system.

Place, publisher, year, edition, pages
European Association for Machine Translation (EAMT), 2011. 233-240 p.
Keyword [en]
MT quality prediction, Tree kernels
National Category
Language Technology (Computational Linguistics)
Research subject
Computational Linguistics
URN: urn:nbn:se:uu:diva-162883OAI: diva2:462153
EAMT 2011, Leuven, Belgium, May 30, 2011 - May 31, 2011
Available from: 2011-12-06 Created: 2011-12-05 Last updated: 2013-09-19Bibliographically approved

Open Access in DiVA

EAMT2011(632 kB)2965 downloads
File information
File name FULLTEXT01.pdfFile size 632 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Conference website

Search in DiVA

By author/editor
Hardmeier, Christian
By organisation
Department of Linguistics and Philology
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 2965 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 212 hits
ReferencesLink to record
Permanent link

Direct link