Improving Machine Translation Quality Prediction with Syntactic Tree Kernels

Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

Abstract

We investigate the problem of predicting the quality of a given Machine Translation (MT) output segment as a binary classification task. In a study with four different data sets in two text genres and two language pairs, we show that the performance of a Support Vector Machine (SVM) classifier can be improved by extending the feature set with implicitly defined syntactic features in the form of tree kernels over syntactic parse trees. Moreover, we demonstrate that syntax tree kernels achieve surprisingly high performance levels even without additional features, which makes them suitable as a low-effort initial building block for an MT quality estimation system.
Original languageEnglish
Title of host publicationProceedings of the 15th International Conference of the European Association for Machine Translation
Publication date31 May 2011
Publication statusPublished - 31 May 2011
Externally publishedYes

Fingerprint

Dive into the research topics of 'Improving Machine Translation Quality Prediction with Syntactic Tree Kernels'. Together they form a unique fingerprint.

Cite this