Using Linguistic Annotations in Statistical Machine Translation of Film Subtitles
2009 (English). In: Proceedings of the 17th Nordic Conference of Computational Linguistics NODALIDA 2009 / [ed] Kristiina Jokinen and Eckhard Bick, Tartu: Tartu University Library, 2009, 57-64 p. Conference paper (Refereed)
Statistical Machine Translation (SMT) has been successfully employed to support translation of film subtitles. We explore the integration of Constraint Grammar corpus annotations into a Swedish–Danish subtitle SMT system in the framework of factored SMT. While the usefulness of the annotations is limited with large amounts of parallel data, we show that linguistic annotations can increase the gains in translation quality when monolingual data in the target language is added to an SMT system based on a small parallel corpus.
Place, publisher, year, edition, pages
Tartu: Tartu University Library, 2009. 57-64 p.
NEALT Proceedings Series, ISSN 1736-6305 ; 4
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:uu:diva-154484
OAI: oai:DiVA.org:uu-154484
DiVA: diva2:420752
NODALIDA 2009, May 14-16, 2009, University of Southern Denmark in Odense, Denmark