Modelling Pronominal Anaphora in Statistical Machine Translation
2010 (English)In: Proceedings of the 7th International Workshop on Spoken Language Translation, Paris. / [ed] Marcello Federico et al., Paris, 2010, 283-289 p.Conference paper (Refereed)
Current Statistical Machine Translation (SMT) systems translate texts sentence by sentence without considering any cross-sentential context. Assuming independence between sentences makes it difficult to take certain translation decisions when the necessary information cannot be determined locally. We argue for the necessity to include cross-sentence dependencies in SMT. As a case in point, we study the problem of pronominal anaphora translation by manually evaluating German-English SMT output. We then present a word dependency model for SMT, which can represent links between word pairs in the same or in different sentences. We use this model to integrate the output of a coreference resol- ution system into English-German SMT with a view to improving the translation of anaphoric pronouns.
Place, publisher, year, edition, pages
Paris, 2010. 283-289 p.
Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:uu:diva-154490OAI: oai:DiVA.org:uu-154490DiVA: diva2:420761
IWSLT (International Workshop on Spoken Language Translation); Paris, France; December 2nd and 3rd, 2010.