Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Gold Standard for English-Swedish Word Alignment
Linköping University, Department of Computer and Information Science, NLPLAB - Natural Language Processing Laboratory. Linköping University, The Institute of Technology. (HCS)
Linköping University, Department of Computer and Information Science, NLPLAB - Natural Language Processing Laboratory. Linköping University, The Institute of Technology. (HCS)
2011 (English)In: Proceedings of the 18th Nordic Conference of Computational Linguistics NODALIDA 2011 / [ed] Bolette Sandford Pedersen, Gunta Nepore and Inguna Skadina, Tartu, Estland, 2011, p. 106-113Conference paper, Poster (with or without abstract) (Other academic)
Abstract [en]

Word alignment gold standards are an importantresource for developing and evaluatingword alignment methods. In thispaper we present a free English–Swedishword alignment gold standard consistingof texts from Europarl with manually verifiedword alignments. The gold standardcontains two sets of word aligned sentences,a test set for the purpose of evaluationand a training set that can be usedfor supervised training. The guidelinesused for English–Swedish alignment werecreated based on guidelines for other languagepairs and with statistical machinetranslation as the targeted application. Wealso present results of intrinsic evaluationusing our gold standard and discuss the relationshipto extrinsic evaluation in a statisticalmachine translation system.

Place, publisher, year, edition, pages
Tartu, Estland, 2011. p. 106-113
Series
NEALT Proceedings Series, ISSN 1736-6305 ; 11
Keywords [en]
Machine translation, Evaluation, Gold standard, Word alignment
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:liu:diva-80286OAI: oai:DiVA.org:liu-80286DiVA, id: diva2:546418
Conference
NODALIDA 2011: 18th Nordic Conference of Computational Linguistics, May 11-13 2011, Riga, Latvia
Projects
Multilingual extraction and term structuring
Funder
Swedish Research Council, 621-2008-4664Available from: 2012-08-23 Created: 2012-08-23 Last updated: 2012-08-30

Open Access in DiVA

No full text in DiVA

Other links

http://dspace.utlib.ee/dspace/handle/10062/16955

Search in DiVA

By author/editor
Holmqvist, MariaAhrenberg, Lars
By organisation
NLPLAB - Natural Language Processing LaboratoryThe Institute of Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 142 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf