Change search
ReferencesLink to record
Permanent link

Direct link
Token and Type Constraints for Cross-Lingual Part-of-Speech Tagging
Show others and affiliations
Number of Authors: 5
2013 (English)In: Transactions of the Association for Computational Linguistics, Vol. 1, 1-12 p.Article in journal (Refereed) Published
Abstract [en]

We consider the construction of part-of-speech taggers for resource-poor languages. Recently, manually constructed tag dictionaries from Wiktionary and dictionaries projected via bitext have been used as type constraints to overcome the scarcity of annotated data in this setting. In this paper, we show that additional token constraints can be projected from a resource-rich source language to a resource-poor target language via word-aligned bitext. We present several models to this end; in particular a partially observed conditional random field model, where coupled token and type constraints provide a partial signal for training. Averaged across eight previously studied Indo-European languages, our model achieves a 25% relative error reduction over the prior state of the art. We further present successful results on seven additional languages from different families, empirically demonstrating the applicability of coupled token and type constraints across a diverse set of languages.

Place, publisher, year, edition, pages
Association for Computational Linguistics , 2013, 8. Vol. 1, 1-12 p.
National Category
Computer and Information Science
URN: urn:nbn:se:ri:diva-15351OAI: diva2:1036668
Available from: 2016-10-13 Created: 2016-10-13

Open Access in DiVA

fulltext(317 kB)3 downloads
File information
File name FULLTEXT01.pdfFile size 317 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links


Search in DiVA

By author/editor
Das, DipanjanPetrov, Slav
Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 3 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

ReferencesLink to record
Permanent link

Direct link