Implementation och utvärdering av termlänkare i Java [Implementation and evaluation of a term aligner in Java]
Linköping University, Department of Computer and Information Science, Human-Centered systems. Linköping University, The Institute of Technology.
2013 (Swedish). Independent thesis, Basic level (degree of Bachelor), 10 credits / 15 HE credits. Student thesis.
Abstract [en]

Aligning parallel terms in a parallel corpus can be done by aligning all words and phrases in the corpus and then performing term extraction on the aligned set of word pairs. Alternatively, term extraction can be performed separately on the source and target texts, and the resulting term candidates can then be aligned to form aligned parallel terms. This thesis describes an implementation of a word aligner that is applied to extracted term candidates in both the source and the target texts. The term aligner uses statistical measures, the tool Giza++, and heuristics in the search for alignments. The evaluation shows that the best results are obtained when the term alignment relies heavily on the Giza++ tool and the Levenshtein heuristic.
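
The abstract names a Levenshtein heuristic but does not describe how it is applied. As a rough illustration only, the sketch below assumes the heuristic compares source and target term candidates by normalized edit distance, so that cognates such as "algoritm" and "algorithm" can be paired. The class name, method names, and the 0.7 threshold are hypothetical and are not taken from the thesis; the Giza++ and statistical components are not shown.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

/** Hypothetical sketch: pairs term candidates by normalized Levenshtein similarity. */
final class LevenshteinTermLinker {

    /** Edit distance where insertion, deletion, and substitution each cost 1. */
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning the empty prefix of a into b[0..j) takes j insertions
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning a[0..i) into the empty string takes i deletions
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    /** Similarity in [0, 1]; 1.0 means identical after lower-casing. */
    static double similarity(String a, String b) {
        int maxLen = Math.max(a.length(), b.length());
        if (maxLen == 0) {
            return 1.0;
        }
        int d = distance(a.toLowerCase(Locale.ROOT), b.toLowerCase(Locale.ROOT));
        return 1.0 - (double) d / maxLen;
    }

    /**
     * For each source candidate, picks the most similar target candidate and keeps
     * the pair only if its similarity reaches the (assumed) threshold.
     */
    static List<String[]> link(List<String> source, List<String> target, double threshold) {
        List<String[]> pairs = new ArrayList<>();
        for (String s : source) {
            String best = null;
            double bestScore = -1.0;
            for (String t : target) {
                double score = similarity(s, t);
                if (score > bestScore) {
                    bestScore = score;
                    best = t;
                }
            }
            if (best != null && bestScore >= threshold) {
                pairs.add(new String[] {s, best});
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        List<String> swedish = Arrays.asList("databas", "algoritm", "skrivare");
        List<String> english = Arrays.asList("algorithm", "printer", "database");
        for (String[] pair : link(swedish, english, 0.7)) {
            System.out.println(pair[0] + " -> " + pair[1]);
        }
    }
}
```

A purely string-based heuristic like this can only find terms that are spelled similarly in both languages. Non-cognate translations such as "skrivare" and "printer" fall below the threshold, which is consistent with the abstract's point that the heuristic is combined with Giza++ alignments rather than used alone.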

Place, publisher, year, edition, pages
2013. 24 p.
Keyword [sv]
termlänkning, giza++
National Category
Language Technology (Computational Linguistics)
URN: urn:nbn:se:liu:diva-92732
ISRN: LIU-IDA/LITH-EX-G--13/005--SE
OAI: diva2:621925
Subject / course
Computer and information science at the Institute of Technology
Available from: 2013-05-20. Created: 2013-05-17. Last updated: 2013-05-20. Bibliographically approved.

Author: Axelsson, Robin
