Implementation och utvärdering av termlänkare i Java
Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Aligning parallell terms in a parallell corpus can be done by aligning all words and phrases in the corpus and then performing term extraction on the aligned set of word pairs. Alternatively, term extraction in the source and target text can be made separately and then the resulting term candidates can be aligned, forming aligned parallell terms. This thesis describes an implementation of a word aligner that is applied on extracted term candidates in both the source and the target texts. The term aligner uses statistical measures, the tool Giza++ and heuristics in the search for alignments. The evaluation reveals that the best results are obtained when the term alignment relies heavily on the Giza++ tool and Levenshtein heuristic.
Place, publisher, year, edition, pages
2013. , 24 p.
Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:liu:diva-92732ISRN: LIU-IDA/LITH-EX-G--13/005--SEOAI: oai:DiVA.org:liu-92732DiVA: diva2:621925
Subject / course
Computer and information science at the Institute of Technology
Merkel, Magnus, Associate Professor (docent)