The Rumble in the Disambiguation Jungle: Towards the comparison of a traditional word sense disambiguation system with a novel paraphrasing system
Independent thesis Basic level (degree of Bachelor), 15 credits / 22,5 HE creditsStudent thesis
Word sense disambiguation (WSD) is the process of computationally identifying and labeling poly- semous words in context with their correct meaning, known as a sense. WSD is riddled with various obstacles that must be overcome in order to reach its full potential. One of these problems is the aspect of the representation of word meaning. Traditional WSD algorithms make the assumption that a word in a given context has only one meaning and therfore can return only one discrete sense. On the other hand, a novel approach is that a given word can have multiple senses. Studies on graded word sense assignment (Erk et al., 2009) as well as in cognitive science (Hampton, 2007; Murphy, 2002) support this theory. It has therefore been adopted in a novel, paraphrasing system which performs word sense disambiguation by returning a probability distribution over potential paraphrases (in this case synonyms) of a given word. However, it is unknown how well this type of algorithm fares against the traditional one. The current study thus examines if and how it is possible to make a comparison of the two. A method of comparison is evaluated and subsequently rejected. Reasons for this as well as suggestions for a fair and accurate comparison are presented.
Place, publisher, year, edition, pages
2011. , 27 p.
Word Sense Disambiguation, WSD, paraphrasing systems, semantics, word senses, representing word meaning
Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:su:diva-61642OAI: oai:DiVA.org:su-61642DiVA: diva2:436727
2011-06-01, C307, Universitetsvägen 10C, 106 91 Stockholm, 10:35 (English)
Erk, Katrin, ProfessorÖstling, Robert
Wirén, Mats, Professor