Change search
ReferencesLink to record
Permanent link

Direct link
Experiments to investigate the connection between case distribution and topical relevance of search terms
Attityd.
Attityd.
Attityd.
Number of Authors: 3
2008 (English)In: Proceedings of the 8th international conference on Language Resources and Evaluation, LREC'08, 2008, 1, , 5 p.Conference paper (Refereed)
Abstract [en]

We have performed a set of experiments made to investigate the utility of morphological analysis to improve retrieval of documents written in languages with relatively large morphological variation in a practical commercial setting, using the SiteSeeker search system developed and marketed by Euroling AB. The objective of the experiments was to evaluate different lemmatisers and stemmers to determine which would be the most practical for the task at hand: highly interactive, relatively high precision web searches in commercial customer-oriented document collections. This paper gives an overview of some of the results for Finnish and German, and describes specifically one experiment designed to investigate the case distribution of nouns in a highly inflectional language (Finnish) and the topicality of the nouns in target texts. We find that topical nouns taken from queries are distributed differently over relevant and non-relevant documents depending on their grammatical case.

Place, publisher, year, edition, pages
2008, 1. , 5 p.
National Category
Computer and Information Science
Identifiers
URN: urn:nbn:se:ri:diva-22463OAI: oai:DiVA.org:ri-22463DiVA: diva2:1042028
Conference
8th international conference on Language Resources and Evaluation, LREC'08
Projects
Attityd
Available from: 2016-10-31 Created: 2016-10-31

Open Access in DiVA

fulltext(91 kB)3 downloads
File information
File name FULLTEXT01.pdfFile size 91 kBChecksum SHA-512
c3267b68f7a2129de51cf26781532dba014d38f563cc2d794168e7dac8e4aca9a6fd1b193234a5513addbdf28bc7e5426630399fdc025aea00ba5e2bf75ae003
Type fulltextMimetype application/pdf

Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 3 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 2 hits
ReferencesLink to record
Permanent link

Direct link