Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Text-Based Information Retrieval Using Relevance Feedback
KTH, School of Information and Communication Technology (ICT).
2011 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Europeana, a freely accessible digital library with an idea to make Europe's cultural and scientific heritage available to the public was founded by the European Commission in 2008. The goal was to deliver a semantically enriched digital content with multilingual access to it. Even though they managed to increase the content of data they slowly faced the problem of retrieving information in an unstructured form. So to complement the Europeana portal services, ASSETS (Advanced Search Service and Enhanced Technological Solutions) was introduced with services that sought to improve the usability and accessibility of Europeana.

My contribution is to study different text-based information retrieval models, their relevance feedback techniques and to implement one simple model. The thesis explains a detailed overview of the information retrieval process along with the implementation of the chosen strategy for relevance feedback that generates automatic query expansion. Finally, the thesis concludes with the analysis made using relevance feedback, discussion on the model implemented and then an assessment on future use of this model both as a continuation of my work and using this model in ASSETS.

Place, publisher, year, edition, pages
2011. , 51 p.
Series
Trita-ICT-EX, 281
Keyword [en]
Information Retrieval, Relevance Feedback, Query Expansion, Rocchio classification, Probabilistic model, Lucene, Similarity scoring function, Kullback-Leibler Divergence (KLD)
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:kth:diva-53603OAI: oai:DiVA.org:kth-53603DiVA: diva2:470416
Subject / course
Information and Software Systems
Educational program
Master of Science - Software Engineering of Distributed Systems
Uppsok
Technology
Examiners
Available from: 2011-12-29 Created: 2011-12-29 Last updated: 2011-12-29Bibliographically approved

Open Access in DiVA

fulltext(917 kB)570 downloads
File information
File name FULLTEXT01.pdfFile size 917 kBChecksum SHA-512
1c0d73ad20ae0395552efd30a0effcd0fc70186d0986b6c33f88384c0eb1968fd16f183be99d32ae0cb4ec0d2373fba2bcce5555565a16905041548fd54c39f0
Type fulltextMimetype application/pdf

By organisation
School of Information and Communication Technology (ICT)
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 570 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 233 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf