Change search
ReferencesLink to record
Permanent link

Direct link
When is the Structural Context Effective?
Norwegian University of Science and Technology, Faculty of Information Technology, Mathematics and Electrical Engineering, Department of Computer and Information Science.
Tampereen yliopisto - University of Tampere.
2013 (English)In: CEUR Workshop Proceedings, ISSN 1613-0073Article in journal (Refereed) Published
Place, publisher, year, edition, pages
URN: urn:nbn:no:ntnu:diva-23217OAI: diva2:659158
Available from: 2013-10-24 Created: 2013-10-24 Last updated: 2014-04-23Bibliographically approved
In thesis
1. The Contextual Features in Schema-Agnostic Environment
Open this publication in new window or tab >>The Contextual Features in Schema-Agnostic Environment
2014 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Relevance scoring and estimation deals with both finding the relevant set of answers and ordering them according to the degree of their relevance to the user-intent. The traditional information retrieval (IR) systems successfully find and order the relevant documents and leave them to the users, who then have to locate the relevant information embedded somewhere within the document. In contrast, estimating relevance in semi-structured retrieval means not only retrieving and ordering the relevant documents but also locating the relevant information within the document as well. When it comes to semi-structured retrieval, the traditional IR style retrieval is simply insufficient. The main focus of this thesis is estimating relevance in a schema-agnostic environment. Here, “schema-agnostic” means that the schema or the structure exists explicitly within the documents but the user does not or need not know that schema. In such an environment, the structure is generally defined loosely, which means: (a) it can evolve over time, (b) it can constitute a large part of the data, and (c) it might exist seamlessly within the document. The natural question that comes into mind is, why is such a structure there at all? The structure in a schemaagnostic environment is there to be used by retrieval systems for several useful tasks. This thesis is about unveiling the capabilities of the structural constructs within semi-structured documents in schema-agnostic settings. Structural constructs can form what we call the structural context of the relevant item. A structural context builds up the internal and external contextual features of a semi-structured document. These contextual features help with a series of tasks. The work presented in this thesis contributes towards understanding and utilizing the contextual features in the retrieval of focused information in schema-agnostic settings. During the course of this study we have identified, implemented and experimented with several intuitive types of contextual features in semi-structured retrieval settings. Contextualization is the generic process of utilizing features in the structural context of the retrievable units in relevance scoring. The proposed retrieval approaches, based mainly on contextual features, exhibited notable improvements in retrieval effectiveness, during empirical analyses. The evaluations and empirical analyses are performed in several tasks, spread across different phases of this study. The tasks are performed by looking at different aspects and challenges of the semi-structured retrieval domain. The following tasks are performed at different phases of this study: ad-hoc tasks, granulation tasks, and standard tasks offered by INitiative for the Evaluation of Xml retrieval (INEX). The contributions of this thesis are also grouped by these tasks.

Place, publisher, year, edition, pages
NTNU: , 2014
Doctoral theses at NTNU, ISSN 1503-8181 ; 2014:98
National Category
Information and communication science
urn:nbn:no:ntnu:diva-24361 (URN)978-82-326-0120-2 (printed ver.) (ISBN)978-82-326-0121-9 (electronic ver.) (ISBN)
Public defence
2014-04-04, 14:15
Available from: 2014-04-23 Created: 2014-03-25 Last updated: 2014-05-07Bibliographically approved

Open Access in DiVA

fulltekst(346 kB)14 downloads
File information
File name FULLTEXT01.pdfFile size 346 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

CEUR Workshop Proceedings
By organisation
Department of Computer and Information Science
In the same journal
CEUR Workshop Proceedings

Search outside of DiVA

GoogleGoogle Scholar
Total: 14 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 32 hits
ReferencesLink to record
Permanent link

Direct link