Change search
ReferencesLink to record
Permanent link

Direct link
Automated classification of Web pages in hierarchical browsing
University of Bath. (Library and Information Science)ORCID iD: 0000-0003-4169-4777
2009 (English)In: Journal of Documentation, ISSN 0022-0418, E-ISSN 1758-7379, Vol. 6, no 65, 901-925 p.Article in journal (Refereed) Published
Abstract [en]

Purpose - The purpose of this study is twofold: to investigate whether it is meaningful to use the Engineering Index (Ei) classification scheme for browsing, and then, if proven useful, to investigate the performance of an automated classificationalgorithm based on the Ei classification scheme.

Design/methodology/approach - A user study was conducted in which users solved four controlled searching tasks. The users browsed the Ei classification scheme in order to examine the suitability of the classification systems for browsing. The classification algorithm was evaluated by the users who judged the correctness of the automatically assigned classes.

Findings - The study showed that the Ei classification scheme is suited for browsing. Automatically assigned classes were on average partly correct, with some classes working better than others. Success of browsing showed to be correlated and dependent on classification correctness.

Research limitations/implications - Further research should address problems of disparate evaluations of one and the same web page. Additional reasons behind browsing failures in the Ei classification scheme also need further investigation.

Practical implications - Improvements for browsing were identified: describing class captions and/or listing their subclasses from start; allowing for searching for words from class captions with synonym search (easily provided for Ei since the classes are mapped to thesauri terms); when searching for class captions, returning the hierarchical tree expanded around the class in which caption the search term is found. The need for improvements of classification schemes was also indicated.

Originality/value - A User-based evaluation of automated subject classification in the context of browsing has not been conducted before; hence the study also presents new findings concerning methodology.

Place, publisher, year, edition, pages
Emerald Group Publishing Limited, 2009. Vol. 6, no 65, 901-925 p.
Keyword [en]
Classification, Hierarchical control, Strategic evaluation, User studies
National Category
Information Studies
Research subject
Humanities, Library and Information Science
URN: urn:nbn:se:lnu:diva-37058DOI: 10.1108/00220410910998915OAI: diva2:747716
Available from: 2014-09-17 Created: 2014-09-17 Last updated: 2015-09-30Bibliographically approved

Open Access in DiVA

Fulltext(654 kB)83 downloads
File information
File name FULLTEXT01.pdfFile size 654 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Golub, Koraljka
In the same journal
Journal of Documentation
Information Studies

Search outside of DiVA

GoogleGoogle Scholar
Total: 83 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 32 hits
ReferencesLink to record
Permanent link

Direct link