Digitala Vetenskapliga Arkivet

Time, technique and text: scoping review of temporal information extraction and categorisation in documents
Westin, Fereshta
University of Borås, Faculty of Librarianship, Information, Education and IT. ORCID iD: 0009-0003-6165-2184
2025 (English)
In: Journal of Documentation, ISSN 0022-0418, E-ISSN 1758-7379, Vol. 81, no 7, p. 135-156
Article in journal (Refereed), Published
Abstract [en]

Purpose:

This paper presents an investigation of the concept of “time as aboutness” in various texts, including news articles, social media posts and historical documents. The purpose of this paper is to analyse different forms of temporal information and map the techniques used to extract and categorise this information.

Design/methodology/approach:

A scoping review method was adopted to analyse the chosen literature set. This approach allowed for an overview of the different text document types, the techniques used and their temporal information.

Findings:

The findings reveal six temporal types of time-related data analysis: social events, socio-political events, news events, temporal expressions, historical events and time periods. Studies analysing social media, news articles, Wikipedia entries and historical documents provide insights into event detection and categorisation. In these documents, time appears as sequences of events, temporal expressions or distinct periods. In news articles, time appears as a series of occurrences, while temporal expressions reveal how time is linguistically articulated and perceived. The analysis also covers event categorisation methods, emphasising machine learning techniques, natural language processing, large language models and rule-based systems.

Originality/value:

The analysis of different types of time and of methods for extracting temporal information from various texts contributes original insights to the understanding of temporal information. The findings reveal a need to expand document variety, particularly to include fiction literature, and point to the potential use of language models for future temporal information categorisation.

Place, publisher, year, edition, pages
Emerald Group Publishing Limited, 2025. Vol. 81, no 7, p. 135-156
Keywords [en]
Automated methods, Classification, Categorisation, Event, Temporal information, Time as aboutness
National Category
Information Studies
Research subject
Library and Information Science
Identifiers
URN: urn:nbn:se:hb:diva-33459
DOI: 10.1108/jd-11-2024-0267
Scopus ID: 2-s2.0-105001519323
OAI: oai:DiVA.org:hb-33459
DiVA, id: diva2:1953195
Available from: 2025-04-17 Created: 2025-04-17 Last updated: 2025-04-28 Bibliographically approved

Open Access in DiVA

fulltext (1763 kB), 15 downloads
File information
File name: FULLTEXT01.pdf
File size: 1763 kB
Checksum (SHA-512): a4503e22559ed425abb17462cea7309c98165e1a4c6ca54b96df21995533d743b1f3cda8176b1d20077772a5bcbf3a13e2fe79493d3cc363725349c84f252066
Type: fulltext
Mimetype: application/pdf

Other links

Publisher's full text
Scopus

Search in DiVA

By author/editor
Westin, Fereshta
