Change search
ReferencesLink to record
Permanent link

Direct link
Webometrics benefitting from web mining? An investigation of methods and applications of two research fields
University of Borås, Swedish School of Library and Information Science.ORCID iD: 0000-0003-0659-4754
2014 (English)In: Scientometrics, ISSN 0138-9130, E-ISSN 1588-2861, Vol. 99, no 2Article in journal (Refereed) Published
Abstract [en]

This is a cross-field literature review and comparison of the fields webometrics (cybermetrics) and web (data) mining.

Place, publisher, year, edition, pages
Akademiai Kiado Rt. , 2014. Vol. 99, no 2
Keyword [en]
webometrics, web mining, cybermetrics, web data mining, literature review, interdisciplinary studies
National Category
Computer and Information Science Social Sciences Interdisciplinary Information Studies
Research subject
Library and Information Science
URN: urn:nbn:se:hb:diva-1748DOI: 10.1007/s11192-013-1227-xISI: 000334277800011Local ID: 2320/13196OAI: diva2:869817
Available from: 2015-11-13 Created: 2015-11-13 Last updated: 2016-09-06Bibliographically approved
In thesis
1. Following Tweets Around: Informetric methodology for the Twittersphere
Open this publication in new window or tab >>Following Tweets Around: Informetric methodology for the Twittersphere
2016 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The purpose of this thesis is to critically discuss methods to collect and analyse data related to the interaction and content on the social platform Twitter. The thesis contains examples of how networked communication can be studied on Twitter, based on the affordances of the platform considering interaction with interfaces and other users. The foundational problem is that social science Twitter research has been based on easily accessible data without introducing or discussing criteria for collecting appropriate samples for a given research task.The thesis builds on one literature review and four studies of political Twitter communication. The analyses are based on a view of the Twitter platform as a non-neutral filtering gatekeeper. On the one hand, Twitter treats content and users asymmetrically, by emphasising the popular. On the other hand, Twitter determines what data are available and how data can be accessed through the API (application programming interface). How Twitter provides access to the data in turn affects the analyses the researcher does. The central problem of the thesis is that researchers do not know what relevant data are not collected. Data collection based on keywords, hashtags or users creates data sets that contain fragments of conversations. To solve the problem, a new method was developed. By combining the hashtag and user-based methods, replies to collected tweets were stored, regardless if they contained a tracked hashtag or not.The four studies this thesis builds on show a complexity of collecting and analysing Twitter data. A key finding is that conversations beyond the hashtag can be quite extensive. As a consequence of this, communication networks based on hashtagged replies were found to be potentially very different from networks based on replies from a more complete data set, where non-hashtagged replies are also included. A network based on hashtagged communication is thus misleading compared to a complete communication network.Apart from that it is not entirely trivial to identify the parameters to define what should be studied; tests of the API showed that complete data sets cannot be obtained. Therefore, it is important to reflect on both the data collected and the data excluded, not only as a result of the sampling criteria but also what is not given access to. It is also important to be clear about the affordances for interaction that exist when the study is made, both in the user interface but also what API allows and permits.This research contributes with knowledge about how Twitter is used in the context being studied, but the main contribution is methodological. With the method developed, collection of more complete data sets is enabled, as is analysis of the conversations that take place on the platform. This results in more accurate measurements of the activity. Based on the results of this thesis, there are reasons to suspect that previous studies could differ in terms of results such as communication network size and shape, as well as the type of users that emerges as prominent in the material, compared to if replies that do not contain the studied hashtag had been collected.

Place, publisher, year, edition, pages
Borås: Högskolan i Borås, 2016
Skrifter från Valfrid, ISSN 1103-6990 ; 61
National Category
Information Studies
Research subject
Library and Information Science
urn:nbn:se:hb:diva-9339 (URN)978-91-981653-0-2 (ISBN)978-91-981653-1-9 (pdf) (ISBN)
External cooperation:
Public defence
2016-10-03, C203, Allégatan 1, Borås, 13:00

Due to copyright, the articles included in this PhD thesis are not available in the digital version of the thesis. Find links to the published articles in the list of papers below.

The article: Lorentzen, D. G. (manuscript). Is it all about politics? A hashtag analysis of the activities of the Swedish political Twitter elite. Is not yet submitted to a journal and is only found in the printed version.

Available from: 2016-09-06 Created: 2016-03-09 Last updated: 2016-09-12Bibliographically approved

Open Access in DiVA

fulltext(550 kB)10 downloads
File information
File name FULLTEXT01.pdfFile size 550 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Gunnarsson Lorentzen, David
By organisation
Swedish School of Library and Information Science
In the same journal
Computer and Information ScienceSocial Sciences InterdisciplinaryInformation Studies

Search outside of DiVA

GoogleGoogle Scholar
Total: 10 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 80 hits
ReferencesLink to record
Permanent link

Direct link