Data mining file sharing metadata: A comparison between Random Forests Classification and Bayesian Networks
University of Skövde, School of Informatics.
2015 (English). Independent thesis, Basic level (degree of Bachelor), 15 credits / 22.5 HE credits. Student thesis.
Abstract [en]

This comparative, experiment-based study demonstrates that the two evaluated machine learning techniques, Bayesian networks and random forests, have similar predictive power when classifying torrents on BitTorrent file sharing networks.

This work was performed in two steps. First, a literature analysis was performed to gain insight into how the two techniques work and what types of attacks exist against BitTorrent file sharing networks. After the literature analysis, an experiment was performed to evaluate the accuracy of the two techniques.

The results show no significant advantage of one algorithm over the other when only accuracy is considered. However, random forests are easier to use, because the technique requires little pre-processing of the data and still generates accurate results with few false positives.

Place, publisher, year, edition, pages
2015. 43 p.
Keyword [en]
machine learning, random forests, bayesian network, bittorrent, file sharing
National Category
Computer Science
URN: urn:nbn:se:his:diva-11180
OAI: diva2:823863
Subject / course
Computer Science
Educational program
Computer Science - Specialization in Systems Development
Available from: 2015-09-04
Created: 2015-06-18
Last updated: 2015-09-04
Bibliographically approved

Open Access in DiVA

Full text (1558 kB), 29 downloads
File information
File name: FULLTEXT01.pdf
File size: 1558 kB
Checksum: SHA-512
Type: fulltext
Mimetype: application/pdf

Attachment (2415 kB), 3 downloads
File information
File name: ATTACHMENT01.zip
File size: 2415 kB
Checksum: SHA-512
Type: attachment
Mimetype: application/zip
