Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Predicting movie ratings: A comparative study on random forests and support vector machines
University of Skövde, School of Informatics.
2015 (English)Independent thesis Basic level (degree of Bachelor), 15 credits / 22,5 HE creditsStudent thesis
Abstract [en]

The aim of this work is to evaluate the prediction performance of random forests in comparison to support vector machines, for predicting the numerical user ratings of a movie using pre-release attributes such as its cast, directors, budget and movie genres.

In order to answer this question an experiment was conducted on predicting the overall user rating of 3376 hollywood movies, using data from the well established movie database IMDb. The prediction performance of the two algorithms was assessed and compared over three commonly used performance and error metrics, as well as evaluated by the means of significance testing in order to further investigate whether or not any significant differences could be identified.

The results indicate some differences between the two algorithms, with consistently better performance from random forests in comparison to support vector machines over all of the performance metrics, as well as significantly better results for two out of three metrics. Although a slight difference has been indicated by the results one should also note that both algorithms show great similarities in terms of their prediction performance, making it hard to draw any general conclusions on which algorithm yield the most accurate movie predictions. 

Place, publisher, year, edition, pages
2015. , 33 p.
Keyword [en]
data mining, machine learning, regression, movie prediction, random forests, support vector machines

National Category
Computer Science
Identifiers
URN: urn:nbn:se:his:diva-11119OAI: oai:DiVA.org:his-11119DiVA: diva2:821533
Subject / course
Computer Science
Educational program
Computer Science - Specialization in Systems Development
Supervisors
Examiners
Available from: 2015-08-03 Created: 2015-06-15 Last updated: 2015-08-03Bibliographically approved

Open Access in DiVA

fulltext(1621 kB)3937 downloads
File information
File name FULLTEXT01.pdfFile size 1621 kBChecksum SHA-512
86bf93a628405f60a9c1bada584bb09ba0f15be23a1efa6884bce3959d3eb1abcd75393c72aa128103aa4ec97a9c0bd53fffb495a1c0ed2b25c3de281ec40d24
Type fulltextMimetype application/pdf

By organisation
School of Informatics
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 3937 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 688 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf