Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Utilizing Diversity and Performance Measures for Ensemble Creation
University of Borås, School of Business and IT.
2009 (English)Licentiate thesis, monograph (Other academic)
Abstract [en]

An ensemble is a composite model, aggregating multiple base models into one predictive model. An ensemble prediction, consequently, is a function of all included base models. Both theory and a wealth of empirical studies have established that ensembles are generally more accurate than single predictive models. The main motivation for using ensembles is the fact that combining several models will eliminate uncorrelated base classifier errors. This reasoning, however, requires the base classifiers to commit their errors on different instances – clearly there is no point in combining identical models. Informally, the key term diversity means that the base classifiers commit their errors independently of each other. The problem addressed in this thesis is how to maximize ensemble performance by analyzing how diversity can be utilized when creating ensembles. A series of studies, addressing different facets of the question, is presented. The results show that ensemble accuracy and the diversity measure difficulty are the two individually best measures to use as optimization criterion when selecting ensemble members. However, the results further suggest that combinations of several measures are most often better as optimization criteria than single measures. A novel method to find a useful combination of measures is proposed in the end. Furthermore, the results show that it is very difficult to estimate predictive performance on unseen data based on results achieved with available data. Finally, it is also shown that implicit diversity achieved by varied ANN architecture or by using resampling of features is beneficial for ensemble performance.

Place, publisher, year, edition, pages
Örebro universitet , 2009.
Keyword [en]
ensemble learning, machine learning, diversity, artificial neural networks, information fusion, Computer Science
Keyword [sv]
data mining
National Category
Computer and Information Science Information Systems
Identifiers
URN: urn:nbn:se:hb:diva-3509Local ID: 2320/4976OAI: oai:DiVA.org:hb-3509DiVA: diva2:876899
Funder
Knowledge Foundation, 2003/0104
Note

Sponsorship:

This work was supported by the Information Fusion Research Program (www.infofusion.se) at the University of Skövde, Sweden, in partnership with the Swedish Knowledge Foundation under grant 2003/0104.

Available from: 2015-12-04 Created: 2015-12-04 Last updated: 2016-08-19Bibliographically approved

Open Access in DiVA

fulltext(1235 kB)138 downloads
File information
File name FULLTEXT01.pdfFile size 1235 kBChecksum SHA-512
d9f5b6aeca1b2eb8176503517decc6b22e3247eeff4f98cb13f6e0e28a5b85def7f24319427fb8d818103242c252c1fbff0a62255f8e8967d78b445699dc764f
Type fulltextMimetype application/pdf
presentation(429 kB)87 downloads
File information
File name FULLTEXT02.pdfFile size 429 kBChecksum SHA-512
b20f46458bb5582c06776499e201d51ad5a40ad75aeab944e1fdba0c822928aff72d7dd01985faca6f9020979168a14b71b7c54591ee2e4206d0e3c6813b3ebb
Type attachmentMimetype application/pdf

Search in DiVA

By author/editor
Löfström, Tuve
By organisation
School of Business and IT
Computer and Information ScienceInformation Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 225 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 102 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf