Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Comparative Review of SMOTE and ADASYN in Imbalanced Data Classification
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Statistics.
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Statistics.
2021 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

In this thesis, the performance of two over-sampling techniques, SMOTE and ADASYN, is compared. The comparison is done on three imbalanced data sets using three different classification models and evaluation metrics, while varying the way the data is pre-processed. The results show that both SMOTE and ADASYN improve the performance of the classifiers in most cases. It is also found that SVM in conjunction with SMOTE performs better than with ADASYN as the degree of class imbalance increases. Furthermore, both SMOTE and ADASYN increase the relative performance of the Random forest as the degree of class imbalance grows. However, no pre-processing method consistently outperforms the other in its contribution to better performance as the degree of class imbalance varies.

Place, publisher, year, edition, pages
2021. , p. 42
Keywords [en]
Machine learning, supervised learning, classification, class imbalance, over-sampling, SMOTE, ADASYN, Sensitivity, F-measure, Matthews correlation coefficient
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:uu:diva-432162OAI: oai:DiVA.org:uu-432162DiVA, id: diva2:1519153
Subject / course
Statistics
Educational program
Bachelor Programme in Business and Economics
Supervisors
Examiners
Available from: 2021-01-26 Created: 2021-01-18 Last updated: 2021-01-26Bibliographically approved

Open Access in DiVA

fulltext(2163 kB)15096 downloads
File information
File name FULLTEXT01.pdfFile size 2163 kBChecksum SHA-512
364b5ae5e216a945a143c4615220cea42a29b9b49082796fe147f6cf575bd0482dfeccb3418d130125984a520a6b7c40aeda82bd471c559fc0aeb914d5764fe3
Type fulltextMimetype application/pdf

By organisation
Department of Statistics
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 15100 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 14076 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf