Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Benchmarking of Data Mining Techniques as Applied to Power System Analysis
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology.
2013 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

The field of electric power systems is currently facing explosive growth in the amount of data. Since extracting useful information from this enormous amount of data is highly complex, costly, and time consuming, data mining can play a key role. In particular, the standard data mining algorithms for the analysis of huge data volumes can be parallelized for faster processing. This thesis focuses on benchmarking of parallel processing platforms; it employs data parallelization using Apache Hadoop cluster (MapReduce paradigm) and shared-memory parallelization using multi-cores on a single machine. As a starting point, we conduct real-time experiments in order to evaluate the efficacy of these two parallel processing platforms in terms of performance, resource usage (Memory), efficiency (including speed-up), accuracy, and scalability. The end result shows that the data mining methods can indeed be implemented as efficient parallel processes, and can be used to obtain useful resultsf rom huge amount of data in a case study scenario. Overall, we establish that parallelization using Apache Hadoop cluster is a promising model for scalable performance compared with the alternative suitable parallelization using multi-cores on a single machine

Place, publisher, year, edition, pages
2013.
Series
IT, 13 061
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:uu:diva-207625OAI: oai:DiVA.org:uu-207625DiVA: diva2:648950
Educational program
Freestanding course
Supervisors
Examiners
Available from: 2013-09-17 Created: 2013-09-17 Last updated: 2013-12-02Bibliographically approved

Open Access in DiVA

fulltext(2887 kB)1266 downloads
File information
File name FULLTEXT01.pdfFile size 2887 kBChecksum SHA-512
999c8f426a08d86aa0a917e4df9933bc417bcc3dddecf3f0f20eef2e818cb58d691042b46fc56ccb30de02db1c9c37276058d922d77a102155ef2c52235275bc
Type fulltextMimetype application/pdf

By organisation
Department of Information Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 1266 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 908 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf