Change search
ReferencesLink to record
Permanent link

Direct link
Benchmarking of Data Mining Techniques as Applied to Power System Analysis
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology.
2013 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

The field of electric power systems is currently facing explosive growth in the amount of data. Since extracting useful information from this enormous amount of data is highly complex, costly, and time consuming, data mining can play a key role. In particular, the standard data mining algorithms for the analysis of huge data volumes can be parallelized for faster processing. This thesis focuses on benchmarking of parallel processing platforms; it employs data parallelization using Apache Hadoop cluster (MapReduce paradigm) and shared-memory parallelization using multi-cores on a single machine. As a starting point, we conduct real-time experiments in order to evaluate the efficacy of these two parallel processing platforms in terms of performance, resource usage (Memory), efficiency (including speed-up), accuracy, and scalability. The end result shows that the data mining methods can indeed be implemented as efficient parallel processes, and can be used to obtain useful resultsf rom huge amount of data in a case study scenario. Overall, we establish that parallelization using Apache Hadoop cluster is a promising model for scalable performance compared with the alternative suitable parallelization using multi-cores on a single machine

Place, publisher, year, edition, pages
IT, 13 061
National Category
Engineering and Technology
URN: urn:nbn:se:uu:diva-207625OAI: diva2:648950
Educational program
Freestanding course
Available from: 2013-09-17 Created: 2013-09-17 Last updated: 2013-12-02Bibliographically approved

Open Access in DiVA

fulltext(2887 kB)1209 downloads
File information
File name FULLTEXT01.pdfFile size 2887 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Department of Information Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 1209 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 639 hits
ReferencesLink to record
Permanent link

Direct link