Change search
ReferencesLink to record
Permanent link

Direct link
Utilizing state-of-art NeuroES and GPGPU to optimize Mario AI
Blekinge Institute of Technology, Faculty of Computing, Department of Creative Technologies.
2014 (English)Student thesis
Abstract [en]

Context. Reinforcement Learning (RL) is a time consuming effort that requires a lot of computational power as well. There are mainly two approaches to improving RL efficiency, the theoretical mathematics and algorithmic approach or the practical implementation approach. In this study, the approaches are combined in an attempt to reduce time consumption.\newline Objectives. We investigate whether modern hardware and software, GPGPU, combined with state-of-art Evolution Strategies, CMA-Neuro-ES, can potentially increase the efficiency of solving RL problems.\newline Methods. In order to do this, both an implementational as well as an experimental research method is used. The implementational research mainly involves developing and setting up an experimental framework in which to measure efficiency through benchmarking. In this framework, the GPGPU/ES solution is later developed. Using this framework, experiments are conducted on a conventional sequential solution as well as our own parallel GPGPU solution.\newline Results. The results indicate that utilizing GPGPU and state-of-art ES when attempting to solve RL problems can be more efficient in terms of time consumption in comparison to a conventional and sequential CPU approach.\newline Conclusions. We conclude that our proposed solution requires additional work and research but that it shows promise already in this initial study. As the study is focused on primarily generating benchmark performance data from the experiments, the study lacks data on RL efficiency and thus motivation for using our approach. However we do conclude that the GPGPU approach suggested does allow less time consuming RL problem solving.

Place, publisher, year, edition, pages
2014. , 62 p.
Keyword [en]
Reinforcement Learning, Evolution Strategies, GPGPU, Artifical Neural Networks
National Category
Computer Science
URN: urn:nbn:se:bth-4386Local ID: diva2:831724
Educational program
PAACI Master of Science in Game and Software Engineering
Available from: 2015-04-22 Created: 2014-06-27 Last updated: 2016-02-24Bibliographically approved

Open Access in DiVA

fulltext(1605 kB)64 downloads
File information
File name FULLTEXT01.pdfFile size 1605 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Department of Creative Technologies
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 64 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 95 hits
ReferencesLink to record
Permanent link

Direct link