Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Obtaining the Pareto Optimal Front for Multiobjective Reinforcement Learning Based Wireless Routing
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
2024 (English)Independent thesis Advanced level (degree of Master (One Year)), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

With the increasing complexity of wireless networks, the task of optimizing routing protocols for efficient data transmission becomes more and more challenging. Traditional algorithms frequently struggle to find a balance between multiple conflicting objectives, resulting in a suboptimal solution. This thesis addresses the problem of achieving the Pareto Optimal Front in Multiobjective Reinforcement Learning (MORL)-based wireless routing. The primary question addressed here focuses on how the Pareto Optimal Front can be constructed for MORL-based wireless routing that optimizes both packet delivery rate (PDR) and delay concurrently. We proposed a new framework for MORL-based wireless routing that uses a custom simulation environment named “Wireless Route Lake,” inspired by the Frozen Lake environment. This environment tries to create a realistic simulation of dynamic wireless network conditions, enabling the agent to learn and adjust its routing strategies. The suggested routing algorithm was implemented, tested, and improved iteratively using the Design Science Approach. An in-depth examination of exploration rates, Q-values, and performance metrics reveals the strength of the MORL framework in effectively managing conflicting objectives. The simulations’ results show that the proposed algorithm was able to build the Pareto Optimal Front with good performance, achieving an 85% PDR while effectively handling delays. This thesis successfully develops a robust MORL-based routing algorithm that balances PDR and delay, obtaining the Pareto Optimal Front in wireless networks.

Place, publisher, year, edition, pages
2024.
Keywords [en]
Multiobjective Reinforcement Learning, Pareto Optimal Front, Wireless Routing, Reinforcement Learning, Multiobjective Optimization, Packet Delivery Rate, Delay.
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:su:diva-242781OAI: oai:DiVA.org:su-242781DiVA, id: diva2:1955713
Available from: 2025-04-30 Created: 2025-04-30

Open Access in DiVA

fulltext(1050 kB)7 downloads
File information
File name FULLTEXT01.pdfFile size 1050 kBChecksum SHA-512
cee3a460f7f58a45022cf934f7c29b64c91f21f74c3fb4c3a8ac13e21f085e17bcff7c07a956976be967bfcd4815896a2d9c46f777635cee5a4ede8d0298bf20
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Gupta, Preet Devendra
By organisation
Department of Computer and Systems Sciences
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 7 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 32 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf