Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Hierarchical reinforcement learning in network routing optimization
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Exact and reinforcement learning approaches have shown clear limitations when attempting to solve the vehicle routing problem (VRP) and its variants, while no research study until now has applied a hierarchical reinforcement learning (HRL) approach on the VRP with stochastic service request (VRPSSR). This thesis proposes a novel HRL framework that optimizes the network routing for the VRPSSR. The proposed framework trains upper and lower level tabular policies that collaborate to optimize the network routing. The framework is assessed in a series

of experiments based on three key metrics: output quality, success rate, and CPU time. In the experiments, multiple scenarios are defined using different levels of environment complexity by adjusting the number of nework customers and the probability of new customer requests. Using optimization benchmarking, the results of the HRL framework are compared against two benchmark algorithms: dynamic programming (DP) and traditional reinforcement learning (RL). The analysis of the results shows that HRL outperformed RL in all three metrics among nearly all scenarios of complexity. The HRL framework also closely approximated the DP output quality while obtaining significantly shorter CPU times. Although bound to limitations related to computational power and resources, the framework demonstrates highly promising results and has the potential to contribute into the area of real world combinatorial problems where the complexity of the environment increases singificantly.

Place, publisher, year, edition, pages
2024.
Keywords [en]
Hierarchical reinforcement learning, vehicle routing problem, combinatorial optimization, network routing optimization, stochastic service requests
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:su:diva-242734OAI: oai:DiVA.org:su-242734DiVA, id: diva2:1955666
Available from: 2025-04-30 Created: 2025-04-30

Open Access in DiVA

fulltext(1756 kB)17 downloads
File information
File name FULLTEXT01.pdfFile size 1756 kBChecksum SHA-512
37cf7caefad560b17369236957e2ae0dfd5e170ecca342ddaffc7bd6da39d58096b5ad4afdb723dbb8ea184110b3d68596fcc66ba79d29e1265d279028ceec53
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Karampatzakis, Nikolaos
By organisation
Department of Computer and Systems Sciences
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 17 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 21 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf