Cache-Efficient Dynamic Programming MDP Solver

samedi, 30 sept. 2023·

Jaël Champagne Gareau

Guillaume Gosset

Éric Beaudry

Vladimir Makarenkov

PDF Code Projet Poster Diapositives DOI Supplementary Material

Résumé

Automated planning research often focuses on developing new algorithms to improve the computational performance of planners, but effective implementation can also play a significant role. Hardware features such as memory hierarchy can yield substantial running time improvements when optimized. In this paper, we propose two state-reordering techniques for the Topological Value Iteration (TVI) algorithm. Our first technique organizes states in memory so that those belonging to the same Strongly Connected Component (SCC) are contiguous, while our second technique optimizes state value propagation by reordering states within each SCC. We analyze existing planning algorithms with respect to their cache efficiency and describe domain characteristics which can provide an advantage to each of them. Empirical results show that, in many instances, our new algorithms, called eTVI and eiTVI, run several times faster than traditional VI, TVI, LRTDP and ILAO* techniques.

Type

Article de conférence

Publication

Proceedings of the 26th European Conference on Artificial Intelligence

Dernière mise à jour le samedi, 16 avr. 2022

MDP Cache Memory Algorithms

Auteurs

Jaël Champagne Gareau

Chercheur postdoctoral en informatique

Je suis actuellement chercheur postdoctoral en informatique à l’Université TÉLUQ, où mes travaux portent sur l’accélération de la conversion de nombres entiers et flottants en chaînes de caractères décimales. Au cours de mon doctorat, j’ai conçu des algorithmes et des structures de données exploitant l’architecture moderne des ordinateurs afin de résoudre de grandes instances de processus décisionnels de Markov (MDP). Durant ma maîtrise, j’ai développé des algorithmes de planification d’itinéraires pour véhicules électriques, visant à déterminer le chemin optimal entre deux points tout en minimisant le temps total du trajet (déplacement, recharge et attente aux bornes).

Auteurs

Guillaume Gosset

Auteurs

Éric Beaudry

Auteurs

Vladimir Makarenkov

← Cooperative Electric Vehicles Planning jeudi, 09 mai 2024

Fast and optimal branch-and-bound planner for the grid-based coverage path planning problem based on an admissible heuristic function vendredi, 27 janv. 2023 →