Multi-agent Awareness, No-Retracing, Diagonal Movement for Q-Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Considering the lack of development for faster and more efficient solutions to the Grid World Q-Learning problems, it is no surprise that the performance of Q-Learning for Grid Worlds tends to exponentially reduce in effectiveness, as board size increases. This is a problem, because Q-Learning could (conceivably) solve analytic and mathematical problems, if runtime performance were enhanced. The motivation for this research, therefore, is to increase the runtime performance of Q-Learning for Grid Worlds by development of Multiple Awareness No-Retracing Agents for Q-Learning (or MANAQL). An account of conventional Q-Learning for Grid World environments is given and includes the anatomy of a Q-Learning class, its functions and their intended interactions; A detailed account of the methodology used to create MANAQL and how its functions and components differ from those of conventional Q-Learning is also given. In the Literature Review, several other Q-Learning algorithms that focus on multiple agents are narratively assessed and the question of why so little work has been done on improving Q-Learning for Grid World is asked and answered. The time complexity of the centred and non-centred conventional Q-Learning algorithms are evaluated and compared with each other and with those of the centred and non-centred MANAQL algorithms. Applications of the new algorithm will be the focus of later work.
Original languageEnglish
Title of host publication Advances in Computational Intelligence Systems
Subtitle of host publicationContributions Presented at the 23rd UK Workshop on Computational Intelligence (UKCI 2024), September 2-4, 2024, Ulster University, Belfast, UK
EditorsHuiru Zheng, David Glass, Maurice Mulvenna, Jun Liu, Hui Wang
PublisherSpringer Cham
Pages98-105
Number of pages7
ISBN (Electronic)978-3-031-78857-4
ISBN (Print)978-3-031-78856-7
DOIs
Publication statusPublished online - 8 Jan 2025

Bibliographical note

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG.

Fingerprint

Dive into the research topics of 'Multi-agent Awareness, No-Retracing, Diagonal Movement for Q-Learning'. Together they form a unique fingerprint.

Cite this