Genetic network programming with reinforcement learning and optimal search component : a thesis presented in partial fulfilment of the requirements for the degree of Master of Science in Computer Sciences at Massey University, Auckland, New Zealand

dc.contributor.authorAlshehri, Mona Abdulrahman M
dc.date.accessioned2020-09-04T01:54:08Z
dc.date.available2020-09-04T01:54:08Z
dc.date.issued2019
dc.description.abstractThis thesis presents ways of improving the genetic composition, structure and learning strategies for a graph-based evolutionary algorithm, called Genetic Networking Programming with Reinforcement Learning (GNP-RL), particularly when working with multi-agent and dynamic environments. GNP-RL is an improvement over Genetic Programming, allowing for the concise representation of solutions in terms of a networked graph structure and uses RL to further refine the graph solutions. This work has improved GNP-RL by combining three new techniques: Firstly, it has added a reward and punishment scheme as part of its learning strategy that supports constraint conformance, allowing for a more adaptive training of the agent, so that it can learn how to avoid unwanted situations more effectively. Secondly, an optimal search algorithm has been combined in the GNP-RL core to get an accurate analysis of the exploratory environment. Thirdly, a task prioritization technique has been added to the agent’s learning by giving promotional rewards, so they are trained on how to take priority into account when performing tasks. In this thesis, we applied the improved algorithm to the Tile World benchmarking testbed, which is considered as one of the standard complex problems in this domain, having only a sparse training set. Our experiment results show that the proposed algorithm is superior than the best existing variant of the GNP-RL algorithm [1]. We have achieved 86.66% test accuracy on the standard benchmarking dataset [2]. In addition, we have created another benchmarking dataset, similar in complexity to the one proposed in [1], to test the proposed algorithms further, where it achieved a test accuracy of 96.66%; that is 33.66% more accurate.en_US
dc.identifier.urihttp://hdl.handle.net/10179/15591
dc.language.isoenen_US
dc.publisherMassey Universityen_US
dc.subjectGenetic algorithmsen_US
dc.subjectEvolutionary programming (Computer science)en_US
dc.subjectGraph algorithmsen_US
dc.subjectQuerying (Computer science)en_US
dc.subjectArtificial intelligenceen_US
dc.subject.anzsrc460203 Evolutionary computationen
dc.titleGenetic network programming with reinforcement learning and optimal search component : a thesis presented in partial fulfilment of the requirements for the degree of Master of Science in Computer Sciences at Massey University, Auckland, New Zealanden_US
dc.typeThesisen_US
massey.contributor.authorAlshehri, Mona Abdulrahman M
thesis.degree.disciplineComputer Sciencesen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Science (MSc)en_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
AlshehriMScThesis.pdf
Size:
3.62 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.32 KB
Format:
Item-specific license agreed upon to submission
Description: