Pursuit-evasion using reinforcement learning

Bhagat, Dhruv; Mahalwal, Manish; Sujit, PB (Advisor)

dc.contributor.author	Bhagat, Dhruv
dc.contributor.author	Mahalwal, Manish
dc.contributor.author	Sujit, PB (Advisor)
dc.date.accessioned	2021-05-21T10:10:37Z
dc.date.available	2021-05-21T10:10:37Z
dc.date.issued	2020-05-30
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/898
dc.description.abstract	In pursuit-evasion problems, we are presented with one or more pursuers attempting to capture one or more evaders. We consider the same problem in a bounded and convex physics-based environment. In our case, the agents are limited by obstacles, the field of view and a maximum speed limit in the 2D-plane. We propose a Voronoi partition, distance-based cooperative pursuit in which the pursuers try to minimise the distance, the area of the Voronoi region possessed by the evader to trap it in a corner. The agents are trained using Proximal Policy Optimization (PPO) and curriculum learning. By Multi-Agent Asynchronous Training we attempt to train multiple models parallelly to improve the training process. We simulate this 3D- environment using Unity.	en_US
dc.language.iso	en_US	en_US
dc.publisher	IIIT-Delhi	en_US
dc.subject	Pursuit-Evasion, Multi-Agent Systems, Reinforcement Learning, PPO, Voronoi, Unity ml-agents	en_US
dc.title	Pursuit-evasion using reinforcement learning	en_US
dc.type	Other	en_US