| dc.contributor.author | Bhagat, Dhruv | |
| dc.contributor.author | Mahalwal, Manish | |
| dc.contributor.author | Sujit, PB (Advisor) | |
| dc.date.accessioned | 2021-05-21T10:10:37Z | |
| dc.date.available | 2021-05-21T10:10:37Z | |
| dc.date.issued | 2020-05-30 | |
| dc.identifier.uri | http://repository.iiitd.edu.in/xmlui/handle/123456789/898 | |
| dc.description.abstract | In pursuit-evasion problems, we are presented with one or more pursuers attempting to capture one or more evaders. We consider the same problem in a bounded and convex physics-based environment. In our case, the agents are limited by obstacles, the field of view and a maximum speed limit in the 2D-plane. We propose a Voronoi partition, distance-based cooperative pursuit in which the pursuers try to minimise the distance, the area of the Voronoi region possessed by the evader to trap it in a corner. The agents are trained using Proximal Policy Optimization (PPO) and curriculum learning. By Multi-Agent Asynchronous Training we attempt to train multiple models parallelly to improve the training process. We simulate this 3D- environment using Unity. | en_US |
| dc.language.iso | en_US | en_US |
| dc.publisher | IIIT-Delhi | en_US |
| dc.subject | Pursuit-Evasion, Multi-Agent Systems, Reinforcement Learning, PPO, Voronoi, Unity ml-agents | en_US |
| dc.title | Pursuit-evasion using reinforcement learning | en_US |
| dc.type | Other | en_US |