dc.contributor.author |
Bhagat, Dhruv |
|
dc.contributor.author |
Mahalwal, Manish |
|
dc.contributor.author |
Sujit, PB (Advisor) |
|
dc.date.accessioned |
2021-05-21T10:10:37Z |
|
dc.date.available |
2021-05-21T10:10:37Z |
|
dc.date.issued |
2020-05-30 |
|
dc.identifier.uri |
http://repository.iiitd.edu.in/xmlui/handle/123456789/898 |
|
dc.description.abstract |
In pursuit-evasion problems, we are presented with one or more pursuers attempting to capture
one or more evaders. We consider the same problem in a bounded and convex physics-based
environment. In our case, the agents are limited by obstacles, the field of view and a maximum
speed limit in the 2D-plane. We propose a Voronoi partition, distance-based cooperative pursuit
in which the pursuers try to minimise the distance, the area of the Voronoi region possessed by
the evader to trap it in a corner. The agents are trained using Proximal Policy Optimization
(PPO) and curriculum learning. By Multi-Agent Asynchronous Training we attempt to train
multiple models parallelly to improve the training process. We simulate this 3D- environment
using Unity. |
en_US |
dc.language.iso |
en_US |
en_US |
dc.publisher |
IIIT-Delhi |
en_US |
dc.subject |
Pursuit-Evasion, Multi-Agent Systems, Reinforcement Learning, PPO, Voronoi, Unity ml-agents |
en_US |
dc.title |
Pursuit-evasion using reinforcement learning |
en_US |
dc.type |
Other |
en_US |