Thakral, Shubham; Anand, Saket (Advisor); Kaul, Sanjit Krishnan (Advisor)
(IIIT- Delhi, 2021-06)
Most of the Reinforcement Learning(RL) tasks in today's world involve high dimensional data. Training deep reinforcement learning based models is already a very challenging task and varies lot with the choice of hyper ...