IIIT-Delhi Institutional Repository

Filter-based reinforcement learning for adaptive optimal control of continuous-time dynamical systems

Show simple item record

dc.contributor.author Kaur, Avneet
dc.contributor.author Bharti, Moh
dc.contributor.author Roy, Sayan Basu (Advisor)
dc.contributor.author Banerjee, Shilpak (Advisor)
dc.date.accessioned 2021-05-21T16:07:55Z
dc.date.available 2021-05-21T16:07:55Z
dc.date.issued 2020-06-02
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/906
dc.description.abstract The Report describes the Linear Quadratic Regulator(LQR) and the study of the linear systems on different types of systems i.e single agent and multi-agent systems.Different approaches are available for creating an optimal controller for the Linear system where the dynamics of the systems are unknown. One such approach is approximate/adaptive dynamic programming approach(ADP). The algorithm have been implemented for both single and multi-agent system. The algorithm have two level of working one is off-policy data storage and rank checking and the second level is on-policy online reinforcement iterative approach.This paper follows a similar techniques as defined in [1] using dual layer filtering to achieve the optimal control values. In the first layer we are eliminating the state derivative and in the second layer we are iterating under the initial excitation technique to get the optimal values for our dynamical system. The Initial Excitation described in this paper method solve the problem of data storage in the Adaptive Optimal Control(AOC) method and uses a two-level filter approach. The algorithm is designed for single agent system and it was implemented on multi-agent system. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject LQR(Linear Quadratic Regulator),Optimal, adaptive optimal control,filter-based,initial excitation,filtering, Algebraic Riccati Equation, en_US
dc.title Filter-based reinforcement learning for adaptive optimal control of continuous-time dynamical systems en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account