Filter-based reinforcement learning for adaptive optimal control of continuous-time dynamical systems

Kaur, Avneet; Bharti, Moh; Roy, Sayan Basu (Advisor); Banerjee, Shilpak (Advisor)

dc.contributor.author	Kaur, Avneet
dc.contributor.author	Bharti, Moh
dc.contributor.author	Roy, Sayan Basu (Advisor)
dc.contributor.author	Banerjee, Shilpak (Advisor)
dc.date.accessioned	2021-05-21T16:07:55Z
dc.date.available	2021-05-21T16:07:55Z
dc.date.issued	2020-06-02
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/906
dc.description.abstract	The Report describes the Linear Quadratic Regulator(LQR) and the study of the linear systems on different types of systems i.e single agent and multi-agent systems.Different approaches are available for creating an optimal controller for the Linear system where the dynamics of the systems are unknown. One such approach is approximate/adaptive dynamic programming approach(ADP). The algorithm have been implemented for both single and multi-agent system. The algorithm have two level of working one is off-policy data storage and rank checking and the second level is on-policy online reinforcement iterative approach.This paper follows a similar techniques as defined in [1] using dual layer filtering to achieve the optimal control values. In the first layer we are eliminating the state derivative and in the second layer we are iterating under the initial excitation technique to get the optimal values for our dynamical system. The Initial Excitation described in this paper method solve the problem of data storage in the Adaptive Optimal Control(AOC) method and uses a two-level filter approach. The algorithm is designed for single agent system and it was implemented on multi-agent system.	en_US
dc.language.iso	en_US	en_US
dc.publisher	IIIT-Delhi	en_US
dc.subject	LQR(Linear Quadratic Regulator),Optimal, adaptive optimal control,filter-based,initial excitation,filtering, Algebraic Riccati Equation,	en_US
dc.title	Filter-based reinforcement learning for adaptive optimal control of continuous-time dynamical systems	en_US
dc.type	Other	en_US